TY - GEN
T1 - Probabilistic joint feature selection for multi-task learning
AU - Xiong, Tao
AU - Bit, Jinbo
AU - Rao, Bharat
AU - Cherkassky, Vladimir S
PY - 2007
Y1 - 2007
N2 - We study the joint feature selection problem when learning multiple related classification or regression tasks. By imposing an automatic relevance determination prior on the hypothesis classes associated with each of the tasks and regularizing the variance of the hypothesis parameters, similar feature patterns across different tasks are encouraged and features that are relevant to all (or most) of the tasks are identified. Our analysis shows that the proposed probabilistic framework can be seen as a generalization of previous result from adaptive ridge regression to the multi-task learning setting. We provide a detailed description of the proposed algorithms for simultaneous model construction and justify the proposed algorithms in several aspects. Our experimental results show that this approach outperforms a regularized multi-task learning approach and the traditional methods where individual tasks are solved independently on synthetic data and the real-world data sets for lung cancer prognosis.
AB - We study the joint feature selection problem when learning multiple related classification or regression tasks. By imposing an automatic relevance determination prior on the hypothesis classes associated with each of the tasks and regularizing the variance of the hypothesis parameters, similar feature patterns across different tasks are encouraged and features that are relevant to all (or most) of the tasks are identified. Our analysis shows that the proposed probabilistic framework can be seen as a generalization of previous result from adaptive ridge regression to the multi-task learning setting. We provide a detailed description of the proposed algorithms for simultaneous model construction and justify the proposed algorithms in several aspects. Our experimental results show that this approach outperforms a regularized multi-task learning approach and the traditional methods where individual tasks are solved independently on synthetic data and the real-world data sets for lung cancer prognosis.
UR - http://www.scopus.com/inward/record.url?scp=70449089773&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70449089773&partnerID=8YFLogxK
U2 - 10.1137/1.9781611972771.30
DO - 10.1137/1.9781611972771.30
M3 - Conference contribution
AN - SCOPUS:70449089773
SN - 9780898716306
T3 - Proceedings of the 7th SIAM International Conference on Data Mining
SP - 332
EP - 342
BT - Proceedings of the 7th SIAM International Conference on Data Mining
PB - Society for Industrial and Applied Mathematics Publications
T2 - 7th SIAM International Conference on Data Mining
Y2 - 26 April 2007 through 28 April 2007
ER -