Abstract
'Big Data' classification is hindered by the large volume of often high-dimensional data, missing or absent features and, in streaming operation, the need for real-time processing. This paper aims at learning a kernelized support-vector-machine (SVM) classifier from (generally nonlinearly separable) large-scale incomplete data 'on the fly.' Leveraging the low-rank attribute of the (even incomplete) data matrix, a novel online algorithm is developed for tracking the latent linear subspace jointly with the nonlinear classifier. Tailored for big data applications, dimensionality reduction based on the learned subspace is carried out online, while at the same time seeking the classifier in the reduced dimension. Performance analysis along with preliminary tests corroborate the effectiveness of the novel approach.
Original language | English (US) |
---|---|
Title of host publication | 2015 49th Annual Conference on Information Sciences and Systems, CISS 2015 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9781479984282 |
DOIs | |
State | Published - Apr 15 2015 |
Event | 2015 49th Annual Conference on Information Sciences and Systems, CISS 2015 - Baltimore, United States Duration: Mar 18 2015 → Mar 20 2015 |
Publication series
Name | 2015 49th Annual Conference on Information Sciences and Systems, CISS 2015 |
---|
Other
Other | 2015 49th Annual Conference on Information Sciences and Systems, CISS 2015 |
---|---|
Country/Territory | United States |
City | Baltimore |
Period | 3/18/15 → 3/20/15 |
Bibliographical note
Publisher Copyright:© 2015 IEEE.