Abstract
Data mining tasks typically require significant effort in data preparation to find, transform, integrate and prepare the data for the relevant data mining tools. In addition, the work performed in data preparation is often not recorded and is difficult to reproduce from the raw data. In this paper we present an integrated approach to data preparation and data mining that combines the two steps into a single integrated process and maintains detailed metadata about the data sources, the steps in the process, and the resulting learned classifier produced from data mining algorithms. We present results on an example scenario, which shows that our approach provides significant reduction in the time in takes to perform a data mining task.
Original language | English (US) |
---|---|
Title of host publication | Proceedings - 14th IEEE International Conference on Data Mining Workshops, ICDMW 2014 |
Editors | Zhi-Hua Zhou, Wei Wang, Ravi Kumar, Hannu Toivonen, Jian Pei, Joshua Zhexue Huang, Xindong Wu |
Publisher | IEEE Computer Society |
Pages | 1076-1085 |
Number of pages | 10 |
Edition | January |
ISBN (Electronic) | 9781479942749 |
DOIs | |
State | Published - Jan 26 2015 |
Externally published | Yes |
Event | 14th IEEE International Conference on Data Mining Workshops, ICDMW 2014 - Shenzhen, China Duration: Dec 14 2014 → … |
Publication series
Name | IEEE International Conference on Data Mining Workshops, ICDMW |
---|---|
Number | January |
Volume | 2015-January |
ISSN (Print) | 2375-9232 |
ISSN (Electronic) | 2375-9259 |
Other
Other | 14th IEEE International Conference on Data Mining Workshops, ICDMW 2014 |
---|---|
Country/Territory | China |
City | Shenzhen |
Period | 12/14/14 → … |
Bibliographical note
Publisher Copyright:© 2014 IEEE.