| R. Musick, J. Catlett, and S. Russell. Decision theortic subsampling for induction on large databases. In Proc. of the 10th Inter. Conf. on Machine Learning, San Mateo, CA, 1993. 11 |
....up is the process of handling large amounts of data. Some also consider it as the process of increasing the speed of data mining. Although large data sets are necessary for reliable results, large databases are not necessarily advantages for the following reasons: ffl Not all data is informative [14, 19, 15]. ffl High degree of redundancy in the databases [9, 11] 2 ffl Experimental studies on the entire database are expensive [3] This is the basic problem in genebank collections and in drug industry. For example to conduct genetic studies on a single gene we need many resources and it is ....
R. Musick, J. Catlett, and S. Russell. Decision theortic subsampling for induction on large databases. In Proc. of the 10th Inter. Conf. on Machine Learning, San Mateo, CA, 1993. 11
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC