R. Musick, J. Catlett, and S. Russell. Decision theortic subsampling for induction on large databases. In Proc. of the 10th Inter. Conf. on Machine Learning, San Mateo, CA, 1993. 11

 Home/Search   Document Not in Database   Summary   Related Articles  

This paper is cited in the following contexts:
Sampling: An efficient, simple and robust technique for scaling.. - Addala   (Correct)

....up is the process of handling large amounts of data. Some also consider it as the process of increasing the speed of data mining. Although large data sets are necessary for reliable results, large databases are not necessarily advantages for the following reasons: ffl Not all data is informative [14, 19, 15]. ffl High degree of redundancy in the databases [9, 11] 2 ffl Experimental studies on the entire database are expensive [3] This is the basic problem in genebank collections and in drug industry. For example to conduct genetic studies on a single gene we need many resources and it is ....

R. Musick, J. Catlett, and S. Russell. Decision theortic subsampling for induction on large databases. In Proc. of the 10th Inter. Conf. on Machine Learning, San Mateo, CA, 1993. 11

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC