See this document in CiteSeerX!

Giga-Mining (1998)  (Make Corrections)  (2 citations)
Corinna Cortes Daryl Pregibon ATT Labs-Research 180 Park Ave Bldg 103 Florham ...
Knowledge Discovery and Data Mining



  Home/Search   Context   Related

 
View or download:
att.com/~corinna/pa...giga.mining.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  att.com/info/corinna (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: We describe an industrial-strength data mining application in telecommunications. The application requires building a short (7 byte) profile for all telephone numbers seen on a large telecom network. By large, we mean very large: we maintain approximately 350 million profiles. In addition, the procedure for updating these profiles is based on processing approximately 275 million call records per day. We discuss the motivation for massive tracking and fully describe the definition and... (Update)

Context of citations to this paper:   More

...was placed. It might also contain derived information such as the degree to which the calling pattern from the number is business like [4]. Programs to compute signatures must be highly optimized because of the size of the data stream and the number of signatures tracked....

Cited by:   More
Data Mining in Telecommunications - Weiss   (Correct)
Hancock: A Language for Extracting Signatures from.. - Cortes, Fisher.. (2000)   (Correct)

Similar documents based on text:   More   All
0.1:   Communities of Interest - Cortes, Pregibon, Volinsky   (Correct)
0.1:   Wardialing Brief - Kingpin Stake Inc   (Correct)
0.1:   Squashing Flat Files Flatter - DuMouchel, Volinsky, Johnson.. (1999)   (Correct)

Related documents from co-citation:   More   All
2:   Data Mining and Knowledge Discovery (context) - Fawcett, Provost et al. - 1997

BibTeX entry:   (Update)

C. Cortes and D. Pregibon. Giga mining. In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, 1998. http://citeseer.ist.psu.edu/469772.html   More

@inproceedings{ cortes98gigamining,
    author = "Corinna Cortes and Daryl Pregibon",
    title = "Giga-Mining",
    booktitle = "Knowledge Discovery and Data Mining",
    pages = "174-178",
    year = "1998",
    url = "citeseer.ist.psu.edu/469772.html" }
Citations not processed or no citations identified.

Documents on the same site (http://www.research.att.com/info/corinna):   More
Limits on Learning Machine Accuracy Imposed by Data Quality - Cortes, Jackel, Chiang (1995)   (Correct)
Hancock: A Language for Extracting Signatures from.. - Cortes, Fisher.. (2000)   (Correct)
Support-Vector Networks - Cortes, Vapnik (1995)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC