2 citations found. Retrieving documents...
P.-N. Tan and V. Kumar. Modeling of web robot navigational patterns. In WEBKDD 2000.

 Home/Search   Document Details and Download   Summary   Related Articles   Check  

This paper is cited in the following contexts:
SUGGEST : A Web Usage Mining System - Ranieri Baraglia Paolo (2002)   (Correct)

....removing all the uninteresting entries from the input access log file, supposed to be in Common Log Format. Namely, we remove all the non html requests, like images or CGI scripts. Also the dumb scans of the entire site coming from robot like agents are removed. We used the technique described in [10] to model robots behavior. Then we create user sessions by identifying users with their IP address and sessions by means of a predefined timeout between two subsequent requests from the same user. According to Catledge et al. in [1] we fixed a timeout value equal to 30 minutes. The clustering ....

P.-N. Tan and V. Kumar. Modeling of web robot navigational patterns. In WEBKDD 2000.


Analyzing Web Robots and Their Impact on Caching - Almeida, Menascé.. (2001)   (5 citations)  (Correct)

....behavior. There are very few studies on Web robots available. Most concentrate on defining architectures and implementations for crawlers and shopbots. In [8] the authors survey the state of the art of Web robots and discuss robot crawling, atechnique for building indices for search engines. In [3], the authors examine the problem of identifying navigational patterns of Web robot sessions using standard classification techniques but do not cover features and statistical characterization of robot accesses. Reference [4] searches for invariants in e business workloads. The authors studied the ....

....demands not only the determination of visiting patterns and arrival process, but also the nature of the parameters requested by robots. For instance, acrawler visits each object served bytheWeb site just once, producing an access pattern that is quite different from human users. Inspired by [3, 4] weintroduce now several criteria, based on the aforementioned hierarchical model, whichallowus to identify robots in real logs. We group these characteristic criteria according to the three layers of the hierarchical model. 2.1 The Robot Criteria 2.1.0.1 Session Layer The session layer ....

P.Tan and V. Kumar, "Modeling of Web Robot Navigational Patterns," Proc. ACM WebKDD Workshop, 2000.

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC