Results 1 - 10
of
461
StegHTML: A message hiding mechanism in HTML tags
, 2007
"... Traditional steganographic techniques have made use of data such as audio, video and text files for encoding information. This work looks makes an attempt to define a scheme to embed information in HTML files. Most HTML tags take attributes to finetune their effect. HTML tag attributes do not take a ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
Traditional steganographic techniques have made use of data such as audio, video and text files for encoding information. This work looks makes an attempt to define a scheme to embed information in HTML files. Most HTML tags take attributes to finetune their effect. HTML tag attributes do not take
Deriving Link-context from HTML Tag Tree
, 2003
"... HTML anchors are often surrounded by text that seems to describe the destination page appropriately. The text surrounding a link or the link-context is used for a variety of tasks associated with Web information retrieval. These tasks can benefit by identifying regularities in the manner in which &q ..."
Abstract
-
Cited by 11 (4 self)
- Add to MetaCart
HTML anchors are often surrounded by text that seems to describe the destination page appropriately. The text surrounding a link or the link-context is used for a variety of tasks associated with Web information retrieval. These tasks can benefit by identifying regularities in the manner in which
Deriving Link-context from HTML Tag Tree
, 2003
"... HTML anchors are often surrounded by text that seems to describe the destination page appropriately. The text surrounding a link or the link-context is used for a variety of tasks associated with Web information retrieval. These tasks can benefit by identifying regularities in the manner in which &q ..."
Abstract
- Add to MetaCart
HTML anchors are often surrounded by text that seems to describe the destination page appropriately. The text surrounding a link or the link-context is used for a variety of tasks associated with Web information retrieval. These tasks can benefit by identifying regularities in the manner in which
Web-Document retrieval by genetic learning of importance factors for HTML tags
- Tags, Proceedings of PRICAI 2000 Workshop on Text and Web Mining
, 2000
"... Abstract. In contrast to conventional documents, a Web document con-sists of a number of tags which provide hints on the structure of the docu-ments. In this paper, we propose a Web-document retrieval method using the characteristics of HTML tags. This method learns the importance of tags from a tra ..."
Abstract
-
Cited by 3 (2 self)
- Add to MetaCart
Abstract. In contrast to conventional documents, a Web document con-sists of a number of tags which provide hints on the structure of the docu-ments. In this paper, we propose a Web-document retrieval method using the characteristics of HTML tags. This method learns the importance of tags from a
HTML Tags as Extraction Cues for Web Page Description Construction
- Informing Science Journal
, 2003
"... Using four previously identified samples of Web pages containing meta-tagged descriptions, the value of meta-tagged keywords, the first 200 characters of the body, and text marked with common HTML tags as extracts helpful for writing summaries was estimated by applying two measures: density of descr ..."
Abstract
-
Cited by 7 (0 self)
- Add to MetaCart
Using four previously identified samples of Web pages containing meta-tagged descriptions, the value of meta-tagged keywords, the first 200 characters of the body, and text marked with common HTML tags as extracts helpful for writing summaries was estimated by applying two measures: density
Do Web Authors Use HTML Tags to Flag Semantic Content?
"... An investigation into the use of HTML tags as flags for semantic content as found in web pages related to logic programming. Statistics on the tags used and examples of informative and uninformative tagged text are given. 1 Expectations of Tag Sematics This paper is the result of the initial stages ..."
Abstract
- Add to MetaCart
An investigation into the use of HTML tags as flags for semantic content as found in web pages related to logic programming. Statistics on the tags used and examples of informative and uninformative tagged text are given. 1 Expectations of Tag Sematics This paper is the result of the initial
24 Conf. on Data Mining (DMIN’05) Enhanced Information Retrieval by Using HTML Tags
"... Abstract- Whenever digital libraries or knowledge management systems are to be automatically filled with web pages from the internet, document classification of the web pages is one of the major challenges. We present an approach which uses HTML tags in order to improve the quality of the hypertext ..."
Abstract
- Add to MetaCart
Abstract- Whenever digital libraries or knowledge management systems are to be automatically filled with web pages from the internet, document classification of the web pages is one of the major challenges. We present an approach which uses HTML tags in order to improve the quality of the hypertext
HTML Tag Based Metrics for use in Web Page Type Classification
"... Traditional machine learning classifications of HTML documents focus on features drawn from terms in the documents, the link structure of groups of documents, or a combination of both. These techniques attempt to generate topical classifications of documents, with the hopes of mirroring a human&apos ..."
Abstract
- Add to MetaCart
Traditional machine learning classifications of HTML documents focus on features drawn from terms in the documents, the link structure of groups of documents, or a combination of both. These techniques attempt to generate topical classifications of documents, with the hopes of mirroring a human
Record-Boundary Discovery In Web Documents
, 1998
"... Extraction of information from unstructured or semistructured Web documents often requires a recognition and delimitation of records. (By "record" we mean a group of information relevant to some entity.) Without first chunking documents that contain multiple records according to record bou ..."
Abstract
-
Cited by 127 (20 self)
- Add to MetaCart
boundaries, extraction of record information will not likely succeed. In this thesis we describe a heuristic approach to discovering record boundaries in Web documents. In our approach, we capture the structure of a document as a tree of nested HTML tags, locate the subtree containing the records of interest
Structure and Content Analysis for HTML Medical Articles: A
- Hidden Markov Model Approach,” Proc. ACM DocEng
, 2007
"... We describe ongoing research on segmenting and labeling HTML medical journal articles. In contrast to existing approaches in which HTML tags usually serve as strong indicators, we seek to minimize dependence on HTML tags. Designing logical component models for general Web pages is a challenging task ..."
Abstract
-
Cited by 7 (3 self)
- Add to MetaCart
We describe ongoing research on segmenting and labeling HTML medical journal articles. In contrast to existing approaches in which HTML tags usually serve as strong indicators, we seek to minimize dependence on HTML tags. Designing logical component models for general Web pages is a challenging
Results 1 - 10
of
461