| Charles F. Goldfarb and Paul Prescod. XML Handbook (5th ed.). Prentice Hall, 2003. |
....the length of encoding under the assumption that document elements are independent of each other. Our presentation is a preliminary investigation; it remains to carry out experiments to validate our approach on real data. 1 Introduction Extensible Markup Language (XML; www.w3c.org XML ) [GP01] is the universal format for structured documents and data on the Web. With the vision of the semantic Web [BLHL01] becoming a reality, communication of information on the machine level will ultimately be carried out through XML. As the level of XML tra#c grows so will the demand for compression ....
....i.e. one model for element and attribute compression and another for text compression, and, in addition, it utilises the hierarchical structure of XML documents to further compress documents. None of the previous compression algorithms for XML assume that a Document Type Definition (DTD) [GP01] is available as we do in our proposal. To simplify the presentation, in this paper we consider DTDs which define only elements, rather than allowing the definition of attributes and entities as well. The compression techniques and algorithms could be adapted to cater for these additional ....
Charles F. Goldfarb and Paul Prescod. The XML Handbook. Prentice-Hall, third edition, 2001.
....[W3C98] XML Data Reduced (XDR) FT98] BizTalk [Mic00] XML Schema De nition (XSD) W3C00a] Both DTDs and schemas have their pros and cons. As an older practice DTDs have wider support than XML Schemas the rst recommendation of which is from October 2000. DTDs are also human readable [GP00] thanks to their simplicity, and the use of entity declarations makes them quite space economic, at least when compared to XML Schemas. But DTDs have a few drawbacks, too. First of all, the syntax of DTDs is not well formed XML, which means more work for the validating parser, and furthermore, ....
....piece of information manually tagged by a technical writer. The conversion tool then ts the content into a structure with the help of the tags. To reduce the amount of manual work, more automatic methods have been developed. I will brie y describe a visual conversion environment called DynaTag [GP00] and an automatic element type classi cation method [Hei00] DynaTag is a graphical environment for converting word processing (WP) documents to XML. In the beginning, a document is analysed and converted into an intermediate tagged form which is ready to be mapped automatically. If no DTD is ....
Charles F Goldfarb and Paul Prescod. The XML Handbook. Prentice-Hall, second edition, 2000.
....3 specially designed for Web applications. This subset retains the key SGML advantages of extensibility, structure, and validation in a language that is designed to be vastly easier to learn, use, and implement than full SGML. We do not want to give a tutorial on XML but refer to literature [14, 21] and Internet resources 4 . Our intention is to introduce the notions used in the following of the paper. Data in XML is grouped into elements delimited by tags. Elements can be nested and may have assigned attributes. Each tag may have several attribute value pairs; values are always strings. ....
Charles F. Goldfarb and Paul Prescod. The XML Handbook. Prentice Hall, 1998.
....we discuss the use of constraint techniques for enhancing the expressive power of the formalism, both on the level of the logical structure and the index structure. 1 Introduction With the advent and success of SGML ( ISO86, Gol90] and related standards for document representation (XML: GP98] ODA: ISO89] the interest in Information Retrieval (IR) shifted from systems that essentially treat documents as flat unstructured files to systems and querying languages that explicitly take both logical structure and the contents of documents into account. 1 Various formalisms have been ....
....database. If applied to SGML documents, the formalism is also schema independent, which means that documents can be stored without storing the DTD. 3 Schema independence facilitates the storage of heterogeneous document collections and probably will become more important with the use of XML ( GP98] with its liberal concept of a well formed document. The paper is organized as follows. The next section motivates the usage of DAGs as a model for documents and document collections (or other entities) and provides the mathematical foundation for modeling and querying documents before ....
Charles F. Goldfarb and Paul Prescod. The XML Handbook. PrenticeHall, 1998.
No context found.
Charles F. Goldfarb and Paul Prescod. XML Handbook (5th ed.). Prentice Hall, 2003.
No context found.
Charles F. Goldfarb and Paul Prescod. XML Handbook. Prentice Hall, 4th edition, 2001.
No context found.
Charles F. Goldfarb and Paul Prescod. The XML Handbook. Prentice-Hall, 1998.
No context found.
Charles F. Goldfarb and Paul Prescod. XML Handbook. Prentice Hall, 4th edition, 2001.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC