Ide, N., Véronis, J. (Eds.) 1995. The Text Encoding Initiative: Background and Context. Dordrecht: Kluwer Academic Publishers.
....attachment even when precise attachment fails. Using IDs as stable starting points for relative locations vastly improves robustness. XPointer technical overview XPointer is based largely upon a widely used technology, the Text Encoding Initiative extended pointer [Sperberg-McQueen 1994] [Ide 1995]. Extended pointers provide axes for navigating within trees and a rudimentary predicate language for selecting nodes along axes, and have been implemented in several SGML-based browsing systems. TEI extended pointers introduced location terms including root, here, id, child, descendant, ....
Nancy Ide and Jean Veronis (editors). Text Encoding Initiative: Background and Context, Boston, Kluwer Academic Publishers, ISBN 0792336895, 1995.
....and a dependency formalism [19]. We used a constituent grammar to encode the lexicon and phrase structure rules. Together they accept all the 400 utterances of the corpus we collected in the experiment [15, 16]. The lexicon uses parts of speech that are a variation of MULTEXT TEI categories [20]. Phrase structure rules rewrite the utterance structure using unification constraints and non-terminal categories such as noun groups, verb groups, prepositional groups, determiner groups, adverb groups, and adjective groups. Rules were adapted to accept missing and unknown words. They also ....
Nancy Ide and Jean Veronis, editors. The Text Encoding Initiative: Background and Context, Dordrecht, 1995. Kluwer Academic Publishers.
....to be offered and accessed globally. In addition, the growing acceptance of certain standards that enable the exchange and platform independence of corpora encodings has also had an important impact on corpora availability and reuse. In particular, the Text Encoding Initiative guidelines [SMB94, IV95], which adopt the ISO standard SGML [Gol90] as their markup (meta)language, are a significant contribution to the standardisation effort in this area. This ease of availability and adoption of standards is important not only for corpora themselves, but also for software that helps in producing ....
Nancy Ide and Jean Veronis, editors. The Text Encoding Initiative: Background and Context. Kluwer Academic Publishers, Dordrecht, 1995.
....to be offered and accessed globally. In addition, the growing acceptance of certain standards that enable the exchange and platform independence of corpora encodings has also had an important impact on corpora availability and reuse. In particular, the Text Encoding Initiative guidelines, TEI 3 [20, 17], which adopt the ISO standard SGML 4 [15] as their markup (meta)language, are a significant contribution to the standardisation effort in this area. This ease of availability and adoption of standards is important not only for corpora themselves, but also for software that helps in producing ....
Nancy Ide and Jean Veronis, editors. The Text Encoding Initiative: Background and Context. Kluwer Academic Publishers, Dordrecht, 1995.
.... annotation tool which supports non-hierarchical markup (particularly necessary for speech due to overlapping annotation schemes and multiple overlapping speakers). The idea of using XML as an annotation standard for natural language processing goes back to the work of the Text Encoding Initiative (Ide and Veronis, 1995), and the idea of using a database as a central resource for natural language processing was developed by the GATE project (Cunningham et al. 1996). Data models related to XML and associated query languages were developed by a number of groups including the Lore project (Goldman et al. 1999) and ....
Ide, Nancy and Jean Veronis (eds.), 1995. The Text Encoding Initiative: Background and Context. Dordrecht: Kluwer. See also www.tei-c.org.
....is used to represent annotations, and then the format in which the XML data are stored internally in the MATE workbench. 3.1 XML XML has proved to be a widely used and effective format for annotating text and speech corpora, and a number of projects, including the Text Encoding Initiative (TEI) [Ide and Véronis, 1995] and the Corpus Encoding Standard (CES) [Ide et al. 2000], have developed general-purpose SGML standards for linguistic annotation. XML's flexible hierarchical structure matches well with many kinds of linguistic representation, and the MATE workbench uses XML as its input/output format, and uses ....
Ide, N. and Veronis, J., editors (1995). The Text Encoding Initiative: Background and Context. Dordrecht: Kluwer. See also www.tei-c.org.
No context found.
Ide, N., Véronis, J. (Eds.) 1995. The Text Encoding Initiative: Background and Context. Dordrecht: Kluwer Academic Publishers.
....for a sentence-level alignment. Therefore, diversity in the nature of texts was preferred to the collection of a very big amount of similar data. 3.1 Format ARCADE contributed to the development and testing of the Corpus Encoding Standard (CES) which was initiated in the MULTEXT project (Ide et al. 1995). The CES is based on SGML and it is an extension of the recommendations of the Text Encoding Initiative (Ide and Véronis, 1995), today internationally accepted. Both the JOC and BAF parts of the ARCADE corpus (described below) are encoded in CES format. 3.2 JOC The JOC corpus is composed of ....
....big amount of similar data. 3.1 Format ARCADE contributed to the development and testing of the Corpus Encoding Standard (CES) which was initiated in the MULTEXT project (Ide et al. 1995). The CES is based on SGML and it is an extension of the recommendations of the Text Encoding Initiative (Ide and Véronis, 1995), today internationally accepted. Both the JOC and BAF parts of the ARCADE corpus (described below) are encoded in CES format. 3.2 JOC The JOC corpus is composed of records of questions and answers regarding European Community matters. The data is regularly published as one section of the C ....
N. Ide and J. Véronis, 1995. The Text Encoding Initiative: background and context, 342 p. Kluwer Academic Publishers, Dordrecht.
No context found.
Ide, N. and J. Véronis (eds.). 1995a. The Text Encoding Initiative: background and context. Dordrecht: Kluwer Academic Publishers.
Ide, N. and J. Véronis. 1995b. Corpus Encoding Standard.
....and two editors. Information about the Initiative appears primarily in the Guidelines [14] and on the TEI's World Wide Web page [16]. The first three numbers of the 1995 volume of Computers and the Humanities deal with various aspects of the TEI; these have been reprinted in monograph form [12]. The new encoding scheme was required to: • adequately represent all the textual features needed for research, • be simple, clear and concrete, • be easy for researchers to use without special-purpose software, • allow the rigorous definition and efficient processing of texts, • ....
Nancy M. Ide and Jean Veronis, editors. The Text Encoding Initiative: Background and Context. Kluwer Academic Publishers, Dordrecht, 1995.