| T. G. Vosse, "The Word Connection," Ph.D. dissertation, University of Leiden, The Netherlands. Neslia Paniculata Uitgeverij, Enschede, 1994. |
....may and what may not occur. It makes use of what trigrams of characters and what triphones (trigrams of phonemes) are viable in the Dutch language (given a dictionary of words that may occur) Using these trigrams substrings of the input string are compared to words in the dictionary. We refer to (Vosse 1994) for details on this error correction method. For reasons of compositionality we have chosen the latter approach: for the Maf module to be kept divisible in submodules, this option offers the best possibility to partition Maf in a number of submodules that have clear input output specifications ....
Vosse, T. G. (1994). The Word Connection. PhD dissertation, Rijksuniversiteit Leiden. Neslia Paniculata.
....may and what may not occur. It makes use of what trigrams of characters and what triphones (trigrams of phonemes) are viable in the Dutch language (given a dictionary of words that may occur) Using these trigrams substrings of the input string are compared to words in the dictionary. We refer to (Vosse 1994) for details on this error correction method. For reasons of compositionality we have chosen the latter approach: for the Maf module to be kept dividable in submodules, this option offers the best possibility to partition Maf in a number of submodules that have clear input output specifications ....
....number; 16 Lex tags the string ok and thanks with a feature type indicating the end of the dialogue. 3.5 Stage of Development Currently modules DBrec, CorSe and Lex are available. DBrec and Lex have been developed by the Schisma partners, and CorSe has been adapted from the source code based on (Vosse 1994). The other modules Date, Time and Number are still under construction. Future research concerning Maf will follow the integrated approach as discussed in section 3.2. Expertise accumulated in working on the separate modules of Maf, will then be incorporated in the new design. 4 Parsing The ....
Vosse, T. G. (1994). The Word Connection. PhD dissertation, Rijksuniversiteit Leiden. Neslia Paniculata.
....actually occur in the document collection are added to the query. every dictionary is necessarily incomplete in this respect. To handle this problem, some stemmer versions were extended with a compound analyser, the word splitter developed by Theo Vosse for the CORRie (grammar checker) project [Vosse, 1994]. The word splitter will try to split a compound into its components (stems) on the basis of word combination rules for Dutch and a lexicon. If the splitter is unsuccessful, the word is left unchanged. The following results were obtained with the compound splitter using a random sample of ....
Vosse, T. G. (1994). The Word Connection. PhD thesis, Rijksuniversiteit Leiden, Neslia Paniculata Uitgeverij, Enschede.
....was used analogous to the one in Definition 4; the filtering function remains the same. The first grammar generates a subset of the programming language ALGOL 68 (van Wijngaarden and others, 1975) The second and third grammars generate a fragment of Dutch, and are referred to as the CORRiegrammar (Vosse, 1994) and the Deltra grammar (Schoorl and Belder, 1990) respectively. These grammars were stripped of their arguments in order to convert them into context free grammars. The fourth grammar, referred to as the Alvey grammar (Carroll, 1993) generates a fragment of English and was automatically ....
Vosse, T.G. 1994. The Word Connection. Ph.D. thesis, University of Leiden.
....processes. We restricted compound splitting by creating system variants which only add the heads or both heads and modifiers as separate index terms. To split up compounds into their constituents we used the dictionary based compound splitter developed by Theo Vosse for the CORRie project (cf. Vosse (1994)) The compound splitter does not assign structure to the compound but simply yields a list of constituents. Identifying head modifier relationships in compounds is not trivial because of possible structural ambiguities. In Dutch, compounds existing of two parts are usually right headed (a ....
Vosse, T. G. (1994). The Word Connection. Ph. D. thesis, Rijksuniversiteit Leiden, Neslia Paniculata Uitgeverij, Enschede.
No context found.
T. G. Vosse, "The Word Connection," Ph.D. dissertation, University of Leiden, The Netherlands. Neslia Paniculata Uitgeverij, Enschede, 1994.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC