15 citations found. Retrieving documents...
H. Ku cera and W. N. Francis. Computational Analysis of Present-Day American English. Brown University Press, Providence, RI, 1967.

 Home/Search   Document Not in Database   Summary   Related Articles   Check  

This paper is cited in the following contexts:
Guiding a Linguistically Well-Founded Parser with Head Patterns - Seagull, Schubert (2001)   (Correct)

....it is the hyponymy (subtype supertype) and hyperonymy (supertypesubtype) relationships that we use here. To enable the supervised learning of WordNet senses and abstractions in syntactic constructs, we require a corpus that is annotated with both syntactic and word sense tags. The Brown Corpus [Ku cera and Francis, 1967], the rst large scale computerized text collection project, luckily has been annotated both with syntactic tags (as a part of the Penn Treebank project) and (a third of it) with WordNet sense tags (the SEMCOR project [Landes et al. 1998] In Section 4.2 we discuss how these two corpora were ....

Henry Kucera and W. Nelson Francis, Computational Analysis of Present-day American English, Brown University Press, Providence, R.I., 1967.


Compiling and Using the IJS-ELAN Parallel Corpus - Erjavec (2002)   (Correct)

....contain many errors, although less than for Slovene, given the much smaller tagset. Rather than try to improve the accuracy of our own tagging, we opted for additional annotations with other, better, models and also with a better known tagset, namely (variants of) the one used in the Brown corpus [15]. For the additional annotation of the En glish part we combined the output of two taggers. First, the TnT tagger distribution already includes some English models. We chose the one produced by training on the concatenation of the Brown corpus with the Wall Street Journal corpus; this training ....

Henry Ku6era and William Nelson Francis. Com- putational Analysis of Present Day American English. Brown University Press, Providence, Rhode Island, 1967.


Age of Acquisiton Effects in Reading and Other Tasks - Zevin, Seidenberg   (Correct)

....length, spelling sound consistency, and imageability affect performance (see Balota, 1994; Seidenberg, 1995, for reviews) Some properties, such as length in letters, are objective; others, such as frequency, are estimates of variables whose actual values are unknown. For example, the widely used Ku cera and Francis (1967) norms indicate how often each word occurs in a corpus of about 1 million words. These frequencies are then taken as an estimate of something more interesting: how often people encounter the words in reading or using spoken language. However, due Research supported by NIMH grant PO1 MH47566, NICHD ....

....Neuroscience Program, University of Southern California, Los Angeles, CA 90089 2520. Electronic mail address: marks gizmo.usc.edu. AGE OF ACQUISITION EFFECTS 2 to variables such as the size of the corpus, the sample of texts used in generating the corpus, and individual differences among subjects, Ku cera and Francis (1967) and other corpora provide only a rough estimate of how often words are actually encountered. These sources of error can complicate the interpretation of frequency effects in behavioral studies (Gernsbacher, 1984) Several recent word recognition studies (Morrison Ellis, 1995; Gerhand Barry, ....

[Article contains additional citation context not shown here]

Ku cera, H., & Francis, W. N. (1967). Computational analysis of present-day American English. Providence, RI: Brown University Press.


Random Texts Exhibit Zipf's-Law-Like Word Frequency Distribution - Li (1992)   (21 citations)  (Correct)

....P (2) and the rank r word has the frequency P (r) the frequency distribution is P (r) C r ; 1) 1 Zipf s law 2 with C 0:1 and 1. This distribution, also called Zipf s law, has been checked for accuracy for the standard corpus of the present day English with very good results [2]. The fall o of the distribution as the rank is increased is obvious, because the more frequently occurring words are guaranteed to have larger frequencies than those less frequently occurring. Nevertheless, it seems to be a puzzle as why the decay is a power law instead of an exponential ....

Henry Kucera and W. Nelson Francis, Computational Analysis of Present-Day American English (Brown University, 1967).


Estimating Probability Distributions over Hypotheses with Variable.. - Wu (1993)   (1 citation)  (Correct)

....is assumed to signify linear order locatives more frequently than containment. The marginals in 3: effectively reduce the conditional probability of thinking of roads along the seacoast, given one is thinking of roads in the context of seacoasts. The West Coast res 2 From the Brown corpus (Kucera Francis 1967). Our approach to nominal compounds is discussed in Wu (1990) which proposes the use of probability to address long standing problems from the linguistics literature (e.g. Lees 1963; Downing 1977; Levi 1978; Warren 1978; McDonald 1982) AAAI 93 WU 5 (a) 2 6 4 ABBRV: C:coast:seacoast PROB: ....

KU CERA, HENRY & W. NELSON FRANCIS. 1967. Computational analysis of present-day American English. Providence, RI: Brown University Press.


A Compaction of WordNet Senses for Evaluation of Word Sense.. - Seagull (2000)   (Correct)

....a large lexicon, and as a result, introspection is often the nal arbiter for a lexicographer making such distinctions. A more practical problem with these microdistinctions is that they are dicult to reliably annotate. Ng and Lee [1996] note that in comparing their annotation of the Brown Corpus [Ku cera and Francis, 1967] with the SEMCOR annotation [Landes et al. 1998] both with 1 The average noun sense ambiguity for a sample of English text is almost certainly greater. Ng and Lee [1996] report that their collection of 121 most common nouns (which account for 20 of all noun occurrences in English) average 7.8 ....

Henry Kucera and W. Nelson Francis, Computational Analysis of Present-day American English, Brown University Press, Providence, R.I., 1967.


Impaired Oral Reading in Surface Dyslexia.. - Plaut, Behrmann.. (1993)   (Correct)

....leaving a corpus of 2395 responses. These data were collected over a four month period approximately two years after the data for Experiments 1 3 were collected. Words with unique bodies (e.g. SOAP) were excluded from the analysis. The remaining 2184 trials were divided into three frequency bands (Ku cera Francis, 1967): high ( 100 per million; n=325) medium ( 100 and =10; n=792) and low ( 10; n=1067) The trials were also classified according to neighborhood consistency (number of friends, F, and enemies, E) regular ( F 1) E =4; n=1837) ambiguous (0.5 (F 1) E 4; n=224) or exception ( F 1) E =0.5; n=123) ....

Ku cera, H., & Francis, W. N. (1967). Computational analysis of present-day American English. Providence, RI: Brown University Press.


Strategic Effects in Visual Word Recognition - Voice   (Correct)

....A control group also saw all 420 items in blocks of twenty one, but the items in these blocks were randomly assigned. 2.3 Materials All the stimuli used in the present experiment were four letters long and monosyllabic. Word frequency was measured using the number of samples measure from the Kucera and Francis (1967) norms. Orthographic neighbourhood size was measured using Coltheart s N metric (Coltheart, Davelaar and Besner 1977) N is the number of words which can be generated by replacing any letter of a target word with another letter in the same position (e.g. HILL: fill, hell, hilt) Concreteness ....

Ku cera, H. & Francis, W. N. 1967. Computational analysis of present--day American English. Providence: Brown University Press.


Minimizing Binding Errors Using Learned Conjunctive Features - Mel, Fiser (1999)   (6 citations)  (Correct)

.... case punctuation free words and their relative frequencies as found in 5 million words in The Wall Street Journal (available from the Association for Computational Linguistics at http: morph.ldc.upenn.edu ) The English text database consisted of 1 million words drawn from a variety of sources (Ku cera Francis, 1967). Recognition Using N grams A natural class of visual features in text world is the position invariant n gram , de ned here as a binary detector that responds when a speci c spatial con guration of n letters is found anywhere (one or more times) in the input array 2 . The value of n is termed ....

Kucera, H., & Francis, W. (1967). Computational analysis of present-day American English.


Log-Linear Interpolation of Language Models - Gutkin (2006)   (Correct)

No context found.

H. Ku cera and W. N. Francis. Computational Analysis of Present-Day American English. Brown University Press, Providence, RI, 1967.


Log-Linear Interpolation of Language Models - Gutkin (2000)   (Correct)

No context found.

H. Ku cera and W. N. Francis. Computational Analysis of Present-Day American English. Brown University Press, Providence, RI, 1967.


A Figure of Merit for the Evaluation of Web-Corpus Randomness - Ciaramita, Baroni (2006)   (Correct)

No context found.

H. Ku cera and W. Francis. 1967. Computational Analysis of Present-Day American English. Brown University Press, Providence, RI.


Merl A Mitsubishi Electric Research Laboratory - Http Www Merl (1995)   (Correct)

No context found.

H. Ku#cera and W. N. Francis. Computational Analysis of Present-Day American English. Brown University Press, Providence, RI, 1967.


Merl A Mitsubishi Electric Research Laboratory - Http Www Merl (1996)   (Correct)

No context found.

H. Ku#cera and W. N. Francis. Computational Analysis of Present-Day American English. Brown University Press, Providence, RI, 1967.


Merl A Mitsubishi Electric Research Laboratory - Http Www Merl (1996)   (Correct)

No context found.

Ku#cera, H. and W. N. Francis. 1967. Computational Analysis of Present-Day American English.

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC