Intelligent Systems Dept.
Abstract:
This paper gives an overview of language engineering public domain and freely available software. The focus is on lingware tools that are available via the World Wide Web for the Unix platform and concerned with corpora production. Discussed is the relation of tools to standards, in particular SGML, and the benefits and disadvantages of using public domain tools. Given is an overview of a number of generic string processing and corpus conversion tools of statistically based annotations systems and computational linguistic software. Some on-going initiatives on production, standardisation and availability of language tools are mentioned and a number of Web sites, related to the discussed topics are listed.

