Text Augmentation: Inserting XML tags into natural language text with PPM Models and Viterbi-like search (2003)
Citations: 2 (0 self)
BibTeX
@MISC{Yeates03textaugmentation:,
author = {Stuart A. Yeates},
title = {Text Augmentation: Inserting XML tags into natural language text with PPM Models and Viterbi-like search},
year = {2003}
}
Abstract
This thesis develops work on using Hidden Markov Models to insert tags into natural language text. A taxonomy of tags is developed, unifying the fields of text segmentation tagging, part-of-speech tagging, proper noun extraction and hierarchical entity extraction. The search spaces for inserting tags are examined from both a theoretical and an experimental point of view, across the taxonomy and on four corpora. An analysis of different correctness measures for different types of tag-insertion problem is undertaken, and a technique to determine whether tag-insertion errors are the result of a modelling failure or a searching failure is discovered.







