R. Dale and C. Mellish. Towards the evaluation of natural language generation. In The First International Conference on Evaluation of Natural Language Processing Systems, Granada, Spain, May 1998.
....and robust generation techniques was necessary due to the real-time interaction with the users. The main difference from other generation approaches is the use of a generic agent modelling framework instead of a custom-made user model. Previous attempts at evaluating language generation systems [6] have shown that it is a difficult task and that the results seldom measure the performance and contribution of each module within the system. A measure of the overall performance can be obtained by using a black-box evaluation assessing the quality of the generated hypertext based on users ....
Robert Dale and Chris Mellish. Towards evaluation in natural language generation. In Proceedings of First International Conference on Language Resources and Evaluation, Granada, Spain, 28-30 May, 1998.
....the texts generated by the LGM. Evaluation of the prosodic quality may be done by formally comparing the output of D2S with that of the best text-to-speech systems available for Dutch. Informal comparison has so far given encouraging results. Evaluation of the LGM is a more complicated matter. As Dale and Mellish (1998) have pointed out, evaluation of natural language generation systems is still in its infancy, and there are no well-established evaluation methods in this area. An evaluation method which seems promising is the one adopted by Coch (1996) and Lester and Porter (1997). They compared computer generated ....
....An evaluation method which seems promising is the one adopted by Coch (1996) and Lester and Porter (1997). They compared computer-generated texts to texts from human authors by having a panel of judges, who did not know the source of the texts, rate their quality on several dimensions. However, see Dale and Mellish (1998) for a discussion of some problems related to such a black-box evaluation. In addition to having separate evaluations of the LGM, the prosody module and the SGM, it would also be interesting to see an evaluation of D2S as a whole. However, as data-to-speech systems are obviously even more ....
Dale, R., & Mellish, C. 1998. Towards evaluation in natural language generation. Pages 555--562 of: Rubio, A., Gallardo, N., Castro, R., & Tejada, A. (eds), Proceedings of the First International Conference on Language Resources and Evaluation.