Coreference Resolution on Blogs and Commented News
- Hendrickx, I., & Hoste, V.
- S. Lalitha Devi, A. Branco, and R. Mitkov
- Anaphora Processing and Applications, Lecture Notes in Artificial Intelligence
- Springer-Verlag (Heidelberg)
We focus on automatic coreference resolution for blogs and news articles with user comments as part of a project on opinion mining. We aim to study the effect of the genre shift from edited, structured newspaper text to unedited, unstructured blog data. We compare the performance of our coreference resolution system on three data sets: newspaper articles, mixed newspaper articles and reader comments, and blog data. As can be expected, the performance of the automatic coreference resolution system drops drastically when tested on unedited text. We describe the characteristics of the different data sets and examine the typical errors made by the resolution system.