Publications

2014

Dutch parallel corpus en SoNaR (.pdf)

Macken, L., De Clercq, O., Desmet, B., & Hoste, V. (2014).

SoNaR-500

Oostdijk, N., Hoste, V., de Jong, F., Reynaert, M., De Clercq, O., Desmet, B., & van den Heuvel, H. (2014).

SoNaR-1

Oostdijk, N., Reynaert, M., Hoste, V., van den Heuvel, H., Monachesi, P., Desmet, B., De Clercq, O., & Schuurman, I. (2014).

SoNaR nieuw media corpus

Oostdijk, N., Reynaert, M., Hoste, V., van den Heuvel, H., De Clercq, O., & Sanders, E. (2014).

LT3: sentiment classification in user-generated content using a rich feature set (.pdf)

Van Hee, C., Van de Kauter, M., De Clercq, O., Lefever, E., & Hoste, V. (2014).

Terminologie: op het snijvlak van ambacht en technologie (.pdf)

Vanopstal, K., Macken, L., Lefever, E., Van de Kauter, M., Buysschaert, J., & Hoste, V. (2014).

SemEval 2014 Task 5 - L2 Writing Assistant (.pdf)

van Gompel, M., Hendrickx, I., van den Bosch, A., Lefever, E., & Hoste, V. (2014).

2013

Normalization of Dutch user-generated content (.pdf)

De Clercq, O., Desmet, B., Schulz, S., Lefever, E., & Hoste, V. (2013).

Emotion detection in suicide notes (.pdf)

Desmet, B., & Hoste, V. (2013).

Gallop Documentation (.pdf)

Desmet, B., Hoste, V., Verstraeten, D., & Verhasselt, J. (2013).

COREA: coreference resolution for extracting answers for Dutch (.pdf)

Hendrickx, I., Bouma, G., Daelemans, W., & Hoste, V. (2013).

The construction of a 500-million-word reference corpus of contemporary written Dutch (.pdf)

Oostdijk, N., Reynaert, M., Hoste, V., & Schuurman, I. (2013).

LeTs preprocess: the multilingual LT3 linguistic preprocessing toolkit (.pdf)

Van de Kauter, M., Coorman, G., Lefever, E., Desmet, B., Macken, L., & Hoste, V. (2013).

2012

Evaluating automatic cross-domain Dutch semantic role annotation (.pdf)

De Clercq, O., Hoste, V., & Monachesi, P. (2012).

Evaluating Automatic Cross-Domain Semantic Role Annotation (.pdf)

De Clercq, O., Hoste, V., & Monachesi, P. (2012).

From Character to Word Level: Enabling the Linguistic Analyses of Inputlog Process Data (.pdf)

Leijten, M, Macken, L., Hoste, V., Van Horenbeeck, E, & Van Waes, L. (2012).

From character to word level: enabling the linguistic analyses of Inputlog process data (.pdf)

Leijten, M, Macken, L., Hoste, V., Van Horenbeeck, E, & Van Waes, L. (2012).

Beyond SoNaR: towards the facilitation of large corpus building efforts (.pdf)

Reynaert, M., Schuurman, I., Hoste, V., Oostdijk, N., & van Gompel, M. (2012).

2011

Cross-domain Dutch coreference resolution (.pdf)

De Clercq, O., Hoste, V., & Hendrickx, I. (2011).

Using parallel corpora for word sense disambiguation (.pdf)

Lefever, E., Hoste, V., & De Cock, M. (2011).

Taal- en spraaktechnologie: een stand van zaken (.pdf)

Lefever, E., Macken, L., & Hoste, V. (2011).

2010

Classification-based scientific term detection in patient information (.pdf)

Hoste, V., Vanopstal, K., Lefever, E., & Delaere, I. (2010).

Clustering web people search results using fuzzy ants (.pdf)

Lefever, E., Fayruzov, T., Hoste, V., & De Cock, M. (2010).