ISST–TANL corpus

Date: 13 Apr 2016

The ISST–TANL corpus is a manually annotated corpus encoded in the CoNLL standard format, with part-of-speech (PoS) tagging and syntactic dependency annotation. Jointly developed by ILC–CNR and the University of Pisa, it exemplifies general language usage and consists of articles from newspapers and periodicals, selected to cover a wide variety of topics. The corpus was used for training and testing in the “Domain Adaptation” Shared Task of EVALITA 2011 (http://www.evalita.it/2011/tasks/dependency_parsing).
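
As a rough illustration of what working with CoNLL-encoded data can look like, the Python sketch below reads sentences from tab-separated lines. It assumes the 10-column CoNLL-X layout (ID, FORM, LEMMA, CPOSTAG, POSTAG, FEATS, HEAD, DEPREL, PHEAD, PDEPREL) with blank lines between sentences; the exact column layout of the distributed files should be checked against the corpus documentation. The names `Token`, `read_conll`, and `SAMPLE` are hypothetical, and the sample sentence and its tags are invented for illustration, not taken from the corpus.

```python
from typing import Iterable, Iterator, List, NamedTuple


class Token(NamedTuple):
    """One token row; fields follow the assumed CoNLL-X column order."""
    id: int
    form: str
    lemma: str
    cpostag: str
    postag: str
    feats: str
    head: int      # ID of the syntactic head, 0 for the root
    deprel: str    # dependency relation to the head


def read_conll(lines: Iterable[str]) -> Iterator[List[Token]]:
    """Yield one list of tokens per sentence; blank lines separate sentences."""
    sentence: List[Token] = []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            if sentence:
                yield sentence
                sentence = []
            continue
        cols = line.split("\t")
        sentence.append(
            Token(int(cols[0]), cols[1], cols[2], cols[3],
                  cols[4], cols[5], int(cols[6]), cols[7])
        )
    if sentence:  # last sentence when the input lacks a trailing blank line
        yield sentence


# Invented example sentence with placeholder tags (not from the corpus).
SAMPLE = (
    "1\tIl\til\tR\tRD\t_\t2\tdet\t_\t_\n"
    "2\tgatto\tgatto\tS\tS\t_\t3\tsubj\t_\t_\n"
    "3\tdorme\tdormire\tV\tV\t_\t0\tROOT\t_\t_\n"
    "\n"
)

if __name__ == "__main__":
    for sent in read_conll(SAMPLE.splitlines()):
        for tok in sent:
            print(tok.form, tok.head, tok.deprel)
```

To read an actual file, the same function could be called on an open file handle, for example `read_conll(open(path, encoding="utf-8"))`, where `path` points to a local copy of the corpus.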

ILC