Institut de Lingüística Aplicada
 

IULA Spanish LSP Treebank

The IULA Spanish LSP Treebank is a Spanish treebank containing the syntactic annotation of 42,000 sentences (almost 590,000 tokens). It was developed within the framework of the METANET4U project (Enhancing the European Linguistic Infrastructure, GA 270893).

The sentences in the IULA Spanish LSP Treebank are extracted from the Corpus Tècnic de l'IULA, a collection of written texts from the fields of Law, Economy, Genomics, Medicine, and Environment, together with a contrastive corpus of press texts.

1.

Access the online query interface TreebankBrowser

Online query interface: TreebankBrowser

 

 

2.

Download the corpus in CoNLL format

Corpus texts in CoNLL format: e-repositori

See the specifications of the CoNLL format.
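As a rough illustration of how the downloaded files might be read, the following Python sketch parses CoNLL-style text into sentences. It assumes the ten tab-separated columns of the CoNLL-X layout (ID, FORM, LEMMA, CPOSTAG, POSTAG, FEATS, HEAD, DEPREL, PHEAD, PDEPREL) and blank lines between sentences; the exact column inventory of the treebank files should be checked against the specifications. The function names, the toy sentences, and their tags and dependency labels are invented for the example and are not taken from the corpus.

```python
from typing import Dict, Iterable, Iterator, List, Tuple

# Assumed column names (CoNLL-X layout); the treebank files may differ.
CONLL_COLUMNS = [
    "id", "form", "lemma", "cpostag", "postag",
    "feats", "head", "deprel", "phead", "pdeprel",
]

Token = Dict[str, str]


def read_conll(lines: Iterable[str]) -> Iterator[List[Token]]:
    """Yield one sentence at a time as a list of token dictionaries."""
    sentence: List[Token] = []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        fields = line.split("\t")
        # zip() stops at the shorter sequence, so rows with fewer
        # columns than expected are still accepted.
        sentence.append(dict(zip(CONLL_COLUMNS, fields)))
    if sentence:
        # The last sentence may not be followed by a blank line.
        yield sentence


def count_sentences_and_tokens(lines: Iterable[str]) -> Tuple[int, int]:
    """Return (number of sentences, number of tokens)."""
    n_sentences = 0
    n_tokens = 0
    for sent in read_conll(lines):
        n_sentences += 1
        n_tokens += len(sent)
    return n_sentences, n_tokens


# Hypothetical toy input; tags and labels are illustrative only.
SAMPLE = (
    "1\tEl\tel\td\td\t_\t2\tSPEC\t_\t_\n"
    "2\tjuez\tjuez\tn\tn\t_\t3\tSUBJ\t_\t_\n"
    "3\tdicta\tdictar\tv\tv\t_\t0\tROOT\t_\t_\n"
    "\n"
    "1\tFin\tfin\tn\tn\t_\t0\tROOT\t_\t_\n"
)

if __name__ == "__main__":
    print(count_sentences_and_tokens(SAMPLE.splitlines()))
```

With a real file, the same functions can be applied to an open file handle, for example `with open(path, encoding="utf-8") as f: count_sentences_and_tokens(f)`, where `path` stands for wherever the downloaded corpus was saved.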

3.

Related Tools

An instance of MaltParser for Spanish has been trained using this corpus.

4.

Related Publications

Marimon, Montserrat; Fisas, Beatriz; Bel, Núria; Arias, Blanca; Vázquez, Silvia; Vivaldi, Jorge; Torner, Sergi; Villegas, Marta; Lorente, Mercè (2012). "The IULA Treebank". In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12). Istanbul, Turkey: European Language Resources Association (ELRA). pp. 1920-1926.

5.

Acknowledgments

The creation of the Treebank was funded by the METANET4U project (CIP-PSP-270893) and IULA.

 

© INSTITUT DE LINGÜÍSTICA APLICADA - UNIVERSITAT POMPEU FABRA