SciELO - Scientific Electronic Library Online

 
vol.57 número2CORPUS ORAL DE ESTUDIANTES DE INGLÉS EN CHILE (ESOC-CHILE): DISEÑO, ESTRUCTURA Y APLICACIONESANÁLISIS DEL CONTENIDO Y EVIDENCIAS DE VALIDEZ DEL CONSTRUCTO EN LOS EXÁMENES DE ESPAÑOL CON FINES GENERALES Y ACADÉMICOS índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

  • Em processo de indexaçãoCitado por Google
  • Não possue artigos similaresSimilares em SciELO
  • Em processo de indexaçãoSimilares em Google

Compartilhar


RLA. Revista de lingüística teórica y aplicada

versão On-line ISSN 0718-4883

Resumo

CALDERON CAMPOS, MIGUEL. CLASSIC AND MODERN SPANISH CORPORA: BETWEEN PHILOLOGY AND COMPUTATIONAL LINGUISTICS. RLA [online]. 2019, vol.57, n.2, pp.41-64. ISSN 0718-4883.  http://dx.doi.org/10.4067/S0718-48832019000200041.

This article analyses the standard practice when compiling and producing European and American Spanish corpora for the period spanning from the end of the 15th century to the late 19th century. Special attention will be given to the model used for six diachronic corpora: CHARTA, CODEA 2015, CORDIAM, CorLexIn, Post Scriptum and Cíbola, in order to reach methodological conclusions applicable to any future or incipient projects - such as the Oralia diacrónica del español (ODE) corpus, currently being prepared at the University of Granada. The analysis shows that while there are no appreciable differences in the rigor and criteria applied to document transcription, there does not seem to be any agreement as to the way to process and structure the information - textual as well as meta-textual. This paper will argue for the usefulness of adopting a standardized model based on the XML markup language, following the TEI consortium guidelines for the codification and labelling of historical corpora. This model will make it possible to integrate the different corpora and, more importantly, to provide easier user access to the information.

Palavras-chave : History of the Spanish language; diachronic corpora; corpus linguistics; XML; orality in written texts.

        · resumo em Espanhol     · texto em Espanhol     · Espanhol ( pdf )