DESIGNING AND DEVELOPING A MULTIDIALECTAL ORAL CORPUS. THE AMERESCO CORPUS
DOI:
https://doi.org/10.7203/Normas.v9i1.16007
Abstract
This paper describes the protocol used to build the Ameresco corpus (America Colloquial Spanish). Collecting a corpus that contains more than one dialect poses a series of challenges. On the one hand, managing a large number of external teams requires a sound methodology. On the other hand, the methodology should be in line with the project's goals and with essential corpus design features, such as recording conditions, the transcription and labelling system, and the anonymisation of sensitive data. All these aspects should be chosen thoughtfully so that the quality standards set by the scientific community are met.
License
This article is licensed under Creative Commons Attribution 3.0.
Authors agree with the following statements:
- The authors retain copyright and grant the journal the right of first publication, with the work licensed under a Creative Commons Attribution License that allows others to share it with an acknowledgment of its authorship and its initial publication in this journal.
- Authors may separately enter into additional agreements for the non-exclusive distribution of the version of the work published in the journal (for example, placing it in an institutional repository or publishing it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are allowed and encouraged to disseminate their work electronically (for example, in institutional repositories or on their own website) before and during the submission process, as it can lead to productive scientific exchanges.