After two years of intense work the new version of the corpus has been updated with new data sets, significant improvements in the consistency of annotations and metadata, and a number of minor changes and bug fixes.
This update brings new datasets, substantial improvements to the consistency of annotations and metadata, and a myriad of smaller changes and bug fixes.
DoReCo 2.0 hosts annotated speech data from 53 low-resource and endangered languages from all inhabited continents, inviting cross-linguistic research into phonetics, phonology, and morphology.
DoReCo (Language Documentation Reference Corpus) is jointly edited by Frank Seifart, Ludger Paschen, and Matt Stave. The bulk of the update from v.1.2 to v.2.0 was developed within the AIRAL project at Leibniz-Zentrum Allgemeine Sprachwissenschaft.
More Information: https://doreco.huma-num.fr/