Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/lrec2008/papers/848> ?p ?o. }
Showing items 1 to 15 of
15
with 100 items per page.
- 848 creator christina-thornell.
- 848 creator harald-hammarstroem.
- 848 creator malin-petzell.
- 848 creator torbjoern-westerlund.
- 848 type InProceedings.
- 848 label "Bootstrapping Language Description: the case of Mpiemo (Bantu A, Central African Republic)".
- 848 sameAs 848.
- 848 abstract "Linguists have long been producing grammatical decriptions of yet undescribed languages. This is a time-consuming process, which has already adapted to improved technology for recording and storage. We present here a novel application of NLP techniques to bootstrap analysis of collected data and speed-up manual selection work. To be more precise, we argue that unsupervised induction of morphology and part-of-speech analysis from raw text data is mature enough to produce useful results. Experiments with Latent Semantic Analysis were less fruitful. We exemplify this on Mpiemo, a so-far essentially undescribed Bantu language of the Central African Republic, for which raw text data was available.".
- 848 hasAuthorList authorList.
- 848 hasTopic Linguistics.
- 848 isPartOf proceedings.
- 848 keyword "Acquisition, Machine Learning".
- 848 keyword "Endangered languages".
- 848 keyword "Language modelling".
- 848 title "Bootstrapping Language Description: the case of Mpiemo (Bantu A, Central African Republic)".