Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/lrec2008/papers/249> ?p ?o. }
Showing items 1 to 14 of
14
with 100 items per page.
- 249 creator andreas-nuernberger.
- 249 creator ernesto-william-de-luca.
- 249 creator lena-grothe.
- 249 type InProceedings.
- 249 label "A Comparative Study on Language Identification Methods".
- 249 sameAs 249.
- 249 abstract "In this paper we present two experiments conducted for comparison of different language identification algorithms. Short words-, frequent words- and n-gram-based approaches are considered and combined with the Ad-Hoc Ranking classification method. The language identification process can be subdivided into two main steps: first a document model is generated for the document and a language model for the language; second the language of the document is determined on the basis of the language model and is added to the document as additional information. In this work we present our evaluation results and discuss the importance of a dynamic value for the out-of-place measure.".
- 249 hasAuthorList authorList.
- 249 hasTopic Linguistics.
- 249 isPartOf proceedings.
- 249 keyword "Document Classification, Text categorisation".
- 249 keyword "Language modelling".
- 249 keyword "Multilinguality".
- 249 title "A Comparative Study on Language Identification Methods".