Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/lrec2008/papers/858> ?p ?o. }
Showing items 1 to 15 of
15
with 100 items per page.
- 858 creator andras-kornai.
- 858 creator daniel-varga.
- 858 creator peter-halacsy.
- 858 creator peter-nemeth.
- 858 type InProceedings.
- 858 label "Parallel Creation of Gigaword Corpora for Medium Density Languages - an Interim Report".
- 858 sameAs 858.
- 858 abstract "For increased speed in developing gigaword language resources for medium resource density languages we integrated several FOSS tools in the HUN* toolkit. While the speed and efficiency of the resulting pipeline has surpassed our expectations, our experience in developing LDC-style resource packages for Uzbek and Kurdish makes clear that neither the data collection nor the subsequent processing stages can be fully automated.".
- 858 hasAuthorList authorList.
- 858 hasTopic Linguistics.
- 858 isPartOf proceedings.
- 858 keyword "Corpus (creation, annotation, etc.)".
- 858 keyword "Multilinguality".
- 858 keyword "Tools, systems, applications".
- 858 title "Parallel Creation of Gigaword Corpora for Medium Density Languages - an Interim Report".