Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/lrec2008/papers/304> ?p ?o. }
Showing items 1 to 16 of
16
with 100 items per page.
- 304 creator miroslav-janicek.
- 304 creator ondrej-bojar.
- 304 creator pavel-ceska.
- 304 creator peter-bena.
- 304 creator zdenek-abokrtsky.
- 304 type InProceedings.
- 304 label "CzEng 0.7: Parallel Corpus with Community-Supplied Translations".
- 304 sameAs 304.
- 304 abstract "This paper describes CzEng 0.7, a new release of Czech-English parallel corpus freely available for research and educational purposes. We provide basic statistics of the corpus and focus on data produced by a community of volunteers. Anonymous contributors manually correct the output of a machine translation (MT) system, generating on average 2000 sentences a month, 70% of which are indeed correct translations. We compare the utility of community-supplied and of professionally translated training data for a baseline English-to-Czech MT system.".
- 304 hasAuthorList authorList.
- 304 hasTopic Linguistics.
- 304 isPartOf proceedings.
- 304 keyword "Corpus (creation, annotation, etc.)".
- 304 keyword "Machine Translation, SpeechToSpeech Translation".
- 304 keyword "Validation of LRs".
- 304 title "CzEng 0.7: Parallel Corpus with Community-Supplied Translations".