Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/wac7/data.semanticweb.org/workshop/wac7/2012/paper/3> ?p ?o. }
Showing items 1 to 13 of
13
with 100 items per page.
- 3 creator marco-brunello.
- 3 type InProceedings.
- 3 label "Understanding the composition of parallel corpora from the web".
- 3 sameAs 3.
- 3 abstract "Although it is fundamental to have a good fit between the text typology of training data and to-be-translated data in machine translation, there is a lack of studies on analysing parallel data under this point of view. This paper describes some studies made with the aim of understanding the composition of parallel corpora, in particular by using topic modeling.".
- 3 hasAuthorList authorList.
- 3 isPartOf proceedings.
- 3 keyword "camera-ready version".
- 3 keyword "machine translation".
- 3 keyword "parallel corpora".
- 3 keyword "topic modeling".
- 3 keyword "web as corpus".
- 3 title "Understanding the composition of parallel corpora from the web".