Matches in UGent Biblio for { <https://biblio.ugent.be/publication/4158367#aggregation> ?p ?o. }
Showing items 1 to 34 of 34 with 100 items per page.
- aggregation classification "C1".
- aggregation creator person.
- aggregation creator person.
- aggregation creator person.
- aggregation creator person.
- aggregation creator person.
- aggregation date "2013".
- aggregation format "application/pdf".
- aggregation hasFormat 4158367.bibtex.
- aggregation hasFormat 4158367.csv.
- aggregation hasFormat 4158367.dc.
- aggregation hasFormat 4158367.didl.
- aggregation hasFormat 4158367.doc.
- aggregation hasFormat 4158367.json.
- aggregation hasFormat 4158367.mets.
- aggregation hasFormat 4158367.mods.
- aggregation hasFormat 4158367.rdf.
- aggregation hasFormat 4158367.ris.
- aggregation hasFormat 4158367.txt.
- aggregation hasFormat 4158367.xls.
- aggregation hasFormat 4158367.yaml.
- aggregation isPartOf urn:issn:1313-8502.
- aggregation language "eng".
- aggregation publisher "INCOMA".
- aggregation rights "I have transferred the copyright for this publication to the publisher".
- aggregation subject "Languages and Literatures".
- aggregation title "Normalization of Dutch user-generated content".
- aggregation abstract "This paper describes a phrase-based machine translation approach to normalize Dutch user-generated content (UGC). We compiled a corpus of three different social media genres (text messages, message board posts and tweets) to have a sample of this recent domain. We describe the various characteristics of this noisy text material and explain how it has been manually normalized using newly developed guidelines. For the automatic normalization task we focus on text messages, and find that a cascaded SMT system where a token-based module is followed by a translation at the character level gives the best word error rate reduction. After these initial experiments, we investigate the system’s robustness on the complete domain of UGC by testing it on the other two social media genres, and find that the cascaded approach performs best on these genres as well. To our knowledge, we deliver the first proof-of-concept system for Dutch UGC normalization, which can serve as a baseline for future work.".
- aggregation authorList BK280585.
- aggregation endPage "188".
- aggregation startPage "179".
- aggregation aggregates 4158368.
- aggregation isDescribedBy 4158367.
- aggregation similarTo LU-4158367.
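
The listing above is the result of the triple pattern `{ <https://biblio.ugent.be/publication/4158367#aggregation> ?p ?o. }`: the subject is fixed, while `?p` and `?o` are variables that match any predicate and object. A minimal sketch of that matching logic follows. It assumes an in-memory list of triples and a hypothetical `match` helper; it is not how the UGent Biblio server is implemented. Predicates use the short labels shown on this page, because the full predicate IRIs are not given here, and the `urn:example:other` triple is invented to show a non-match.

```python
from typing import Optional

Triple = tuple[str, str, str]

SUBJECT = "https://biblio.ugent.be/publication/4158367#aggregation"

# A few triples copied from the listing above. Predicates are the short
# labels shown on the page (assumption: the full IRIs are not listed here).
TRIPLES: list[Triple] = [
    (SUBJECT, "date", "2013"),
    (SUBJECT, "language", "eng"),
    (SUBJECT, "publisher", "INCOMA"),
    # Hypothetical triple with a different subject, included only so that
    # the pattern has something to filter out.
    ("urn:example:other", "date", "1999"),
]


def match(
    triples: list[Triple],
    s: Optional[str] = None,
    p: Optional[str] = None,
    o: Optional[str] = None,
) -> list[Triple]:
    """Return the triples matching a pattern; None acts as a variable."""
    return [
        t
        for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


# Equivalent of { <subject> ?p ?o. }: fix the subject, leave ?p and ?o open.
matches = match(TRIPLES, s=SUBJECT)
for _, predicate, obj in matches:
    print(f"- aggregation {predicate} {obj!r}.")
```

Fixing more positions narrows the result: `match(TRIPLES, s=SUBJECT, p="publisher")` keeps only the publisher triple.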