Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/lrec2008/papers/732> ?p ?o. }
Showing items 1 to 12 of
12
with 100 items per page.
- 732 creator anca-dinu.
- 732 type InProceedings.
- 732 label "On Classifying Coherent/Incoherent Romanian Short Texts".
- 732 sameAs 732.
- 732 abstract "In this paper we present and discuss the results of a text coherence experiment performed on a small corpus of Romanian text from a number of alternative high school manuals. During the last 10 years, an abundance of alternative manuals for high school was produced and distributed in Romania. Due to the large amount of material and to the relative short time in which it was produced, the question of assessing the quality of this material emerged; this process relied mostly of subjective human personal opinion, given the lack of automatic tools for Romanian. Debates and claims of poor quality of the alternative manuals resulted in a number of examples of incomprehensible / incoherent paragraphs extracted from such manuals. Our goal was to create an automatic tool which may be used as an indication of poor quality of such texts. We created a small corpus of representative texts from Romanian alternative manuals. We manually classified the chosen paragraphs from such manuals into two categories: comprehensible/coherent text and incomprehensible/incoherent text. We then used different machine learning techniques to automatically classify them in a supervised manner. Our approach is rather simple, but the results are encouraging.".
- 732 hasAuthorList authorList.
- 732 hasTopic Linguistics.
- 732 isPartOf proceedings.
- 732 keyword "Acquisition, Machine Learning".
- 732 keyword "Document Classification, Text categorisation".
- 732 keyword "Semantics".
- 732 title "On Classifying Coherent/Incoherent Romanian Short Texts".