Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/lrec2008/papers/706> ?p ?o. }
Showing items 1 to 14 of 14 with 100 items per page.
- 706 creator ann-bies.
- 706 creator mohamed-maamouri.
- 706 creator seth-kulick.
- 706 type InProceedings.
- 706 label "Diacritic Annotation in the Arabic Treebank and its Impact on Parser Evaluation".
- 706 sameAs 706.
- 706 abstract "The Arabic Treebank (ATB), released by the Linguistic Data Consortium, contains multiple annotation files for each source file, due in part to the role of diacritic inclusion in the annotation process. The data is made available in both vocalized and unvocalized forms, with and without the diacritic marks, respectively. Much parsing work with the ATB has used the unvocalized form, on the basis that it more closely represents the real-world situation. We point out some problems with this usage of the unvocalized data and explain why the unvocalized form does not in fact represent real-world data. This is due to some aspects of the treebank annotation that to our knowledge have never before been published.".
- 706 hasAuthorList authorList.
- 706 hasTopic Linguistics.
- 706 isPartOf proceedings.
- 706 keyword "Corpus (creation, annotation, etc.)".
- 706 keyword "Morphology".
- 706 keyword "Parsing Systems".
- 706 title "Diacritic Annotation in the Arabic Treebank and its Impact on Parser Evaluation".
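
The listing above is the result of matching the pattern `{ <subject> ?p ?o. }`, that is, a fixed subject with the predicate and object left as variables. A minimal sketch of that matching over an in-memory list may help illustrate it. The `match` helper and the `TRIPLES` list are hypothetical names introduced here for illustration only; they are not part of ScholarlyData or any library, and the triples use the abbreviated names shown on this page rather than full URIs.

```python
from typing import List, Optional, Tuple

Triple = Tuple[str, str, str]

# A subset of the triples listed above, using the page's abbreviated names
# (assumption: the full URIs are omitted for brevity).
TRIPLES: List[Triple] = [
    ("706", "creator", "ann-bies"),
    ("706", "creator", "mohamed-maamouri"),
    ("706", "creator", "seth-kulick"),
    ("706", "type", "InProceedings"),
    ("706", "keyword", "Morphology"),
    ("706", "keyword", "Parsing Systems"),
]


def match(
    triples: List[Triple],
    s: Optional[str] = None,
    p: Optional[str] = None,
    o: Optional[str] = None,
) -> List[Triple]:
    """Return the triples that fit the pattern; None plays the role of a variable."""
    return [
        t
        for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


# Equivalent in spirit to { 706 ?p ?o. }: every triple about the paper.
all_about_paper = match(TRIPLES, s="706")

# Narrowing the pattern to { 706 creator ?o. } yields only the authors.
creators = [obj for _, _, obj in match(TRIPLES, s="706", p="creator")]
```

Fixing more positions of the pattern narrows the result, while leaving all three as `None` returns every triple.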