Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/lrec2008/papers/109> ?p ?o. }
Showing items 1 to 12 of
12
with 100 items per page.
- 109 creator jack-halpern.
- 109 type InProceedings.
- 109 label "Exploiting Lexical Resources for Disambiguating CJK and Arabic Orthographic Variants".
- 109 sameAs 109.
- 109 abstract "The orthographical complexities of Chinese, Japanese, Korean (CJK) and Arabic pose a special challenge to developers of NLP applications. These difficulties are exacerbated by the lack of a standardized orthography in these languages, especially the highly irregular Japanese orthography and the ambiguities of the Arabic script. This paper focuses on CJK and Arabic orthographic variation and provides a brief analysis of the linguistic issues. The basic premise is that statistical methods by themselves are inadequate, and that linguistic knowledge supported by large-scale lexical databases should play a central role in achieving high accuracy in disambiguating and normalizing orthographic variants.".
- 109 hasAuthorList authorList.
- 109 hasTopic Linguistics.
- 109 isPartOf proceedings.
- 109 keyword "Lexicon, lexical database".
- 109 keyword "Machine Translation, SpeechToSpeech Translation".
- 109 keyword "Morphology".
- 109 title "Exploiting Lexical Resources for Disambiguating CJK and Arabic Orthographic Variants".