Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/lrec2008/papers/631> ?p ?o. }
Showing items 1 to 14 of
14
with 100 items per page.
- 631 creator kristina-vuckovic.
- 631 creator marko-tadic.
- 631 creator zdravko-dovedan.
- 631 type InProceedings.
- 631 label "Rule-Based Chunker for Croatian".
- 631 sameAs 631.
- 631 abstract "In this paper we discuss a rule-based approach to chunking sentences in Croatian, implemented using local regular grammars within the NooJ development environment. We describe the rules and their implementation by regular grammars and at the same time show that in NooJ environment it is extremely easy to fine tune their different sub-rules. Since Croatian has strong morphosyntactic features that are shared between most or all elements of a chunk, the rules are built by taking these features into account and strongly relying on them. For the evaluation of our chunker we used a extracted set of manually annotated sentences from 100 kw MSD/tagged and disambiguated Croatian corpus. Our chunker performed the best on VP-chunks (F: 97.01), while NP-chunks (F: 92.31) and PP-chunks (F: 83.08) were of lower quality. The results are comparable to chunker performance of CoNLL-2000 shared task of chunking.".
- 631 hasAuthorList authorList.
- 631 hasTopic Linguistics.
- 631 isPartOf proceedings.
- 631 keyword "Grammars".
- 631 keyword "MultiWord Expressions & Collocations".
- 631 keyword "Parsing Systems".
- 631 title "Rule-Based Chunker for Croatian".