Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/lrec2008/papers/538> ?p ?o. }
Showing items 1 to 15 of 15 with 100 items per page.
- 538 creator christopher-brewster.
- 538 creator fabio-ciravegna.
- 538 creator jose-iria.
- 538 creator ziqi-zhang.
- 538 type InProceedings.
- 538 label "A Comparative Evaluation of Term Recognition Algorithms".
- 538 sameAs 538.
- 538 abstract "Automatic Term recognition (ATR) is a fundamental processing step preceding more complex tasks such as semantic search and ontology learning. From a large number of methodologies available in the literature only a few are able to handle both single and multi-word terms. In this paper we present a comparison of five such algorithms and propose a combined approach using a voting mechanism. We evaluated the six approaches using two different corpora and show how the voting algorithm performs best on one corpus (a collection of texts from Wikipedia) and less well using the Genia corpus (a standard life science corpus). This indicates that choice and design of corpus has a major impact on the evaluation of term recognition algorithms. Our experiments also showed that single-word terms can be equally important and occupy a fairly large proportion in certain domains. As a result, algorithms that ignore single-word terms may cause problems to tasks built on top of ATR. Effective ATR systems also need to take into account both the unstructured text and the structured aspects and this means information extraction techniques need to be integrated into the term recognition process.".
- 538 hasAuthorList authorList.
- 538 hasTopic Linguistics.
- 538 isPartOf proceedings.
- 538 keyword "Evaluation methodologies".
- 538 keyword "MultiWord Expressions & Collocations".
- 538 keyword "Text mining".
- 538 title "A Comparative Evaluation of Term Recognition Algorithms".
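
The listing above is the result of matching the triple pattern `{ <paper> ?p ?o. }`: the subject is fixed, and the predicate and object are variables. A minimal Python sketch of that kind of matching follows. It is an in-memory illustration only, not how ScholarlyData serves the page. The names `TRIPLES` and `match` are hypothetical, and predicates and objects are kept in the shortened form shown in the listing because the full URIs are not given here.

```python
from typing import List, Optional, Tuple

Triple = Tuple[str, str, str]

PAPER = "https://w3id.org/scholarlydata/inproceedings/lrec2008/papers/538"

# A subset of the triples listed above. Predicates and objects use the
# shortened labels from the listing (assumption: full URIs are not shown).
TRIPLES: List[Triple] = [
    (PAPER, "creator", "christopher-brewster"),
    (PAPER, "creator", "fabio-ciravegna"),
    (PAPER, "creator", "jose-iria"),
    (PAPER, "creator", "ziqi-zhang"),
    (PAPER, "type", "InProceedings"),
    (PAPER, "keyword", "Text mining"),
]


def match(triples: List[Triple],
          s: Optional[str] = None,
          p: Optional[str] = None,
          o: Optional[str] = None) -> List[Triple]:
    """Return the triples that fit the pattern; None plays the role of a variable."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


# Equivalent in spirit to { <paper> ?p ?o. }: subject bound, the rest free.
all_about_paper = match(TRIPLES, s=PAPER)

# Narrowing the pattern by also binding the predicate.
creators = [obj for _, _, obj in match(TRIPLES, s=PAPER, p="creator")]
```

Binding more positions of the pattern narrows the result, which is why fixing only the subject returns every statement about the paper.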