Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/lrec2008/papers/717> ?p ?o. }
Showing items 1 to 13 of
13
with 100 items per page.
- 717 creator fabio-ciravegna.
- 717 creator jonathan-butters.
- 717 type InProceedings.
- 717 label "Using Similarity Metrics For Terminology Recognition".
- 717 sameAs 717.
- 717 abstract "In this paper we present an approach to terminology recognition whereby a sublanguage term (e.g. an aircraft engine component term extracted from a maintenance log) is matched to its corresponding term from a pre-defined list (such as a taxonomy representing the official break-down of the engine). Terminology recognition is addressed as a classification task whereby the extracted term is associated to one or more potential terms in the official description list via the application of string similarity metrics. The solution described in the paper uses dynamically computed similarity cut-off thresholds calculated on the basis of modeling a noise curve. Dissimilar string matches form a Gaussian distributed noise curve that can be identified and extracted leaving only mostly similar string matches. Dynamically calculated thresholds are preferable over fixed similarity thresholds as fixed thresholds are inherently imprecise, that is, there is no similarity boundary beyond which any two strings always describe the same concept.".
- 717 hasAuthorList authorList.
- 717 hasTopic Linguistics.
- 717 isPartOf proceedings.
- 717 keyword "Information Extraction, Information Retrieval".
- 717 keyword "Statistical methods".
- 717 keyword "Tools, systems, applications".
- 717 title "Using Similarity Metrics For Terminology Recognition".