Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/lrec2008/papers/296> ?p ?o. }
Showing items 1 to 14 of
14
with 100 items per page.
- 296 creator jorge-vivaldi.
- 296 creator rogelio-nazar.
- 296 creator teresa-cabre.
- 296 type InProceedings.
- 296 label "A Suite to Compile and Analyze an LSP Corpus".
- 296 sameAs 296.
- 296 abstract "This paper presents a series of tools for the extraction of specialized corpora from the web and its subsequent analysis mainly with statistical techniques. It is an integrated system of original as well as standard tools and has a modular conception that facilitates its re-integration on different systems. The first part of the paper describes the original techniques, which are devoted to the categorization of documents as relevant or irrelevant to the corpus under construction, considering relevant a specialized document of the selected technical domain. Evaluation figures are provided for the original part, but not for the second part involving the analysis of the corpus, which is composed of algorithms that are well known in the field of Natural Language Processing, such as Kwic search, measures of vocabulary richness, the sorting of n-grams by frequency of occurrence or by measures of statistical association, distribution or similarity.".
- 296 hasAuthorList authorList.
- 296 hasTopic Linguistics.
- 296 isPartOf proceedings.
- 296 keyword "Corpus (creation, annotation, etc.)".
- 296 keyword "LR web services".
- 296 keyword "Statistical methods".
- 296 title "A Suite to Compile and Analyze an LSP Corpus".