Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/lrec2008/papers/798> ?p ?o. }
Showing items 1 to 13 of
13
with 100 items per page.
- 798 creator mats-rooth.
- 798 creator tejaswini-deoskar.
- 798 type InProceedings.
- 798 label "Induction of Treebank-Aligned Lexical Resources".
- 798 sameAs 798.
- 798 abstract "We describe the induction of lexical resources from unannotated corpora that are aligned with treebank grammars, providing a systematic correspondence between features in the lexical resource and a treebank syntactic resource. We first describe a methodology based on parsing technology for augmenting a treebank database with linguistic features. A PCFG containing these features is created from the augmented treebank. We then use a procedure based on the inside-outside algorithm to learn lexical resources aligned with the treebank PCFG from large unannotated corpora. The method has been applied in creating a feature-annotated English treebank based on the Penn Treebank. The unsupervised estimation procedure gives a substantial error reduction (up to 31.6%) on the task of learning the subcategorization preference of novel verbs that are not present in the annotated training sample.".
- 798 hasAuthorList authorList.
- 798 hasTopic Linguistics.
- 798 isPartOf proceedings.
- 798 keyword "Lexicon, lexical database".
- 798 keyword "Parsing Systems".
- 798 keyword "Statistical methods".
- 798 title "Induction of Treebank-Aligned Lexical Resources".