Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/lrec2008/papers/874> ?p ?o. }
Showing items 1 to 13 of 13 with 100 items per page.
- 874 creator erin-fitzgerald.
- 874 creator frederick-jelinek.
- 874 type InProceedings.
- 874 label "Linguistic Resources for Reconstructing Spontaneous Speech Text".
- 874 sameAs 874.
- 874 abstract "The output of a speech recognition system is not always ideal for subsequent downstream processing, in part because speakers themselves often make mistakes. A system would accomplish speech reconstruction of its spontaneous speech input if its output were to represent, in flawless, fluent, and content-preserving English, the message that the speaker intended to convey. These cleaner speech transcripts would allow for more accurate language processing as needed for NLP tasks such as machine translation and conversation summarization, which often rely on grammatical input. Recognizing that supervised statistical methods to identify and transform ill-formed areas of the transcript will require richly labeled resources, we have built the Spontaneous Speech Reconstruction corpus. This small corpus of reconstructed and aligned conversational telephone speech transcriptions for the Fisher conversational telephone speech corpus (Strassel and Walker, 2004) was annotated on several levels including string transformations and predicate-argument structure, and will be shared with the linguistic research community.".
- 874 hasAuthorList authorList.
- 874 hasTopic Linguistics.
- 874 isPartOf proceedings.
- 874 keyword "Corpus (creation, annotation, etc.)".
- 874 keyword "Dialogue & Natural Interactivity".
- 874 keyword "Paraphrasing".
- 874 title "Linguistic Resources for Reconstructing Spontaneous Speech Text".