Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/www2012/paper/1014> ?p ?o. }
Showing items 1 to 17 of
17
with 100 items per page.
- 1014 creator christian-schallhart.
- 1014 creator georg-gottlob.
- 1014 creator giorgio-orsi.
- 1014 creator giovanni-grasso.
- 1014 creator tim-furche.
- 1014 creator xiaonan-guo.
- 1014 type InProceedings.
- 1014 label "Forms form Patterns: Reusable Form Understanding".
- 1014 sameAs 1014.
- 1014 abstract "Forms are our gates to the web. They enable us to access the deep content of web sites. Automatic form understanding unlocks this content for applications ranging from crawlers to meta-search engines and is essential for improving usability and accessibility of the web. Form understanding has received surprisingly little attention other than as component in specific applications such as crawlers. No comprehensive approach to form understanding exists and previous works disagree even in the definition of the problem. In this paper, we present OPAL, the first comprehensive approach to form understanding. We identify form labeling and form interpretation as the two main tasks involved in form understanding. On both problems OPAL pushes the state of the art: For form labeling, it combines signals from the text, structure, and visual rendering of a web page, yielding robust characterisations of common design patterns. In extensive experiments on the ICQ and TEL-8 benchmarks and a set of 200 modern web forms OPAL outperforms previous approaches by a significant margin. For form interpretation, we introduce a template language to describe frequent form patterns. These two parts of OPAL combined yield form understanding with near perfect accuracy (> 98%).".
- 1014 hasAuthorList authorList.
- 1014 isPartOf proceedings.
- 1014 keyword "deep web".
- 1014 keyword "form understanding".
- 1014 keyword "template language".
- 1014 keyword "web usability".
- 1014 title "Forms form Patterns: Reusable Form Understanding".