Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/www2010/paper/main/66> ?p ?o. }
Showing items 1 to 15 of 15 with 100 items per page.
- 66 creator adwait-ratnaparkhi.
- 66 creator dragomir-yankov.
- 66 creator scott-gaffney.
- 66 creator suju-rajan.
- 66 type InProceedings.
- 66 label "A Large Scale Active Learning System for Topical Categorization on the Web".
- 66 sameAs 66.
- 66 abstract "Many web applications such as ad matching systems, vertical search engines, and page categorization systems require the identification of a particular type or class of pages on the Web. The sheer number and diversity of the pages on the web, however, makes the problem of obtaining a good sample of the class of interest hard. In this paper, we describe a successfully deployed end-to-end system that starts from a manually collected biased training sample and makes use of several state-of-the-art machine learning systems working in tandem, including a powerful active learning component, in order to achieve a good classification system. The performance of the system is evaluated on the traffic to a real-world ad-matching platform and is shown to have significant reduction in editorial effort and labeling time, while maintaining pre-specified performance criteria.".
- 66 hasAuthorList authorList.
- 66 isPartOf proceedings.
- 66 keyword "Negative content filtering".
- 66 keyword "porn".
- 66 keyword "spam".
- 66 keyword "viruses".
- 66 title "A Large Scale Active Learning System for Topical Categorization on the Web".
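
The triple pattern at the top of this page, `{ <subject> ?p ?o. }`, fixes the subject and leaves the predicate and object as variables, so every listed item is one predicate/object pair for paper 66. As a minimal sketch, the same pattern could be written as a SPARQL `SELECT` query. The function name `triple_pattern_to_sparql` is hypothetical, the `LIMIT` of 100 only mirrors the page size shown above, and no endpoint is assumed: the code builds the query string and does not send it anywhere.

```python
def triple_pattern_to_sparql(subject_iri: str, limit: int = 100) -> str:
    """Build a SPARQL SELECT query for the pattern { <subject_iri> ?p ?o. }.

    Hypothetical helper for illustration; it only formats a string.
    """
    # Reject characters that are not allowed inside a SPARQL IRI reference.
    forbidden = set('<>"{}|^`\\ ')
    if any(ch in forbidden for ch in subject_iri):
        raise ValueError(f"not a valid IRI reference: {subject_iri!r}")
    if limit < 1:
        raise ValueError("limit must be a positive integer")
    # Subject is fixed; predicate (?p) and object (?o) stay as variables.
    return f"SELECT ?p ?o WHERE {{ <{subject_iri}> ?p ?o . }} LIMIT {limit}"


# Subject IRI taken from the pattern shown at the top of this page.
PAPER = "https://w3id.org/scholarlydata/inproceedings/www2010/paper/main/66"

query = triple_pattern_to_sparql(PAPER)
print(query)
```

Each row such a query returns would correspond to one item in the list above, for example the four `creator` values or the four `keyword` literals.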