Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/www2007/paper/main/753> ?p ?o. }
Showing items 1 to 9 of
9
with 100 items per page.
- 753 creator maxim-gurevich.
- 753 creator ziv-bar-yossef.
- 753 type InProceedings.
- 753 label "Efficient Search Engine Measurements".
- 753 sameAs 753.
- 753 abstract "We address the problem of measuring relevance neutral search quality metrics, like corpus size, index freshness, and density of duplicates in the index. The recently proposed estimators for such metrics [Bar-Yossef and Gurevich, WWW2006][Broder et al, CIKM 2006] suffer from significant bias and/or poor performance, due to inaccurate approximation of the so called ``document degrees''.<br /><br /> We present two new estimators that are able to overcome the bias introduced by approximate degrees. Our estimators are based on a careful implementation of an approximate importance sampling procedure. Comprehensive theoretical and empirical analysis of the estimators demonstrates that they have essentially no bias even in situations where document degrees are poorly approximated.<br /><br /> Building on an idea from [Broder et al, CIKM 2006], we discuss Rao-Blackwellization as a generic method for reducing variance in search engine estimators. We show that Rao-Blackwellizing our estimators results in significant performance improvements, while not compromising quality.".
- 753 hasAuthorList authorList.
- 753 isPartOf proceedings.
- 753 title "Efficient Search Engine Measurements".