Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/www2010/paper/main/405> ?p ?o. }
Showing items 1 to 11 of
11
with 100 items per page.
- 405 creator andrew-tomkins.
- 405 creator flavio-chierichetti.
- 405 creator ravi-kumar.
- 405 type InProceedings.
- 405 label "Max-Cover in Map-Reduce".
- 405 sameAs 405.
- 405 abstract "The NP-hard \mkc problem requires selecting $k$ sets from a collection so as to maximize the size of the union. This classic problem occurs commonly in many settings in web search and advertising. For moderately-sized instances, a greedy algorithm gives an approximation of $(1-1/e)$. However, the greedy algorithm requires updating scores of arbitrary elements after each step, and hence becomes intractable for large datasets. We give the first max cover algorithm designed for today's large-scale commodity clusters. Our algorithm has provably almost the same approximation as greedy, but runs much faster. Furthermore, it can be easily expressed in the \mr programming paradigm, and requires only polylogarithmically many passes over the data. Our experiments on five large problem instances show that our algorithm is practical and can achieve good speedups compared to the sequential greedy algorithm.".
- 405 hasAuthorList authorList.
- 405 isPartOf proceedings.
- 405 keyword "Efficient algorithms for large-scale analysis".
- 405 title "Max-Cover in Map-Reduce".