Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/www2007/paper/main/342> ?p ?o. }
Showing items 1 to 10 of
10
with 100 items per page.
- 342 creator ramakrishnan-srikant.
- 342 creator roberto-bayardo.
- 342 creator yiming-ma.
- 342 type InProceedings.
- 342 label "Scaling Up All-Pairs Similarity Search".
- 342 sameAs 342.
- 342 abstract "Given a large collection of sparse vector data in a high dimensional space, we investigate the problem of finding all pairs of vectors whose similarity score (as determined by a function such as cosine distance) is above a given threshold. We propose novel optimization and indexing techniques for this problem, resulting in an algorithm that is both faster and simpler than the previous state-of-the-art approaches. We demonstrate the effectiveness of our algorithm on the public DBLP dataset, and on two real-world web applications: generating recommendations for the Orkut social network, and computing pairs of similar queries from search snippet data among the 5 million most frequently issued Google queries. Our algorithm is between 5 times to 20 times faster than previous algorithms on these datasets.".
- 342 hasAuthorList authorList.
- 342 isPartOf proceedings.
- 342 title "Scaling Up All-Pairs Similarity Search".