Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/www2012/poster/128> ?p ?o. }
Showing items 1 to 18 of
18
with 100 items per page.
- 128 creator adi-littman.
- 128 creator ari-rappoport.
- 128 creator oren-tsur.
- 128 type InProceedings.
- 128 label "Scalable Multi Stage Clustering of Tagged Micro-Messages".
- 128 sameAs 128.
- 128 abstract "The growing popularity of microblogging backed by services like Twitter, Facebook, Google+ and LinkedIn, raises the challenge of clustering short and extremely sparse documents. In this work we propose SMSC -- a scalable, accurate and efficient multi stage clustering algorithm. Our algorithm leverages users practice of adding tags to some messages by bootstrapping over virtual non sparse documents. We experiment on a large corpus of tweets from Twitter, and evaluate results against a gold-standard classification validated by seven clustering evaluation measures (information theoretic, paired and greedy). Results show that the algorithm presented is both accurate and efficient, significantly outperforming other algorithms. Under reasonable practical assumptions, our algorithm scales up sublinearly in time.".
- 128 hasAuthorList authorList.
- 128 isPartOf proceedings.
- 128 isPartOf proceedings.
- 128 keyword "Twitter".
- 128 keyword "clustering".
- 128 keyword "hashtags".
- 128 keyword "micro messages".
- 128 keyword "microblogging".
- 128 keyword "scalability".
- 128 keyword "short documents".
- 128 title "Scalable Multi Stage Clustering of Tagged Micro-Messages".