Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/www2007/paper/main/91> ?p ?o. }
Showing items 1 to 9 of 9 with 100 items per page.
- 91 creator hung-chim.
- 91 creator xiaotie-deng.
- 91 type InProceedings.
- 91 label "A New Suffix Tree Similarity Measure for Document Clustering".
- 91 sameAs 91.
- 91 abstract "In this paper, we propose a new similarity measure to compute the pairwise similarity of text-based documents based on suffix tree document model. By applying the new suffix tree similarity measure in Group-average Agglomerative Hierarchical Clustering (GAHC) algorithm, we developed a new suffix tree document clustering algorithm (NSTC). Our experimental results on two standard document clustering benchmark corpora, OHSUMED and RCV1, indicate that the new clustering algorithm is a very effective document clustering algorithm. Compared with the results of traditional keyword tfidf similarity measure in the same GAHC algorithm, NSTC achieved an improvement of 51% on the average of F-measure score. Furthermore, we apply the new clustering algorithm in analyzing the Web documents in online forum communities. A topic oriented clustering algorithm is developed to help people in assessing, classifying and searching the Web documents in a large forum community.".
- 91 hasAuthorList authorList.
- 91 isPartOf proceedings.
- 91 title "A New Suffix Tree Similarity Measure for Document Clustering".
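
The pattern at the top of this page, { <subject> ?p ?o. }, fixes the subject and leaves the predicate and object as variables. As a minimal sketch of how such a pattern selects the nine items above, the Python below stores the listing as in-memory tuples and filters them. The names TRIPLES, match and group_by_predicate are hypothetical helpers for illustration, not part of ScholarlyData or of any library. Predicates and objects use the short labels shown on this page rather than full IRIs, and the abstract text is replaced by a placeholder.

```python
from collections import defaultdict

SUBJECT = "https://w3id.org/scholarlydata/inproceedings/www2007/paper/main/91"
TITLE = "A New Suffix Tree Similarity Measure for Document Clustering"

# Assumption: short labels copied from the listing stand in for full IRIs.
# The abstract literal is a placeholder, not the real text.
TRIPLES = [
    (SUBJECT, "creator", "hung-chim"),
    (SUBJECT, "creator", "xiaotie-deng"),
    (SUBJECT, "type", "InProceedings"),
    (SUBJECT, "label", TITLE),
    (SUBJECT, "sameAs", "91"),
    (SUBJECT, "abstract", "(abstract text omitted in this sketch)"),
    (SUBJECT, "hasAuthorList", "authorList"),
    (SUBJECT, "isPartOf", "proceedings"),
    (SUBJECT, "title", TITLE),
]


def match(triples, s=None, p=None, o=None):
    """Return the triples that fit the pattern; None plays the role of a variable."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


def group_by_predicate(triples):
    """Collect objects under each predicate, keeping the listing order."""
    grouped = defaultdict(list)
    for _, predicate, obj in triples:
        grouped[predicate].append(obj)
    return dict(grouped)


# Equivalent of { <SUBJECT> ?p ?o. }: subject fixed, predicate and object free.
matches = match(TRIPLES, s=SUBJECT)
by_predicate = group_by_predicate(matches)

if __name__ == "__main__":
    for predicate, objects in by_predicate.items():
        print(predicate, objects)
```

Grouping by predicate makes it easy to see that creator is the only predicate with more than one value here, and that label and title carry the same string.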