Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/www2008/paper/509> ?p ?o. }
Showing items 1 to 15 of
15
with 100 items per page.
- 509 creator barbara-poblete.
- 509 creator ricardo-baeza-yates.
- 509 type InProceedings.
- 509 label "Query-Sets: Using Implicit Feedback and Query Patterns to Organize Web Documents".
- 509 sameAs 509.
- 509 abstract "In this paper we present a new document representation model based on implicit user feedback obtained from search engine queries. The main objective of this model is to achieve better results in non-supervised tasks, such as clustering and labeling, through the incorporation of usage data obtained from search engine queries. This type of model allows us to discover the motivations of users when visiting a certain document. The terms used in queries can provide a better choice of features, from the user's point of view, for summarizing the Web pages that were clicked from these queries. In this work we extend and formalize as "query model" an existing but not very well known idea of "query view" for documents representation. Furthermore, we create a novel model based on "frequent query patterns" called the "query-set model". Our evaluation shows that both "query-based" models outperform the vector-space model when used for organizing into topics documents in a website. In our experiments, the query-set model reduces by more than 90% the number of features needed to represent a set of documents and improves by over 90% the quality of the results. We believe that this can be explained because our model chooses better features and provides more accurate labels according to the user's expectations.".
- 509 hasAuthorList authorList.
- 509 hasTopic World_Wide_Web.
- 509 isPartOf proceedings.
- 509 keyword "feature selection".
- 509 keyword "labeling".
- 509 keyword "search engine queries".
- 509 keyword "usage mining".
- 509 keyword "web page organization".
- 509 title "Query-Sets: Using Implicit Feedback and Query Patterns to Organize Web Documents".