Matches in ESWC 2020 for { ?s ?p ?o. }
- Adriane_Chapman holdsRole Author.78.3.
- Elena_Simperl type Person.
- Elena_Simperl name "Elena Simperl".
- Elena_Simperl label "Elena Simperl".
- Elena_Simperl holdsRole Author.78.4.
- Paper.79 type SubmissionsPaper.
- Paper.79 label "Generating knowledge graphs from unstructured texts: Experiences in the eCommerce field".
- Paper.79 title "Generating knowledge graphs from unstructured texts: Experiences in the eCommerce field".
- Paper.79 issued "2001-12-03T15:04:00.000Z".
- Paper.79 authorList b0_g209.
- Paper.79 submission Paper.79.
- Paper.79 track Track.In-Use%20Track.
- b0_g209 first Author.79.1.
- b0_g209 rest b0_g210.
- Author.79.1 type RoleDuringEvent.
- Author.79.1 label "Lucas Ramos, 1st Author for Paper 79".
- Author.79.1 withRole PublishingRole.
- Author.79.1 isHeldBy Lucas_Ramos.
- b0_g210 first Author.79.2.
- b0_g210 rest b0_g211.
- Author.79.2 type RoleDuringEvent.
- Author.79.2 label "Victor Hochgreb, 2nd Author for Paper 79".
- Author.79.2 withRole PublishingRole.
- Author.79.2 isHeldBy Victor_Hochgreb.
- b0_g211 first Author.79.3.
- b0_g211 rest nil.
- Author.79.3 type RoleDuringEvent.
- Author.79.3 label "Julio Cesar Dos Reis, 3rd Author for Paper 79".
- Author.79.3 withRole PublishingRole.
- Author.79.3 isHeldBy Julio_Cesar_Dos_Reis.
- Lucas_Ramos type Person.
- Lucas_Ramos name "Lucas Ramos".
- Lucas_Ramos label "Lucas Ramos".
- Lucas_Ramos holdsRole Author.79.1.
- Victor_Hochgreb type Person.
- Victor_Hochgreb name "Victor Hochgreb".
- Victor_Hochgreb label "Victor Hochgreb".
- Victor_Hochgreb holdsRole Author.79.2.
- Julio_Cesar_Dos_Reis type Person.
- Julio_Cesar_Dos_Reis name "Julio Cesar Dos Reis".
- Julio_Cesar_Dos_Reis label "Julio Cesar Dos Reis".
- Julio_Cesar_Dos_Reis holdsRole Author.79.3.
- Julio_Cesar_Dos_Reis holdsRole Author.81.2.
- Julio_Cesar_Dos_Reis holdsRole Author.82.2.
- Author.81.2 type RoleDuringEvent.
- Author.81.2 label "Julio Cesar Dos Reis, 2nd Author for Paper 81".
- Author.81.2 withRole PublishingRole.
- Author.81.2 isHeldBy Julio_Cesar_Dos_Reis.
- Author.82.2 type RoleDuringEvent.
- Author.82.2 label "Julio Cesar Dos Reis, 2nd Author for Paper 82".
- Author.82.2 withRole PublishingRole.
- Author.82.2 isHeldBy Julio_Cesar_Dos_Reis.
- Author.80.1 type RoleDuringEvent.
- Author.80.1 label "Anna Primpeli, 1st Author for Paper 80".
- Author.80.1 withRole PublishingRole.
- Author.80.1 isHeldBy Anna_Primpeli.
- b0_g213 first Author.80.2.
- b0_g213 rest b0_g214.
- Author.80.2 type RoleDuringEvent.
- Author.80.2 label "Christian Bizer, 2nd Author for Paper 80".
- Author.80.2 withRole PublishingRole.
- Author.80.2 isHeldBy Christian_Bizer.
- b0_g214 first Author.80.3.
- b0_g214 rest nil.
- Author.80.3 type RoleDuringEvent.
- Author.80.3 label "Margret Keuper, 3rd Author for Paper 80".
- Author.80.3 withRole PublishingRole.
- Author.80.3 isHeldBy Margret_Keuper.
- Anna_Primpeli type Person.
- Anna_Primpeli name "Anna Primpeli".
- Anna_Primpeli label "Anna Primpeli".
- Anna_Primpeli holdsRole Author.80.1.
- Anna_Primpeli holdsRole Author.170.2.
- Author.170.2 type RoleDuringEvent.
- Author.170.2 label "Anna Primpeli, 2nd Author for Paper 170".
- Author.170.2 withRole PublishingRole.
- Author.170.2 isHeldBy Anna_Primpeli.
- Christian_Bizer type Person.
- Christian_Bizer name "Christian Bizer".
- Christian_Bizer label "Christian Bizer".
- Christian_Bizer holdsRole Author.80.2.
- Christian_Bizer holdsRole Author.170.4.
- Author.170.4 type RoleDuringEvent.
- Author.170.4 label "Christian Bizer, 4th Author for Paper 170".
- Author.170.4 withRole PublishingRole.
- Author.170.4 isHeldBy Christian_Bizer.
- Margret_Keuper type Person.
- Margret_Keuper name "Margret Keuper".
- Margret_Keuper label "Margret Keuper".
- Margret_Keuper holdsRole Author.80.3.
- Paper.80_Review.0 type ReviewVersion.
- Paper.80_Review.0 issued "2001-01-26T17:51:00.000Z".
- Paper.80_Review.0 creator Paper.80_Review.0_Reviewer.
- Paper.80_Review.0 hasRating ReviewRating.2.
- Paper.80_Review.0 hasReviewerConfidence ReviewerConfidence.4.
- Paper.80_Review.0 reviews Paper.80.
- Paper.80_Review.0 issuedAt easychair.org.
- Paper.80_Review.0 issuedFor Conference.
- Paper.80_Review.0 releasedBy Conference.
- Paper.80_Review.0 hasContent "----------- Strong Points ----------- -Domain independent scoring function and threshold heuristic -Evaluation using textual, structured, and “dirty” datasets -Implementation and data available online ----------- Weak Points ----------- -Valley and elbow thresholding have rather similar results Summary: The paper addresses an important shortcoming of active learning for entity resolution i.e. cold start problem. The proposed method deals with the cold start problem by introducing unsupervised matching based on a novel domain-independent threshold heuristic to bootstrap active learning. The unsupervised matching uses a datatype-specific similarity metrics to assign a similarity score to all record pairs. The threshold boundary “t” is then set to a value accounting for the elbow point of the cumulative similarity score distribution of all record pairs. The distance between the threshold values and the aggregated similarity score of each pair serves as confidence weights that are then used to provide the active learning with the most suitable pairs at for every iteration, i.e. more noisy pairs are supposed to affect the warm start less than more confident pairs. The method is evaluated and shows promising results on three different types of data, i.e. structured, textual and dirty. The evaluation experiments are well-designed to measure the influence of the proposed thresholding heuristic, the bootstrapping and the warm start of the active learning. Introduction and Related Work: The introduction as the whole paper is well-written and introduces the problem and its specific. Related work is nicely structured along the three main points of the presented methodology, feature engineering, unsupervised matching, and active learning. Proposed Active Learning Methodology: The authors proposed a two-step methodology. An unsupervised matching step consisting of labeling pairs and assigning confidence weights using the elbow point threshold method. In the second step unsupervised labeled and weighted pairs are used in the warm start pool to bootstrap the training of the active learning random forest classifier and a heterogeneous committee of five different classifiers which includes the random forest classifier. The committee is used to select a pair form the noisy pool to be added to the labeled set after manual labeling in every iteration of the active learning. The labeled set is used to incrementally train new trees of the random forest classifier. This procedure allows for a “fading away” effect of the initial model learned in the warm start phase. Experiments and Evaluation: I appreciate the evaluation procedure aimed to highlight the specifics of the proposed threshold heuristic. Nonetheless, I somehow missed a comparison to other existing approaches of entity resolution. Such a comparison would have put the results in a different light. It would have been also interesting to see an evaluation addressing the effects of blocking non-matches that eventually would justify the selected threshold of 0.2. My main point of concern is, however, the very similar results of the valley and elbow threshold methods. For example, if we look at the deltas to the supervised F1-scores, we have three wins for the elbow two wins for the valley and one for the static threshold. For the unsupervised, we see similar results where for two datasets we have a difference between the two methods in third place after the decimal point. The results are presented in a clear insightful way. The authors, however, may consider using different than yellow color for the “no_boot” results as standard divisions are rather difficult to see in figures 5-7. From my point of view, the authors elegantly combine a set of existing methodologies and techniques in an interesting and innovative way to solve an important problem. The key point of the paper is the elbow point threshold heuristic. Overall, I think this paper presents a sound and valuable contribution to the ESWC community and should to be accepted. =============================== After Rebuttal =============== I keep my original score."".