Matches in ScholarlyData for { <https://w3id.org/scholarlydata/inproceedings/www2010/paper/main/638> ?p ?o. }
Showing items 1 to 13 of
13
with 100 items per page.
- 638 creator eyal-oren.
- 638 creator frank-van-harmelen.
- 638 creator spyros-kotoulas.
- 638 type InProceedings.
- 638 label "Mind the data skew: Distributed inferencing by speeddating in elastic regions".
- 638 sameAs 638.
- 638 abstract "Semantic Web data exhibits very skewed frequency distribu- tions among terms. Efficient large-scale distributed reasoning methods should maintain load-balance in the face of such highly-skewed distri- bution of input data. We show that term-based partitioning, used by most distributed reasoning approaches, has limited scalability due to load-balancing problems. We address this problem with a method for data distribution based on clustering in elastic regions. Instead of assigning data to fixed peers, data flows semi-randomly in the network. Data items “speed-date” while being temporarily collocated in the same peer. We introduce a bias in the routing to allow semantically clustered neighborhoods to emerge. Our approach is self-organising, efficient and does not require any central coordination. We have implemented this method on the MaRVIN platform and have performed experiments on large real-world datasets, using a cluster of up to 64 nodes. We compute the RDFS closure over different datasets and show that our clustering algorithm drastically reduces computation time, calculating the RDFS closure of 200 million triples in 7.2 minutes.".
- 638 hasAuthorList authorList.
- 638 isPartOf proceedings.
- 638 keyword "Large-scale approaches for generating".
- 638 keyword "h".
- 638 keyword "ling Linked Open Data".
- 638 title "Mind the data skew: Distributed inferencing by speeddating in elastic regions".