A Statistical and Schema Independent Approach to Identify Equivalent Properties on Linked Data

TitleA Statistical and Schema Independent Approach to Identify Equivalent Properties on Linked Data
Publication TypeConference Proceedings
Year of Publication2013
AuthorsKalpa Gunaratna, Krishnaprasad Thirunarayan, Prateek Jain, Amit Sheth, Sanjaya Wijeratne
Conference Name9th International Conference on Semantic Systems (I-SEMANTICS)
Pagination33-40
Conference LocationGraz, Austria
KeywordsLinked Open Data, Property Alignment, Relationship Identication, Statistical Equivalence
Abstract

Linked Open Data (LOD) cloud has gained significant attention in the Semantic Web community recently. Currently it consists of approximately 295 interlinked datasets with over 50 billion triples including 500 million links, and continues to expand in size. This vast source of structured information has the potential to have a significant impact on knowledge-based applications. However, a key impediment to the use of LOD cloud is limited support for data integration tasks over concepts, instances, and properties. Efforts to address this limitation over properties have focused on matching data-type properties across datasets; however,matching of object-type properties has not received similar attention. We present an approach that can automatically match object-type properties across linked datasets, primarily exploiting and bootstrapping from entity co-reference links such as owl:sameAs. Our evaluation, using sample instance sets taken from Freebase, DBpedia, LinkedMDB, and DBLP datasets covering multiple domains shows that our approach matches properties with high precision and recall (on average, F measure gain of 57% - 78%).

DOI10.1145/2506182.2506187