You are here

Spatio-Temporal-Thematic Query Processing

This project ended on 08/31/2011.

Project Description Publications Presentations Data Sets

NSF Research Outcome Report

Analytical applications are increasingly exploiting complex relationships among named entities as a powerful analytical tool. Such `connect-the-dots' applications are common in many domains including national security, drug discovery, and medical informatics. Semantic Web Technologies are well suited for this type of analysis. It is often necessary that the analysis process spans across multiple heterogeneous data sources, and ontologies and semantic metadata standards help facilitate aggregation and integration of this content. In addition, standard models for metadata representation on the web, such as Resource Description Framework (RDF), model relationships as first class objects making it very natural to query and analyze entities based on their relationships. Researchers have consequently argued for graph-based querying of RDF, and fundamentally new analytical operators based on the graph structure of RDF have emerged (e.g., semantic associations and subgraph discovery). These operators allow querying for complex relationships among named entities where an ontology provides the context or domain semantics. We use the term semantic analytics to refer to this process of searching and analyzing semantically meaningful connections among named entities. Semantic analytics has been successfully used in a variety of settings, for example identifying conflict of interest, detecting patent infringement and discovering metabolic pathways.

So far, semantic analytics tools have primarily focused on thematic relationships, but spatial and temporal relationships are often critical components in analytical domains. In fact, most entities and events can be described along three dimensions: thematic, spatial and temporal. Consider the following event: Fred Smith moved into the house at 244 Elm Street on November 16, 2007. The thematic dimension describes what is occurring (the person Fred Smith moved to a new residence). The spatial dimension describes where the event occurs (the new residence is located at 244 Elm Street). The temporal dimension describes when the event occurs (the moving event occurred on November 16, 2007). Unfortunately, integrated semantic analytics over all three dimensions is not currently possible because of the following gaps in the state of the art:

  • Current GIS and spatial database technology does not support complex thematic analytics operations. Traditional data models used for GIS excel at modeling and analyzing spatial and temporal relationships among geospatial entities but tend to model the thematic aspects of a given domain as directly attached attributes of geospatial entities. Thematic entities and their relationships are not explicitly and independently represented, making analysis of these relationships difficult.
  • Current semantic analytics technology does not support analysis of spatial and temporal relationships. Semantic analytics research has focused on thematic relationships between entities. Thematic relationships can be explicitly stated in RDF graphs, but many important spatial and temporal relationships (e.g., distance and elapsed time) are implicit and require additional computation. Semantic analytics tools depend on explicit relations and must be extended if they are to use implicit spatial and temporal relations.

We are researching a framework that can bridge these gaps. We propose a very flexible approach for modeling spatial, temporal and thematic (STT) data using Semantic Web data models. In addition, we have developed and implemented two approaches for querying STT data in our model. The first approach is a SQL-based approach that uses user-defined functions for graph pattern based queries involving spatial and temporal components. The second approach defines a query language, SPARQL-ST, that is an extension of SPARQL for spatio-temporal-thematic queries. Both approaches have been prototyped by extending a commercial DBMS.

In addition, demand for systems that can efficiently manage large amounts of Semantic Web data has reached a critical point. This demand is driven to a major extent by the existence of many large, real-world Semantic Web datasets. Some examples of publicly-available datasets include GovTrack (data about activities of US Congress -- 13 million triples), SwetoDBLP (bibliography data focused on Computer Science publications -- 11 million triples), DBPedia (multi-domain data derived from Wikipedia content -- 218 million triples) and UniProt (data describing functional aspects of proteins -- over 1 billion triples). The development of a scalable system for managment of STT Semantic Web data is thus a major component of our research.

Modeling Approach

We model spatio-temporal-thematic data as follows. We incorporate temporal information using Temporal RDF Graphs. Temporal RDF extends the RDF statement from a triple to a quad where the fourth element is the valid time of the RDF statement. Temporal RDF triples are encoded using standard RDF reification (see the figure below). Spatial features are complex and must be properly defined with an ontology. We use an ontology based on the Open Geospatial Consortium (OGC) Geographic Modeling Language (GML) specification for this purpose (see the figure below).

LSDIS Lab LSDIS LabOntology-revised.png

SQL-based Querying Approach

We have developed a set of spatial and temporal query operators for searching and analyzing spatial and temporal relationships between named entities in temporal RDF graphs. These operators are an adequate functional set in that they (1) allow precise specification of a thematic portion of the RDF graph (subgraph), (2) provide facilities to compute spatial and temporal properties of these subgraphs and (3) allow filtering and joins based on the computed spatial and temporal properties. The operators are implemented as SQL table functions. Table functions produce a set of rows as output which can be queried. They are used in SQL queries in the same manner as a database table name. See the example below for illustration.

LSDIS Lab

With this query, we are using the spatial_eval operator to specify (1) a relationship between a soldier, a chemical agent and a battle location and (2) a relationship between members of an enemy organization and their known locations. We are then limiting the results based on the spatial proximity of the battles and enemy sightings. In Addition, we provide a spatial_extent operator that allows retrieving the spatial geometry associated with the spatial entities composing a thematic relationship and optionally filtering the results using a spatial predicate. For example, find all soldiers participating in military events that take place within an input bounding box. For temporal aspects, we provide an analogous temporal_extent operator that returns the temporal properties of a given relationship and allows optional filtering. For example, return all soldiers exhibiting a given symptom during a specific time period. We also provide a temporal_eval operator that can answer queries such as find soldiers who exhibited symptoms after participating in a given military event.

SPARQL-ST

It is important that our STT querying approach fits with the Semantic Web community's existing querying framework. SPARQL is the current World Wide Web Consortium (W3C) recommended query language for RDF data. As a part of this project, we have developed SPARQL-ST: an extension of SPARQL that allows querying spatiotemporal RDF graphs (i.e. temporal RDF Graphs that contain spatial objects). Consider the SPARQL-ST query below.

LSDIS Lab

SPARQL-ST introduces a spatial variable type (denoted with a % prefix) and a temporal variable type (denoted with a # prefix). Spatial variables represent complex spatial features rather than a single URI, and the concept of a mapping is extended so that spatial variables map to a set of triples that represent a spatial feature. The spatial variable %g is used in the query above to represent the spatial extent of a congressional district. Temporal variables map to time intervals rather than a URI and can appear in the quad position of what we term a spatiotemporal triple pattern. Temporal variables are used in the example query to retrieve the valid time of each temporal RDF statement. In addition, SPARQL-ST allows computation of derived time intervals. For example, the query above computes the interval intersection of four time intervals to derive the valid time of the entire triple pattern. SPARQL-ST also introduces SPATIAL FILTE} and TEMPORAL FILTER expressions to filter results using spatial and temporal conditions. The query above applies a filtering conditon to the spatial extent of each congressional district.

Publications

Conference and Workshop Papers:

  1. C. Henson, K. Thirunarayan, A. Sheth, and P. Hitzler. “Representation of Parsimonious Covering Theory in OWL-DL.” In Proceedings of the 8th International Workshop on OWL: Experiences and Directions (OWLED 2011). 5-6 June 2011, San Francisco, CA, United States.
  2. P. Jain, P.Z. Yeh, K. Verma, R. Vasquez, M. Damova, P. Hitzler and A.P. Sheth. “Contextual Ontology Alignment of LOD with an Upper Ontology: A Case Study with Proton.” In Proceedings of the 8th Extended Semantic Web Conference (ESWC 2011). Part I. Lecture Notes in Computer Science 6643. 29 May- 2 June 2011, Springer, Heraklion, Crete, Greece. 80-92.
  3. P. Jain, P. Hitzler, A.P. Sheth, K. Verma and P.Z. Yeh. “Ontology Alignment for Linked Open Data,” Ed. P. Patel-Schneider, Y. Pan, P. Hitzler, P. Mika, L. Zhang, J. Pan, I. Horrocks, And B. Glimm. Proceedings of the 9th International Semantic Web Conference 2010. 7-11 Nov 2010, Shanghai, China. Volume 6496 of Lecture Notes in Computer Science. Springer, Berlin: Heidelberg, Germany, 2010. 402-417 (Acceptance Rate 20%).
  4. P. Jain, P. Hitzler, P. Z. Yeh, K. Verma, and A.P. Sheth. "Linked Data Is Merely More Data." Ed. Dan Brickley, Vinay K. Chaudhri, Harry Haplin, and Deborha McGuinness. Linked Data Meets Artificial Intelligence. Technical Report SS-10-07. Menlo Park, CA: AAAI Press, 2010. 82-86. ISBN 978-1-57735-461-1.
  5. P. Jain, P. Hitzler, and A. P. Sheth. “Flexible Bootstrapping-Based Ontology Alignment,” Ed. P. Shvaiko, J. Euzenat, F. Giunchiglia, H. Stuckenschmidt, M. Mao, and I. Cruz. Ontology Matching, OM-2010. Proceedings of the 5th International Workshop on Ontology Matching, at ISWC2010. Nov. 2010, Shanghai, China. 136-137.
  6. P. Jain, P.Z. Yeh, K. Verma, R. Vasquez, M. Damova, P. Hitzler and A.P. Sheth. “Contextual Ontology Alignment of LOD with an Upper Ontology: A Case Study with Proton.” Ed. Grigoris Antoniou, Marko Grobelnik, Elena Simperl, Bijan Parsia, Dimitris Plexousakis, Jeff Pan and Pieter De Leenheer. Proceedings of the 8th Extended Semantic Web Conference (ESWC 2011). Volume 6643 of Lecture Notes in Computer Science Springer, Berlin: Heidelberg , Germany, 2010. (Acceptance Rate 23.5%).
  7. P. Jain, P. Hitzler, A.P. Sheth, K. Verma and P.Z. Yeh. “Ontology Alignment for Linked Open Data.” Ed. P. Patel-Schneider, Y. Pan, P. Hitzler, P. Mika, L. Zhang, J. Pan, I. Horrocks, And B. Glimm. Proceedings of the 9th International Semantic Web Conference 2010. Shanghai, China, 7-11 Nov. 2010. Volume 6496 of Lecture Notes in Computer Science. Springer, Berlin: Heidelberg, Germany, 2010. 402-417. (Acceptance Rate 20%).
  8. P. Jain, P. Hitzler, P. Z. Yeh, K. Verma, and A. Sheth. “Linked Data Is Merely More Data.” Ed. Dan Brickley, Vinay K. Chaudhri, Harry Halpin, and Deborah McGuinness. Linked Data Meets Artificial Intelligence. Technical Report SS-10-07. Menlo Park, CA: AAAI Press, 2010. 82-86. ISBN 978-1-57735-461-1.
  9. P. Jain, P. Z. Yeh, K. Verma, C. Henson, A. Sheth, "SPARQL Query Re-writing for Spatial Datasets Using Partonomy Based Transformation Rules",Third International Conference on Geospatial Semantics (GeoS 2009), Mexico City, Mexico, December 3-4, 2009, to appear (PDF)
  10. M. Nagarajan, K. Gomadam, A. Sheth, A. Ranabahu, R. Mutharaju, A. Jadhav, "Spatio-Temporal-Thematic Analysis of Citizen-Sensor Data - Challenges and Experiences", Tenth International Conference on Web Information Systems Engineering, Oct 5-7, 2009, Poland, to appear (PDF)
  11. M. Perry, A. Sheth, F. Hakimpour, P. Jain "Supporting Complex Thematic, Spatial and Temporal Queries over Semantic Web Data", Second International Conference on Geospatial Semantics (GeoS '07), Mexico City, Mexico, November 29 - 30, 2007 (PDF)
  12. M. Perry, F. Hakimpour, A. Sheth. "Analyzing Theme, Space and Time: An Ontology-based Approach", Fourteenth International Symposium on Advances in Geographic Information Systems (ACM-GIS '06), Arlington, VA, November 10 - 11, 2006 (PDF)
  13. F. Hakimpour, B. Aleman-Meza, M. Perry, A. Sheth. "Data Processing in Space, Time, and Semantics Dimensions", Terra Cognita 2006 - Directions to the Geospatial Semantic Web, in conjunction with the Fifth International Semantic Web Conference (ISWC '06), Athens, GA, November 6, 2006 (PDF)

Journal Articles:

  1. A. Sheth, C. Thomas, and P. Mehra. “Continuous Semantics to Analyze Real-Time Data,” IEEE Internet Computing. Vol. 14. No. 6. 84-89. Nov./Dec. 2010. doi:10.1109/MIC.2010.137.
  2. A. Sheth, H. Purohit, A. Jadhav, P. Kapanipathi and L. Chen. “Understanding Events Through Analysis Of Social Media,” Technical Report. Kno.e.sis Center. 2010.
  3. K. Thirunarayan, T. Immaneni. “Integrated Retrieval from Web of Documents and Data: Advances in Data Management.” Ed. Z. W. Ras and A. Dardzinska. Studies in Computational Intelligence, SCI 223. 2009. 25-48.
  4. M. Perry, A. Sheth and P. Jain. “SPARQLST:Extending SPARQL to Support Spatiotemporal Queries.” Kno.e.sis Center Technical Report. KNOESIS-TR-2009-01: 3 Nov. 2008.
  5. A. Sheth and M. Perry, "Traveling the Semantic Web through Space, Time and Theme", IEEE Internet Computing, Vol. 12, No. 2, February/March 2008 (PDF)
  6. I. B. Arpinar, A. Sheth, C. Ramakrishnan, L. Usery, M. Azami, and M. Kwan, "Geospatial Ontology Development and Semantic Analytics", Transactions in GIS, Blackwell Publishing, Vol. 10, No. 4, 2006. (PDF)

Book Chapters:

  1. F. Hakimpour, B. Aleman-Meza, M. Perry, A. Sheth, "Spatiotemporal-Thematic Data processing in Semantic Web", The Geospatial Web, A. Scharl and K. Tochtermann (Eds.), Springer-Verlag, May, 2007 (PDF)
  2. M. Perry, A. Sheth, I. B. Arpinar. "Geospatial and Temporal Semantic Analytics", To appear in Handbook of Research on Geoinformatics, Hassan A. Karimi (Ed.), Idea-Group Inc., 2009 (PDF)

Ph.D. Dissertations and Master's Theses:

  1. M. Nagaran, "Understanding User-Generated Content on Social Media", Ph.D. Dissertation, Wright State University, August 2010. (PDF
  2. M. Perry. "A Framework to Support Spatial, Temporal and Thematic Analytics over Semantic Web Data", Ph.D. Dissertation, Wright State University, June 10, 2008 (PDF)

Technical Reports:

  1. A. Sheth, “SPARQL-ST Extending SPARQL to Support Spatio-Temporal Queries.” Standards Activities: SPARQL Working Group (Jain). W3C Semantic Sensor Network Incubator Group (Sheth).
  2. M. Perry,A. Sheth and P. Jain "SPARQLST:Extending SPARQL to Support Spatiotemporal Queries", Kno.e.sis Center Technical Report. KNOESIS-TR-2009-01, Nov 3, 2008 (PDF)
  3. M. Perry,A. Sheth and P. Jain "A Framework to Support Spatial, Temporal and Thematic Analytics over Semantic Web Data", Kno.e.sis Center Technical Report. KNOESIS-TR-2009-02, Feb 27, 2009 (PDF)

Presentations:

  1. Title:Computing for Human Experience: Semantics empowered Cyber-Physical, Social and Ubiquitous Computing beyond the Web.”

    Given at:OnTheMove Federated Conferences and Workshops. 2011, Crete, Greece.

    Download:Library Item

  2. Title:Spatial Semantics for Better Interoperability and Analysis: Challenges and Experiences in Building Semantically Rich Applications in Web 3.0

    Given at:3rd Annual Spatial Ontology Community of Practice Workshop: Development, Implementation and Use of Geo-Spatial Ontologies and Semantics (USGS). 3 Dec. 2010, Reston, VA.

    Download:Library Item

  3. Title:“How to Make Linked Data More than Data

    Given at:Semantic Technology Conference. 23 June 2010, San Francisco, CA.

    Download:PPT

  4. Title: Query Rewriting Framework for Spatial Data

    Given at: Wright State University, Dayton, OH, March 10, 2009

    Download: PPT

  5. Title: A Framework to Support Spatial, Temporal and Thematic Analytics over Semantic Web Data

    Given at: Wright State University, Dayton, OH, July 10, 2008

    Download: PPT

  6. Title: Supporting Complex Thematic, Spatial and Temporal Queries over Semantic Web Data

    Given at: Second International Conference on Geospatial Semantics (GeoS '07), Mexico City, MX, November 30, 2007

    Download: PPT

  7. Title: Analyzing Theme, Space and Time: An Ontology-based Approach

    Given at: Fourteenth International Symposium on Advances in Geographic Information Systems (ACM-GIS '06), Arlington, VA, November 11, 2006

    Download: PPT

Data Sets:

  1. Description BLOOMS- An Ontology alignment system based on the idea of bootstrapping information already present on the LOD cloud. A framework for re-writing queries based on partonomical relationships.

    Link: http://wiki.knoesis.org/index.php/BLOOMS

    Related Publication: Prateek Jain, Pascal Hitzler, Amit P. Sheth, Kunal Verma and Peter Z. Yeh. “Ontology Alignment for Linked Open Data”. Ed. P. Patel-Schneider, Y. Pan, P. Hitzler, P. Mika, L. Zhang, J. Pan, I. Horrocks, and B. Glimm. Proceedings of the 9th International Semantic Web Conference 2010. 7-11 Nov. 2010, Shanghai, China. Vol. 6496 of Lecture Notes in Computer Science. Springer, Berlin: Heidelberg , 2010. 402-417. (Acceptance Rate 20%).

  2. DescriptionPartonomical Relationship Based Query Rewriting System. A framework for re-writing queries based on partonomical relationships.

    Link:http://knoesis.wright.edu/students/prateek/geos.htm

    Related Publication: Prateek Jain, Peter Z. Yeh, Kunal Verma, Cory Henson and Amit Sheth. “SPARQL Query Re-writing for Spatial Datasets Using Partonomy Based Transformation Rules.”Third International Conference on Geospatial Semantics (GeoS 2009). 3-4 Dec. 2009, Mexico City, Mexico.

  3. Description: Small real-world spatiotemporal RDF data set describing social and terrorism-related events.

    Link: http://lsdis.cs.uga.edu/projects/semdis/spatiotemporal/

    Related Publication: F. Hakimpour, B. Aleman-Meza, M. Perry, A. Sheth. "Data Processing in Space, Time, and Semantics Dimensions", Terra Cognita 2006 - Directions to the Geospatial Semantic Web, in conjunction with the Fifth International Semantic Web Conference (ISWC '06), Athens, GA, November 6, 2006

  4. Description: Large synthetically generated RDF data set for historical battlefield analysis scenario (7 million asserted triples) with links to accompanying spatial data.

    Link: http://knoesis.wright.edu/students/mperry/STData.html

    Related Publication: M. Perry, A. Sheth, F. Hakimpour, P. Jain "Supporting Complex Thematic, Spatial and Temporal Queries over Semantic Web Data", Second International Conference on Geospatial Semantics (GeoS '07), Mexico City, MX, November 29 - 30, 2007

  5. Description: Large synthetically generated RDF data set for historical battlefield analysis scenario (18 million asserted triples) with links to accompanying spatial data. Also, links are available for spatial data that can be incorporated into the real-world GovTrack RDF dataset.

    Link: http://knoesis.wright.edu/students/mperry/dissertation/Test-Details.html

    Related Publication: M. Perry. "A Framework to Support Spatial, Temporal and Thematic Analytics over Semantic Web Data", Ph.D. Dissertation, Wright State University, June 10, 2008

Personnel:

    Faculty

      Amit Sheth
      Krishnaprasad Thirunarayan


    Students

    Current

      Prateek Jain
      Hemant Purohit
      Lu Chen
      Ashwin Manjunatha
      Christopher Thomas
      Pramod Anantharam
      Matthan Sink
      Michael Cooney
      Alan Smith

    Alumni

      Meenakshi Nagarajan
      Graduated with Ph.D. in 2010 (thesis). Currently a research staff member at Almaden Research Center, San Jose, CA, USA.
      Matt Perry
      Graduated with Ph.D. in 2008 (thesis).Currently Senior Member of Technical Staff at Oracle USA,Inc at Nashua,NH with the Semantic Technologies Group.
      Paul Fultz

This research was initially funded in part by NSF Award#IIS-0714441 (01/01/2007-12/31/2009) [formerly IIS--0325464 (09/01/2004- 12/31/2006)], titled "Collaborative Proposal: ITR-SemDIS: Discovering Complex Relationships in the Semantic Web" Additionally, this research is partially funded by NSF Award#IIS-0842129, titled "III-SGER: Spatio-Temporal-Thematic Queries of Semantic Web Data: a Study of Expressivity and Efficiency (09/01/2008-08/31/2010)"

© 2012 Knoesis | 377 Joshi Research Center, 3640 Colonel Glenn Highway, Dayton, OH 45435 (937 - 775 - 5217)