TY - JOUR
T1 - Querying semistructured heterogeneous information
AU - Quass, Dallan
AU - Rajaraman, Anand
AU - Ullman, Jeffrey
AU - Wedom, Jennifer
AU - Sagiv, Yehoshua
PY - 1997
Y1 - 1997
N2 - Semistructured data has no absolute schema fixed in advance and its structure may be irregular or incomplete. Such data commonly arises in sources that do not impose a rigid structure (such as the World-Wide Web) and when data is combined from several heterogeneous sources. Data models and query languages designed for well structured data are inappropriate in such environments. Starting with a "lightweight" object model adopted for the TSIMMIS project at Stanford, in this paper we describe a query language and object repository designed specifically for semistructured data. Our language provides meaningful query results in cases where conventional models and languages do not: when some data is absent, when data does not have regular structure, when similar concepts are represented using different types, when heterogeneous sets are present, and when object structure is not fully known. This paper motivates the key concepts behind our approach, describes the language through a series of examples (a complete semantics is available in an accompanying technical report [23]), and describes the basic architecture and query processing strategy of the "lightweight" object repository we have developed.
AB - Semistructured data has no absolute schema fixed in advance and its structure may be irregular or incomplete. Such data commonly arises in sources that do not impose a rigid structure (such as the World-Wide Web) and when data is combined from several heterogeneous sources. Data models and query languages designed for well structured data are inappropriate in such environments. Starting with a "lightweight" object model adopted for the TSIMMIS project at Stanford, in this paper we describe a query language and object repository designed specifically for semistructured data. Our language provides meaningful query results in cases where conventional models and languages do not: when some data is absent, when data does not have regular structure, when similar concepts are represented using different types, when heterogeneous sets are present, and when object structure is not fully known. This paper motivates the key concepts behind our approach, describes the language through a series of examples (a complete semantics is available in an accompanying technical report [23]), and describes the basic architecture and query processing strategy of the "lightweight" object repository we have developed.
KW - Query language
KW - Semistructured data
UR - http://www.scopus.com/inward/record.url?scp=0031236663&partnerID=8YFLogxK
U2 - 10.1023/a:1008287522472
DO - 10.1023/a:1008287522472
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:0031236663
SN - 0925-4676
VL - 7
SP - 381
EP - 407
JO - Journal of Systems Integration
JF - Journal of Systems Integration
IS - 3-4
ER -