
Analyzing and Revising Mediated Schemas to Improve Their Matchability

December 2007

Xiaoyong Chai, University of Wisconsin-Madison
Mayssam Sayyadian, University of Wisconsin-Madison
AnHai Doan, University of Wisconsin-Madison
Arnon Rosenthal, The MITRE Corporation
Len Seligman, The MITRE Corporation

ABSTRACT

Data integration systems often provide a uniform interface, called a mediated schema, to a multitude of disparate data sources. To answer user queries posed over the mediated schema, such systems employ a set of semantic matches between this schema and the local schemas of the data sources. Finding such matches is well known to be difficult. Hence, much work has focused on developing semi-automatic techniques to find the matches efficiently. In this paper, however, we consider the complementary problem of improving the mediated schema to make finding such matches easier. Specifically, a mediated schema S will typically be matched with many source schemas. Can the developer of S therefore analyze and revise S in a way that preserves S's semantics and yet makes S easier to match against in the future?

We describe mSeer, a solution to this problem. Given a mediated schema S, mSeer first computes a matchability score that quantifies how well other schemas can be matched against S. Next, mSeer generates a matchability report that shows where the problems in matching S come from. Finally, mSeer automatically suggests changes to S (e.g., renaming an attribute or reformatting data values) that it believes will preserve the semantics of S and yet make it more amenable to matching. The creator of S is free to accept or revise the changes suggested by mSeer. We present extensive experiments over several real-world domains that demonstrate the effectiveness of our approach.
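To make the idea of a matchability score and report concrete, the following is a minimal illustrative sketch in Python. It is not mSeer's algorithm, which the abstract does not specify. It assumes a toy name-based matcher built on the standard library's `difflib.SequenceMatcher`, a set of hypothetical source schemas with known correct matches, and a score defined simply as the fraction of source attributes the toy matcher maps correctly. The names `name_similarity`, `predict_match`, and `matchability_report` are hypothetical.

```python
from difflib import SequenceMatcher


def name_similarity(a, b):
    """Toy similarity between two attribute names (assumption: names only)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def predict_match(source_attr, mediated_attrs):
    """Pick the mediated attribute whose name is most similar to source_attr."""
    return max(mediated_attrs, key=lambda m: name_similarity(source_attr, m))


def matchability_report(mediated_attrs, sources):
    """Return (score, per_attribute_error_rate) for a mediated schema.

    sources is a list of dicts, one per hypothetical source schema, mapping
    each source attribute name to its known correct mediated attribute.
    """
    errors = {m: 0 for m in mediated_attrs}
    totals = {m: 0 for m in mediated_attrs}
    for gold in sources:
        for source_attr, true_match in gold.items():
            totals[true_match] += 1
            if predict_match(source_attr, mediated_attrs) != true_match:
                errors[true_match] += 1
    total = sum(totals.values())
    # Score: fraction of source attributes the toy matcher maps correctly.
    score = 1 - sum(errors.values()) / total if total else 1.0
    # Report: error rate for each mediated attribute that has gold matches.
    report = {m: errors[m] / totals[m] for m in mediated_attrs if totals[m]}
    return score, report


# An opaque attribute name ("a1") is hard for the toy matcher to match.
sources_before = [{"price": "a1", "addr": "address"}]
score_before, report_before = matchability_report(["a1", "address"], sources_before)

# Renaming "a1" to "price" keeps the meaning but makes matching easier.
sources_after = [{"price": "price", "addr": "address"}]
score_after, report_after = matchability_report(["price", "address"], sources_after)

print(score_before, report_before)  # 0.5 {'a1': 1.0, 'address': 0.0}
print(score_after, report_after)    # 1.0 {'price': 0.0, 'address': 0.0}
```

In this toy setting the per-attribute error rates play the role of a report that points at the problematic attribute, and the rename stands in for the kind of semantics-preserving revision the abstract describes. A real system would presumably use far richer matchers and evidence than attribute names alone.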



Page last updated: December 27, 2007

Copyright © 1997-2008, The MITRE Corporation. All rights reserved.
MITRE is a registered trademark of The MITRE Corporation.
Material on this site may be copied and distributed with permission only.

 
