Scalability Issues in Data Integration
July 2001
Arnon Rosenthal, The MITRE Corporation
Len Seligman, The MITRE Corporation
ABSTRACT
Data integration efforts often aim to give users access to multiple data sources through queries (and other requests) against a global schema. As sources change, as new ones become available, and as others become unavailable (at least temporarily), maintaining the necessary mappings and other metadata becomes very burdensome. We compare the administrative labor and data accessibility of two popular approaches: federated databases, which derive each table in the global schema as a view over the sources, and source-profile systems, which describe each source’s offerings as a view over the global tables. We then propose a hybrid process that combines their advantages.
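
As a rough illustration of the two mapping directions contrasted above, the following minimal Python sketch may help. It is not taken from the paper: the source names (`hr_east`, `hr_west`), the `site` column, and both functions are hypothetical, and a source profile is reduced here to a simple row predicate.

```python
# Illustrative data only; source names and columns are hypothetical.
SOURCES = {
    "hr_east": [{"name": "Ada", "site": "east"}],
    "hr_west": [{"name": "Bo", "site": "west"}],
}


def employee_federated(sources):
    """Federated style: the global Employee table is a view written over named sources."""
    return sources["hr_east"] + sources["hr_west"]


# Source-profile style: each source is described as a view over the global table,
# reduced here to a predicate saying which global rows the source can supply.
PROFILES = {
    "hr_east": lambda row: row["site"] == "east",
    "hr_west": lambda row: row["site"] == "west",
}


def employee_from_profiles(sources, profiles):
    """Collect rows from every source that is both profiled and currently present."""
    rows = []
    for name, offers in profiles.items():
        for row in sources.get(name, []):
            if offers(row):
                rows.append(row)
    return rows
```

In this toy setting, removing `hr_west` from `SOURCES` makes `employee_federated` raise a `KeyError` until someone edits the view definition, whereas `employee_from_profiles` simply returns the rows still reachable. Real systems involve far richer view languages and query rewriting, so this should be read only as a picture of which side of the mapping names the other.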

Additional Search Keywords
data integration, data administration, scalability, federated databases