| 2004 Technology
Symposium > Information Management
Information Management
Information Management focuses on technologies and processes that enable
the organization, creation, management, and use of information to satisfy
the needs of diverse applications and users.
Combining Data and Knowledge Graph Analysis
Eric Bloedorn, Principal Investigator
Location(s): Washington
Problems The need to efficiently extract patterns from large graphs is increasingly recognized by many MITRE sponsors. However, fundamental operations such as exact graph matching are NP-complete and therefore computationally too expensive for even moderate-sized graphs. Thus, more complex analyses, such as approximate graph matching, frequent subgraph discovery, or graph classification, are currently impractical for our sponsors. Objectives The goal of this project is to develop and empirically evaluate the use of domain knowledge (DK) to improve inexact graph matching, frequent subgraph discovery, and graph classification. We are also interested in better understanding the characteristics of the graphs our sponsors deal with and in making those characteristics known to the external community. Activities To investigate the use of DK in graph analysis we will select datasets, develop metrics for characterizing graphs, and design approaches for representing and storing graphs and DK, as well as develop and evaluate algorithms. Our goal is to have a set of tools for supporting a wide range of graph analysis tasks by the end of this project. Impact For our sponsors, performing these operations efficiently for large graphs can mean detecting undiscovered criminal organizations, identifying key criminal organizers, and saving millions of dollars previously lost to fraud.
^TOP
Exploring Algorithms/Techniques for Location-Based Publish and Subscribe Services
Kevin Kelly, Principal Investigator
Location(s): Washington
Problems The "state of the practice" of publish-and-subscribe involves delivering services either as a centralized information broker or as a federation of information brokers with pre-engineered and/or manually configured subscriptions. Clearly, these approaches suffer from scalability and/or complexity issues. Objectives This project will focus on specific techniques and algorithms for the automatic distribution of location-based information. Specific objectives include leveraging existing COTS technology and algorithms for decentralized publish-and-subscribe services and verifying our approach in an experimental testbed. Activities The project will develop prototypes and algorithms, analyze use cases, develop an information dissemination evaluation testbed, conduct experimentation and analysis, and document its findings. Project staff will also participate in relevant standards bodies. Impact The work will develop technology for enabling edge-centric access to information in a highly decentralized environment, and specific techniques and algorithms that are applicable to Joint Blue Force Situational Awareness. It will also create system engineering insight applicable to information distribution techniques for a network-centric force, and lead to transition of new technology to standards bodies.
^TOP
Intelligence Sharing Across Boundaries
Amy Kazura, Principal Investigator
Location(s): Washington and Bedford
Problems Many sponsors have mandated a top-down, centralized information management approach, yet face the inevitable budgetary, political, social, cultural, technological and security-driven forces to fragment information in stovepipes. Distributed approaches present other problems, mostly related to the complexity of information sharing and interoperability, and the resulting inability to construct a common picture of the world from non-connected and disparate views. Objectives We will investigate ways that fine-grained information (facts and beliefs about the world), gathered piecemeal by distributed communities of users, can be more readily and effectively shared across organizational boundaries. More generally, we will show how richer models and sharing approaches can be automated and used to enable wider access to information for tasks involving reasoning, decision support, or analysis. Activities We will construct a prototype to capture knowledge, advertise information, discover remote information and assimilate new information into the local world-view. We will also extend this approach to guide filtering and selection from data sources. This will all be done with appropriate security controls to permit policy management.
Impact With the appropriate selection of semantic data models, the prototype tools should be well suited to various types of analytic tasks, such as counter-terrorism target development analysis, order-of-battle and targeting analysis, or open source analysis. The distributed, federated approach will also be applicable to organizations which must share information across many boundaries.
^TOP
Intersystem Mappings for Netcentric Data Sharing
Len Seligman, Principal Investigator
Location(s): Washington
Problems Netcentric data sharing aims to provide data visibility, understanding, and interoperability beyond traditional stovepipes. However, it is insufficient to simply post all schemas to a metadata repository, given heterogeneous semantics. We also need accurate intersystem mappings -- from implemented systems to community ontologies and among overlapping ontologies. In addition, intersystem mappings are essential to point-to-point data interoperability. Objectives Our goal is to advance the state of the art in creating and maintaining intersystem mappings, with an emphasis on the needs of netcentric environments. Specifically, we will develop, evaluate, and transition (1) a linguistically intelligent schema matcher, to improve upon previous matchers' performance; and (2) scalable techniques for specifying and supporting data sharing agreements between data producers and consumers. Activities We will implement and evaluate a linguistic-based schema matcher and design a multi-strategy match framework. We will develop mechanisms for specifying data sharing agreements and services for enforcing those agreements, understanding the impact of changes to participating schemas, and notifying affected parties. We will also analyze what kinds of agreements are enforceable with different levels of intrusiveness on participating systems. Impact The research directly addresses two of the DoD FFRDC's strategic outcomes: "Integration and Interoperability" and "Enabling Netcentric Operations." In addition, all our sponsors need greater agility in data sharing. We will speed progress in both the creation and maintenance of intersystem mappings and will influence future Net Centric Enterprise Services and DoD Metadata Registries.
^TOP
Multi-sensor Aerospace-ground Joint ISR Interoperability Coalition (MAJIIC)
John Kane, Principal Investigator
Location(s): Washington and Bedford
Problems Unprecedented levels of diverse ISR assets were utilized during Operation Iraqi Freedom to the benefit of U.S. and coalition operations. However, a lack of data accessibility and interoperability limited the utility of the collected ISR data and our ability to share it with our coalition partners. Objectives The objective is a Global Information Grid (GIG) populated in near-real time with U.S. and coalition air and space ISR sensor data and platform situational awareness information. All data will adhere to common formats and be discoverable and accessible via XML Web-based GIG Enterprise Services. The goal is to improve ISR data dissemination and utility in support of U.S. and coalition time-sensitive missions. Activities MAJIIC is working under a multinational agreement with nine other nations to develop ISR interoperability standards, interfaces, enterprise services, concepts of operations, and tactics, techniques, and procedures. MAJIIC is also an FY04 OASD (Networks and Information Integration) Horizontal Fusion Portfolio Initiative working to create the GIG Collateral Space and pilot DISA Net-centric Enterprise Services. Impact The culmination of MAJIIC's efforts will lead to the realization of OSD's multinational GIG objectives. MAJIIC represents multinational needs within the U.S. Horizontal Fusion/GIG Enterprise Services development community and works with our multinational partners to ensure their ISR systems are interoperable with the emerging U.S. GIG environment.
^TOP
Peer-to-Peer Information Sharing Architecture
John Kane, Principal Investigator
Location(s): Washington and Bedford
Problems The network-centric objectives of the Global Information Grid (GIG) envisage an information sharing environment that empowers the user with the ability to securely access all relevant information and recognizes the individual user as an information source. The stovepiped information and collaborative systems in use today do not satisfy this objective. Objectives This project will develop and demonstrate a distributed data sharing architecture based on peer-to-peer (P2P) technology that will connect users and C2ISR data sources together in a dynamic virtual network. Users will be empowered to publish, share, and discover information in concert with traditional systems of record. Activities Our approach is to develop a DoD focused P2P protocol, client application, and C2ISR system connector that will augment and complement operational collaborative tools. We will work with our sponsors to develop an effective CONOPS for use of this technology, study the security aspects of its implementation, and work with DISA to transition it into the GIG Core Enterprise Services. Impact This project has the potential to significantly enhance the way information is shared and discovered by tactical and analytical users. It will provide the first real effort on our sponsor's behalf to standardize P2P technology use within the DoD and thereby derail the likely proliferation of noninteroperable P2P solutions on the operational networks.
^TOP
Semantic Integration of the C2 Enterprise
Marwan Sabbouh, Principal Investigator
Location(s): Washington and Bedford
Problems Enterprise integration (EI) is a key initiative in the Air Force. Most EI strategies are based on standards such as XML and Web Services Description Language (WSDL). However, these standards are not scalable and remain too costly to implement. For instance, moving the Air Operations Center to Web services requires building over 1,000 services and will cost $44-$70 million. Objectives This research activity will show that ontologies are an enabling technology for the Command and Control Enterprise Reference Architecture (C2ERA). Using ontologies, we can gradually integrate disparate systems by mapping the ontology to a database schema to integrate new data sources and by mapping WSDL to the ontology to integrate Web services. Activities We will refine two existing ontologies and design two new ones; prototype a WSDL-to-ontology mapper service; leverage DevNet to map the ontology to the Air Operations Database and to WSDL operations; and prototype an airlift application that uses this methodology. We will publish our findings and transition this methodology, the ontology, and the prototypes to industry. Impact This research will provide an enabling technology for the C2ERA and C2EI. In addition to impacting ESC/AC (Battle Management Command, Control, and Communications), this methodology results in a set of services that can become part of the Common Information Infrastructure. This project will also impact ESC/EN (Engineering). In addition, we may license these technologies to industry partners.
^TOP
|