Multidocument, Multilingual, and Multimodal Information Extraction for Real World Applications

November 2002

Mark T. Maybury, The MITRE Corporation

ABSTRACT

This keynote addresses current and future challenges in terminology and knowledge engineering, focusing on multidocument, multilingual, and multimodal information extraction. With some reports that humanity creates more than an exabyte (10^18 bytes) of unique information each year, tools that mitigate the size, heterogeneity, and complexity of knowledge collections are a priority. After exemplifying this grand challenge in typical real-world analytic environments, we briefly review the state of the art in information access. We note that automated systems exist that can return documents relevant to a particular subject with around 80% precision but low recall. Automated document query incorporating relevance feedback has achieved near-human performance. Extraction of named entities (Hirschman 1998) is over 90% accurate, and extraction of relations among entities in specific domains is about 70-80% accurate. Also, documents can be summarized to about 20% of their source size without information loss, which can save users 50% of their original task time. Finally, prototype systems can respond to simple factual questions by returning answers from relevant documents with about 75% accuracy.
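The abstract contrasts high precision with low recall, and a small worked example may help readers who are less familiar with the two measures. The following is a minimal Python sketch, not taken from the paper: the function name `precision_recall` and the document identifiers are hypothetical, and the counts are chosen only to reproduce an 80% precision figure alongside a low recall.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for a single query.

    precision = relevant documents retrieved / all documents retrieved
    recall    = relevant documents retrieved / all relevant documents
    """
    retrieved = set(retrieved)
    relevant = set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical data: the system returns 5 documents, 4 of which are relevant,
# while the collection holds 20 relevant documents in total.
retrieved = ["d1", "d2", "d3", "d4", "d5"]
relevant = ["d1", "d2", "d3", "d4"] + [f"r{i}" for i in range(16)]

p, r = precision_recall(retrieved, relevant)
print(f"precision={p:.2f} recall={r:.2f}")  # precision=0.80 recall=0.20
```

In this illustration most of what the system returns is useful (4 of 5), yet it finds only a fifth of what it should have found (4 of 20), which is the pattern the abstract describes.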



Page last updated: January 7, 2003

Copyright © 1997-2008, The MITRE Corporation. All rights reserved.