MITRE
 
About Us Our Work Employment News & Events
MITRE Remote Access for MITRE Employees Site Map
Home > Our Work > Technical Papers >

Focusing on the Data in Data Mining: Lessons from Recent Experience

November 1997

Neal J. Rothleder, The MITRE Corporation
Earl Harris, The MITRE Corporation
Eric Bloedorn, The MITRE Corporation

ABSTRACT

The use of data mining is growing rapidly. The number of data mining consultants, as well as the number of commercial tools available to the "non-expert" user, are also quickly increasing. It is becoming easier than ever to collect datasets and apply data mining tools to them. As more and more non-experts seek to exploit this technology to help with their business, it becomes increasingly important that they understand the underlying assumptions and biases of these tools. There are a number of factors to consider before applying data mining to a database. In particular, there are important issues regarding the data which should be examined before proceeding with the data mining process. While these issues may be well-known to the data mining expert, the non- expert is often unaware of their importance. In this paper, we will focus on three specific issues, and illustrate each through the use of examples taken from our recent experiences. For each issue, we provide insight into how it might be problematic and suggest techniques for approaching such situations.

» Download Paper [PDF, 109KB]

Additional Search Keywords

n/a

 

Page last updated: October 24, 1997   |   Top of page

Homeland Security Center Center for Enterprise Modernization Command, Control, Communications and Intelligence Center Center for Advanced Aviation System Development

 
 
 

Serving as Architects of Information Advantage.™
Copyright © 1997-2008, The MITRE Corporation. All rights reserved.
MITRE is a registered trademark of The MITRE Corporation.
Material on this site may be copied and distributed with permission only.

 

Privacy Policy | Contact Us

Boston Business Journal Best Places to Work 2007 Computerworld Best Places to Work in IT 2005-2007 Fortune 100 Best Places to Work 2002-2008