About Us Our Work Employment News & Events
MITRE Remote Access for MITRE Staff and Partners Site Map
Our Work

Follow Us:

Visit MITRE on Facebook
Visit MITRE on Twitter
Visit MITRE on Linkedin
Visit MITRE on YouTube
View MITRE's RSS Feeds
View MITRE's Mobile Apps
Home > Our Work > Technical Papers >

Table Classification: An Application of Machine Learning to Web-hosted Financial Documents

April 2006

Marc Vilain, The MITRE Corporation
John Gibson, The MITRE Corporation
Benjamin Wellner, The MITRE Corporation
Rob Quimby, The MITRE Corporation

ABSTRACT

This paper presents learning-based techniques that support the processing of tables in HTML publications. We are concerned especially with classifying tables as to format and content, focusing on the domain of corporate financials. We present performance results based on multiple classification methods, and make several novel methodological contribu-tions. These include a new evaluation corpus, a clever tech-nique for creating the corpus, and an exhaustive approach to-wards sensitivity analysis for classification features.

View/Download Document

Additional Search Keywords

N/A

 

Page last updated: May 3, 2006   |   Top of page

Homeland Security Center Center for Enterprise Modernization Command, Control, Communications and Intelligence Center Center for Advanced Aviation System Development

 
 
 

Solutions That Make a Difference.®
Copyright © 1997-2013, The MITRE Corporation. All rights reserved.
MITRE is a registered trademark of The MITRE Corporation.
Material on this site may be copied and distributed with permission only.

IDG's Computerworld Names MITRE a "Best Place to Work in IT" for Eighth Straight Year The Boston Globe Ranks MITRE Number 6 Top Place to Work Fast Company Names MITRE One of the "World's 50 Most Innovative Companies"
 

Privacy Policy | Contact Us