File processing of native file formats

Inactive Publication Date: 2012-10-18
XEROX CORP
View PDF10 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0009]According to one aspect of the present disclosure, a computer-implemented method for storing configuration data for electronic documents having different native file formats is provided. The method is implemented in a computer system comprising one or more processors configured to execute one or more computer program modules. The method includes (a) receiving and displaying an electronic document in its native file format; (b) receiving a user input for identifying regions of interest in the displayed electronic document for data extraction; (c) receiving a user input for associating each region of interest with a corresponding defined output field; (d) storing configuration data for the electronic document, the configuration data comprising the regions of interest and their associations with corresponding defined output fields; and (e) performing procedures (a) through (d) for other electronic documents to obtain and store configuration data for those electronic documents.

Problems solved by technology

Some drawbacks with these types of systems is that they are often very compute-intensive and storage intensive.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • File processing of native file formats
  • File processing of native file formats
  • File processing of native file formats

Examples

Experimental program
Comparison scheme
Effect test

Example

[0019]The present disclosure provides a system and a set of methods wherein data or information is extracted from a collection of documents provided in a number of different electronic formats. The system of the present disclosure directly consumes virtually any native file format documents, extracts information and data from the documents, formats and stores the extracted information or data for subsequent processing.

[0020]The method of the present disclosure includes a configuration sub-method and a runtime sub-method. The configuration sub-method allows a user a) to visually identify elements and / or regions on a received document (in virtually any native file format) using an advanced or a specialized viewer and b) to associate the identified elements and / or regions with fields to be output by the system. The configuration sub-method also includes storing, for each electronic document, the regions of interest and their associations with corresponding defined output fields. The r

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A computer-implemented method for processing electronic documents having different native file formats is provided. The method is implemented in a computer system comprising one or more processors configured to execute one or more computer program modules. The method includes (a) receiving electronic documents in different native file formats; (b) identifying the native file format for each received electronic document; (c) retrieving a stored configuration data for the identified native file format, the configuration data includes a mapping of regions of interest in the electronic document with the identified native file format and their associations with output fields; and (d) processing the electronic documents using their retrieved configuration data to extract data from the electronic documents.

Description

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Owner XEROX CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products