System and method for on-the-fly elimination of redundant data

a technology of redundant data and system, applied in the field of storage systems, can solve the problems of increasing the total cost of maintaining the storage system, affecting the overall performance of the storage system implementing such a method, and retaining a plurality of copies of the same data

Active Publication Date: 2008-11-27
NETWORK APPLIANCE INC
View PDF3 Cites 362 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

This patent describes a method where when a client requests access to files on a network server, it divides them into smaller units called chunks. These chunks can have different sizes and types based on how well they were divided. A fingerprint is created from each chunk's content and used to verify its validity. When a new block is added, the pointer to the previous block becomes invalid. By comparing the fingerprint of the new block with the original ones, the system checks if their locations match up. It also updates the index file containing the necessary information about the block being accessed. Overall, this technology helps ensure accurate data retrieval even after changes made during execution.

Problems solved by technology

The technical problem addressed in this patent is how efficiently manage and retrieve data from a storage system without requiring excessive storage capacity.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • System and method for on-the-fly elimination of redundant data
  • System and method for on-the-fly elimination of redundant data
  • System and method for on-the-fly elimination of redundant data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020]A. Storage System Environment

[0021]FIG. 1 is a schematic block diagram of an environment 100 including a storage system 120 that may be advantageously used with the present invention. The storage system is a computer that provides storage service relating to the organization of information on storage devices, such as disks 130 of a disk array 160. The storage system 120 illustratively comprises a processor 122, a memory 124, a network adapter 126 and a storage adapter 128 interconnected by a system bus 125. The storage system 120 also includes a storage operating system 200 that preferably implements a high-level module, such as a file system, to logically organize the information as a hierarchical structure of directories, files and special types of files called virtual disks on the disks.

[0022]In the illustrative embodiment, the memory 124 comprises storage locations that are addressable by the processor and adapters for storing software program code. A portion of the memory ma

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A system and method for “on-the-fly” de-duplication of data before storing the data in a storage system is provided. A data de-duplication module illustratively cooperates with protocol servers and a file system of a storage operating system executing on the storage system to implement the novel de-duplication technique. The de-duplication module illustratively generates a block store, an index file and a hash table on storage space provided by the storage system. The hash table is utilized for tracking fingerprints and locations of blocks within the block store. The index file is utilized for storing directory information identifying the contents of data containers stored on the storage system, while the block store is utilized to store raw data blocks that comprise the data containers.

Description

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Owner NETWORK APPLIANCE INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products