Data tracking method, apparatus, device, and storage medium for a streaming computing system

A data tracking technology for streaming computing systems, applied in the field of data processing. It addresses inaccurate judgment of data loss, data loss that goes undetected, and the inability to locate lost data, and it improves the accuracy of judging data loss.

Active Publication Date: 2020-11-13
BEIJING BAIDU NETCOM SCI & TECH CO LTD

AI Technical Summary

Problems solved by technology

Existing technology does not judge data loss accurately.
For example, Heartbeat data may be lost while the real data is intact; conversely, real data may be lost while the Heartbeat data still arrives, so the loss goes undetected.
Moreover, a lost Heartbeat only indicates that unexpected data loss occurred somewhere in the system; it does not reveal which data was lost or at which node the loss occurred.
In addition, existing technology cannot tell which node is currently processing each piece of data or what the latest processing status of that data is.
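
To make the gap concrete, here is a minimal sketch of the kind of query that per-record tracking would allow and that a Heartbeat alone cannot answer. It assumes a hypothetical mapping from a tracking identifier to the latest node and state of each record; the names `find_unfinished`, `latest_state`, `"sink"`, and `"done"` are illustrative assumptions, not taken from the patent.

```python
def find_unfinished(latest_state, terminal_node="sink", terminal_state="done"):
    """Return {tracking_id: last_node} for records that have not reached the end.

    latest_state is a hypothetical mapping: tracking_id -> (node, state).
    A record listed here may be lost or merely still in flight; either way,
    the last node that reported it is where an investigation would start.
    """
    return {
        tracking_id: node
        for tracking_id, (node, state) in latest_state.items()
        if not (node == terminal_node and state == terminal_state)
    }


# Illustrative data only: three records with their latest reported position.
latest = {
    "r1": ("sink", "done"),
    "r2": ("map", "processed"),
    "r3": ("source", "ingested"),
}
unfinished = find_unfinished(latest)
```

Here `unfinished` names both the affected records and the node each was last seen at, which is the information the Heartbeat approach described above does not provide.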

Detailed Description of Embodiments

[0018] Exemplary embodiments of the present application are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present application to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0019] Figure 1 shows an exemplary system architecture 100 to which embodiments of the stream computing system data tracking method or the stream computing system data tracking device of the present application can be applied.

[0020] As shown in Figure 1, the system architecture 100 may include a storage device 101, a network 102, and a server 103. The ...

Abstract

The invention discloses a data tracking method, apparatus, device, and storage medium for a streaming computing system, and relates to the technical field of cloud computing. One specific embodiment of the method comprises the following steps: identifying data entering the streaming computing system and generating a tracking identifier for the data; after the data is processed by the computing nodes in the streaming computing system, sending the data to the computing nodes; and persistently storing the record information of the data. The record information comprises the tracking identifier of the data and the current processing state information of the data, and the storage state of the checkpoint of the data is kept consistent with the storage state of the record information of the data. The accuracy of judging data loss is thereby improved.
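
The steps in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the class and function names (`TrackedRecord`, `TrackingStore`, `run_pipeline`) are hypothetical, an in-memory dict stands in for persistent storage, and consistency with the checkpoint is approximated by committing pending record information only when a checkpoint is taken.

```python
import uuid
from dataclasses import dataclass, field


@dataclass
class TrackedRecord:
    """A piece of data plus the tracking identifier generated at ingress."""
    payload: str
    tracking_id: str = field(default_factory=lambda: uuid.uuid4().hex)


class TrackingStore:
    """Hypothetical record-information store (a dict stands in for durable storage)."""

    def __init__(self):
        self.committed = {}  # tracking_id -> (node, state), as of the last checkpoint
        self.pending = {}    # updates recorded since the last checkpoint

    def record(self, tracking_id, node, state):
        # Keep only the latest processing state for each tracking identifier.
        self.pending[tracking_id] = (node, state)

    def checkpoint(self):
        # Assumption: record information is committed together with the
        # checkpoint, so both always describe the same point in the stream.
        self.committed.update(self.pending)
        self.pending.clear()

    def rollback(self):
        # On recovery, discard updates that were not covered by a checkpoint.
        self.pending.clear()


def run_pipeline(payloads, nodes, store):
    """Tag each input, pass it through every node, and record its state."""
    outputs = []
    for payload in payloads:
        rec = TrackedRecord(payload)
        store.record(rec.tracking_id, "source", "ingested")
        for name, fn in nodes:
            rec.payload = fn(rec.payload)
            store.record(rec.tracking_id, name, "processed")
        outputs.append(rec)
    store.checkpoint()
    return outputs


store = TrackingStore()
nodes = [("upper", str.upper), ("exclaim", lambda s: s + "!")]
results = run_pipeline(["a", "b"], nodes, store)
```

After the run, `store.committed` holds one entry per tracking identifier with the last node that processed the record. A real system would write this information to durable storage and tie the commit to the engine's own checkpoint mechanism.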

Description

Technical field

[0001] The present application relates to the technical field of data processing, specifically to the technical field of cloud computing, and in particular to a data tracking method, apparatus, device, and storage medium for a streaming computing system.

Background

[0002] Streaming computing technology refers to the real-time processing of continuously generated data streams. Compared with batch computing, streaming computing is more time-sensitive. A streaming computing system interfaces with other data transmission systems, receives input data, and outputs the data to a designated system after a series of processing steps. While data is transmitted and processed in the streaming computing system, unexpected data loss may occur, for example because of system bugs or unexpected errors in the underlying storage system.

[0003] The current open source stream computing system is mainly Apache Flink. During the operation and maintenance of the F...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/22, G06F16/23, G06F16/27
CPC: G06F16/22, G06F16/2365, G06F16/27
Inventors: 孙英富, 邢越, 汪婷
Owner: BEIJING BAIDU NETCOM SCI & TECH CO LTD