Data processing method and device, equipment and medium

A technology of data processing and text data, which is applied in the field of knowledge graphs, can solve the problems of cumbersome and complex entity linking, and the inability to link entities, etc., and achieve the effects of improving flexibility and generalization ability, simplifying the process of entity linking, and improving efficiency

Pending Publication Date: 2022-07-19
BEIJING ORION STAR TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

The technical effect of this patented method allows for efficient identification of entities that match their content with specific attributes or features during entity linkage processing. This makes them easier than traditional methods like manual annotation which requires human input.

Problems solved by technology

This patented technical problem addressed in the patents relates to how efficiently establish connections for connecting unknown concepts together based on their attributes such as location within space or time. Current methods require manually create numerous entity vectors beforehand, making it difficult if they don't already exist at all times.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data processing method and device, equipment and medium
  • Data processing method and device, equipment and medium
  • Data processing method and device, equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0025] Example 1: figure 1 A schematic diagram of a data processing process provided by an embodiment of the present invention, the process includes:

[0026] S101: Determine the target entity contained in the text data to be processed.

[0027] The data processing method provided by the embodiment of the present invention is applied to an electronic device, and the electronic device may be an intelligent device such as a robot, or a server.

[0028] In a practical application scenario, when a user needs to query certain information, the query information can be input to the smart device. The query information may be an input query question, for example, "Who is A's wife?", or a descriptive sentence, for example, "A is the spokesperson of the world brand C". Among them, there are many ways for the user to input query information, which may be input by voice, or by inputting query information in the form of text on the display screen of the smart device, and of course, inputting

Embodiment 2

[0041] Embodiment 2: In order to simplify the process of entity linking and provide the efficiency, flexibility, and generalization ability of entity linking, on the basis of the above embodiment, in this embodiment of the present invention, the entity linking model completed by pre-training is based on Text data, knowledge records, and the target type corresponding to the knowledge records, to determine whether the knowledge records match the target entities contained in the text data, including:

[0042] Through the coding network in the entity linking model, the text vector corresponding to the text data, the attribute vector corresponding to the knowledge record, and the category vector corresponding to the target type are respectively determined; and

[0043] Through the decoding network in the entity linking model, based on the text vector, attribute vector, and category vector, it is determined whether the knowledge record matches the target entity contained in the text dat

Embodiment 3

[0113] Embodiment 3: In order to simplify the process of entity linking and provide the efficiency, flexibility, and generalization ability of entity linking, on the basis of the above embodiment, in this embodiment of the present invention, the entity linking model is trained in the following manner:

[0114] Obtain any sample data in the sample set. The sample data includes sample text data, sample knowledge records corresponding to the sample text data, and sample types corresponding to the sample knowledge records. The sample data corresponds to a label, and the label is used to identify the sample knowledge record and the sample text. Whether the entities contained in the data match;

[0115] Through the original entity linking model, based on the sample data, determine the recognition result of whether the sample knowledge record matches the entity contained in the sample text data; and

[0116] Based on the labels and the recognition results, the original entity link model

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data processing method and device, equipment and a medium. As the entity link model is trained in advance, after the target entities contained in the to-be-processed text data are determined, the knowledge records corresponding to the target entities and the target types corresponding to the knowledge records are obtained. And for each knowledge record, whether the knowledge record is matched with the target entity contained in the text data or not can be determined directly based on the text data, the knowledge record and the target type corresponding to the knowledge record through the pre-trained entity linking model, so that entity linking is realized, the entity linking process is simplified, and the user experience is improved. The efficiency of determining whether the knowledge record is matched with the target entity contained in the text data or not is improved, the knowledge record corresponding to any entity can be recognized through the entity link model, whether the knowledge record is matched with the target entity contained in the text data or not is determined, and the flexibility and generalization ability of entity link are improved.

Description

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Owner BEIJING ORION STAR TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products