Text relation automatic labeling method based on similar glossary

An automatic labeling and vocabulary technology, applied to text database querying, unstructured text data retrieval, and natural language data processing.

Pending Publication Date: 2022-01-04
苏州空天信息研究院

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to propose a method for automatically labeling text relations based on a similar vocabulary, in order to address the difficulty of building a knowledge base and the low quality of the generated corpus.

Example Embodiment

[0048] Example

[0049] To verify the effectiveness of the solution of the present invention, the following simulation experiment is carried out to build a person-relationship data set.

[0050] Example: In Five’s earlier days, Kieran Conlon was briefly engaged to dancer Suzanne Mole, who toured with the group.

[0051] The first step is to determine the relationship types that hold between persons. Here, four relationships are selected: parent, couple, sibling, and friend. The corresponding entity types are all persons. The corpus is then obtained from the Internet by searching for the names of these relationships.

[0052] The second step is to generate a similar vocabulary for every occurrence of parent, couple, sibling, and friend in the corpus, and then to consolidate these similar vocabularies into a type vocabulary for each relationship.
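The second step can be sketched as follows. This is a minimal illustration, not the patented implementation: the source does not say how similar words are produced or how the lists are consolidated, so `similar_words` is a hypothetical pluggable function (a masked language model would be one possible choice), the toy lookup table stands in for it, and keeping the `top_k` most frequent substitutes is an assumed consolidation rule.

```python
from collections import Counter

def build_relation_glossary(corpus, relation_names, similar_words, top_k=5):
    """Collect substitutes for each occurrence of a relation name and
    keep the most frequent ones as that relation's type vocabulary."""
    counts = {name: Counter() for name in relation_names}
    for sentence in corpus:
        tokens = sentence.lower().split()
        for i, token in enumerate(tokens):
            if token in counts:
                # similar_words is hypothetical: it returns words that could
                # replace tokens[i] in this sentence.
                counts[token].update(similar_words(tokens, i))
    # Assumed consolidation rule: the relation name plus its top_k substitutes.
    return {
        name: {name} | {word for word, _ in counter.most_common(top_k)}
        for name, counter in counts.items()
    }

# Toy stand-in for a real similar-word generator (illustrative data only).
TOY_SIMILAR = {
    "couple": ["spouse", "married", "engaged"],
    "friend": ["pal", "buddy"],
}

def toy_similar_words(tokens, i):
    return TOY_SIMILAR.get(tokens[i], [])

corpus = [
    "they are a couple now",
    "his friend came over",
    "the couple moved away",
]
glossary = build_relation_glossary(corpus, ["couple", "friend"], toy_similar_words)
print(glossary)
```

Passing the generator in as a parameter keeps the sketch independent of any particular language model or thesaurus.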

[0053] The third step is to apply the named entity recognition model and the open-domain information extraction tool to the sentence

Abstract

The invention provides a method for automatically labeling text relations based on similar glossaries. The method comprises the following steps: determining all relation names to be extracted and their corresponding entity types, and obtaining a corpus containing the relation names; at each place where a relation name appears in the corpus, generating a similar glossary of words that can replace it, and compiling a relation type glossary from these similar glossaries; performing named entity recognition and entity relation triple extraction on sentences; using the named entity recognition results to judge whether the entity pair of each entity relation triple meets the entity-type conditions, and generating a candidate relation set; and generating a similar glossary for each word of the relation phrase in the entity relation triple, judging which candidate relation each word expresses by reference to the relation type glossary, and taking the candidate relation expressed most frequently across all words as the labeled relation, which completes the automatic labeling. The method addresses the high difficulty of knowledge base construction and the low corpus labeling quality of the traditional distant supervision method, and provides a new strategy for constructing the data sets needed by relation extraction models.
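The candidate filtering and voting steps described in the abstract can be sketched as follows. This is an illustrative sketch under stated assumptions, not the patented implementation: the entity type labels, the glossary contents, and the `toy_similar_words` lookup are all invented for the example, and the triple for the example sentence is written by hand rather than produced by a real extraction tool.

```python
from collections import Counter

# Assumed entity-type constraints: all four relations hold between persons.
RELATION_ENTITY_TYPES = {
    "parent": ("PERSON", "PERSON"),
    "couple": ("PERSON", "PERSON"),
    "sibling": ("PERSON", "PERSON"),
    "friend": ("PERSON", "PERSON"),
}

# Illustrative relation type glossary (invented contents).
GLOSSARY = {
    "parent": {"parent", "father", "mother"},
    "couple": {"couple", "spouse", "married", "engaged"},
    "sibling": {"sibling", "brother", "sister"},
    "friend": {"friend", "pal", "buddy"},
}

def candidate_relations(head_type, tail_type, relation_entity_types):
    """Keep the relations whose entity-type pair matches the triple's entities."""
    return [
        relation
        for relation, types in relation_entity_types.items()
        if types == (head_type, tail_type)
    ]

def label_relation(relation_phrase, candidates, glossary, similar_words):
    """Vote over the words of the relation phrase; return the winner or None."""
    votes = Counter()
    for word in relation_phrase.lower().split():
        # similar_words is a hypothetical generator of substitutes for a word.
        expansions = {word, *similar_words(word)}
        for relation in candidates:
            if expansions & glossary[relation]:
                votes[relation] += 1
    if not votes:
        return None
    # On a tie, most_common returns the relation that received a vote first.
    return votes.most_common(1)[0][0]

def toy_similar_words(word):
    return {"engaged": ["married", "betrothed"]}.get(word, [])

# Hand-written triple for the example sentence:
# (Kieran Conlon, "was briefly engaged to", Suzanne Mole)
candidates = candidate_relations("PERSON", "PERSON", RELATION_ENTITY_TYPES)
label = label_relation("was briefly engaged to", candidates, GLOSSARY, toy_similar_words)
print(label)
```

A triple whose relation phrase matches no glossary entry is left unlabeled here, which is one possible way to handle sentences that express none of the target relations.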

Application Information

Owner 苏州空天信息研究院