Dictionary format generation method and electronic device

An electronic device and format technology, applied in the computer field, can solve the problems of trivial sentence segmentation, information integrity damage, unfavorable semantic analysis stage processing, etc., to achieve the effect of assisting semantic analysis and improving the segmentation effect

Active Publication Date: 2018-01-26
UNION MOBILE PAY
View PDF7 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

This patented technology allows for better understanding on how specific types of symbols are distributed within certain parts of an image when analyzing them. It involves capturing images containing these symbol patterns along with their surrounding areas called tokens (also known as markers) attached thereto. These token-related features like numerical values, abbreviations, punctuings, etc., help identify important aspects about this area. By comparing different dictionaries based upon those characteristics between the captured content and what they should have been beforehand, it becomes possible to determine if there may exist any signs indicating unusual behavior such as abnormalities in gene expression levels.

Problems solved by technology

This patented technical problem addressed in previous researches relates to improving extraction accuracy when analyzing specific types of data such as documents written over different languages like Chinese and Japanese. Current solutions require expensive hardware resources and slow down operations due to their static nature.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Dictionary format generation method and electronic device
  • Dictionary format generation method and electronic device
  • Dictionary format generation method and electronic device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] In order to make the purpose, technical solutions and advantages of the present invention clearer, the following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the drawings in the embodiments of the present invention.

[0062] The text information in the embodiment of the present invention refers to the notification information containing special characters sent by organizations such as merchants, operators or enterprises to users, such as courier information containing numbers and / or letters, hotel ticket reservation information, operator tariff information, Bank card usage information or application push information, etc.

[0063] like figure 1 As shown, the embodiment of the present invention provides a method for generating a dictionary format, which can be described as follows.

[0064] S11: Obtain a plurality of text information from at least one data source, where each text information in

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Embodiments of the invention provide a dictionary format generation method and an electronic device, which are used for processing special characters in a text and improving the accuracy of segmentinga field containing a special character string in text analysis. The method includes the steps of obtaining a plurality of textual information pieces from at least one data source, each textual information piece of the plurality of textual information pieces including a special character, and the special characters including numbers and/or letters; extracting at least one semantic segment relatedto the special characters in the plurality of textual information pieces, wherein each semantic segment in the at least one semantic segment includes the special characters and associated characters adjacent to the special characters and the number of characters of the associated characters is less than or equal to a preset number; and determining at least one dictionary format according to the atleast one semantic segment, the at least one dictionary format being used for representing distribution rules of the special characters in the corresponding semantic segment.

Description

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Owner UNION MOBILE PAY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products