Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

8 results about "Text corpus" patented technology

In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed). In corpus linguistics, they are used to do statistical analysis and hypothesis testing, checking occurrences or validating linguistic rules within a specific language territory.

Food material compatibility and inter-restriction relation classification method based on neural network

ActiveCN106844738AGood effectImprove accuracyMedical data miningWeb data indexingText corpusRelation classification
The invention discloses a food material compatibility and inter-restriction relation classification method based on a neural network. The method comprises the steps that data of constitutional science of Chinese medicine is acquired to serve as text corpus; overall modelling is conducted on the acquired text corpus to generate word vectors, so that each non-stop word in the text corpus correspond to one word vector fixed in length; the cosine similarity between two word vectors is used as the similarity between entities corresponding to the two word vectors; for two given food materials, the relation characteristics of the two food materials are represented as a matrix consisting of the word vectors of representation words of food material relation; a cyclic convolution neural network is used, and the characteristics of food material relation serve as inputs of the cyclic convolution neural network to train data of manually annotated food material compatibility and inter-restriction relation. By adopting the method, the food material compatibility and inter-restriction relation can be accurately and rapidly judged, and further a food therapy recommendation system is assisted to enrich food varieties recommended by the therapy recommendation system.
Owner:SOUTH CHINA UNIV OF TECH

Speech support system based on seat speech

ActiveCN111475633AImprove speech skillsNeural architecturesNeural learning methodsSpeech soundSupport system
The invention provides a speech support system based on seat voice, and the system is characterized in that the system comprises a plurality of seat terminals which are held by a seat person; and an analysis server, wherein the seat terminal is provided with a voice acquisition part used for acquiring seat voice when a seat person performs a seat call, and the analysis server is provided with a verbal skill label prediction part used for predicting the seat voice based on a preset verbal skill prediction model and outputting a verbal skill label corresponding to the seat voice and the confidence coefficient of the seat voice; the voice sample classification part is used for classifying the seat voice according to the verbal skill label and forming a plurality of voice sample sets; a sampleacquisition unit that acquires representative samples from each of the speech sample sets; the corpus extraction forming part is used for performing corpus extraction on the representative sample toform a corpus; the verbal skill word and sentence acquisition part is used for traversing the corpus and acquiring verbal skill words and sentences; and the verbal skill support library storage part forms a verbal skill support library according to the verbal skill words and sentences.
Owner:FUDAN UNIV

Industrial data recognition device, related method and related device

InactiveCN110674248AEliminate the influence of artificial factorsHigh precisionSemantic analysisText database indexingFeature extractionEngineering
The invention discloses an industry data recognition device, and the device comprises a corpus construction module which is used for carrying out the corpus construction of the obtained invoice data of a plurality of industries, and obtaining a sparse vector corpus; a to-be-recognized vector acquisition module used for performing feature extraction processing on the to-be-recognized invoice data to obtain a to-be-identified vector; and a recognition processing module used for recognizing the to-be-recognized vector and the sparse vector corpus by utilizing a potential semantic index model to obtain an industry data recognition result. The invoice data to be recognized is recognized through the potential semantic index model and the collected sparse vector corpus, fusion of artificial subjective factors is avoided, a machine learning mode is also avoided for recognition, and the recognition precision and accuracy are improved. The invention also discloses an industry data identificationmethod, a server and a computer readable storage medium, which have the above beneficial effects.
Owner:SERVYOU SOFTWARE GRP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products