Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

58results about "Semantic analysis" patented technology

Method and System for Determining Word Senses by Latent Semantic Distance

InactiveUS20130197900A1Natural language translationSemantic analysisPattern recognitionData set
The invention relates to methods and systems for semantic disambiguation of a plurality of words. A representative method comprises providing a dataset of words associated by meaning into sets of synonyms; locating said sets at respective vertices of a graph according to semantic similarity and semantic relationship; transforming the graph into a Euclidean vector space comprising vectors indicative of respective locations of said sets; identifying a first group of said sets which include a first of said pair of words; identifying a second group of said sets which include a second of said pair of words; determining a closest pair in said vector space of said sets taken from said first and second groups of sets respectively; and outputting a meaning, of said plurality of words based on said closest pair of said sets and at least one of said semantic relationships between said closest pair of said sets.
Owner:SPRINGSENSE

Telecommunication fraud prevention system and method based on big data and machine learning

InactiveCN106970911APrevent fraudulent incidentsCombating Telecom FraudSemantic analysisSubstation equipmentData informationMobile end
The invention discloses a telecommunication fraud prevention system and method based on big data and machine learning. The system comprises a mobile terminal, a big data analysis terminal and a fraud interdiction governance terminal, wherein the mobile terminal is used for performing fraud detection determination on current telecommunication data according to predetermined constraint rules when receiving a short message or an incoming call message, a machine learning algorithm is adopted to detect whether the telecommunication data is a telecommunication fraud, and if the detection result is that the telecommunication data is determined as a telecommunication fraud, fraud data information is uploaded to the big data analysis terminal; the big data analysis terminal is used for performing real-time statistics on the fraud data information uploaded and reported from the mobile terminal and sending fraud early-warning information to the fraud interdiction governance terminal according to a bank card account or/and a phone number with the number of received reports exceeding a certain threshold value; and the fraud interdiction governance terminal is used for taking corresponding measures in time to interdict occurrence of a telecommunication fraud event when receiving the fraud early-warning information. The system can unite the mobile terminal, an operator, a public security institution, a bank and other institutions, quick and effective prevention can be realized, and the telecommunication fraud can be cracked down in time.
Owner:INST OF SOFTWARE APPL TECH GUANGZHOU & CHINESE ACAD OF SCI

Mapping Documents to Associated Outcome based on Sequential Evolution of Their Contents

A method and system is described for modeling the content evolution of an accessed document and predicting an associated outcome for said document. The system accesses a document but can further receive additional tags, metadata, or related information that characterizes the nature of such text collection. The invention applies various processing to separate the document into elements and performs semantic modeling to create a narrative model that describes the evolution of the contents of the elements in terms of their respective sequencing. This system then uses a set of training documents with target values assigned to them to predict an associated outcome for the accessed document. The most relevant subset of a training set can be selected by matching metadata information that characterize the accessed document and a collection of metadata that characterize other broad document sets. Such characterization is done using graph partitioning or other community detection methods from metadata information that characterize the document sets and relations between multiple sets of such documents. The outcome of the method may apply to prediction of economic value of a events described by the accessed document, success measures of the document quality, or discovery of related content with similar associated outcome to the accessed document.
Owner:DUBNOV SHLOMO +1

An optical character recognition error correction method based on natural language recognition

PendingCN109582972AOvercoming the problem of indeterminate resultsSemantic analysisCharacter and pattern recognitionSyntaxOptical character recognition
The invention discloses an optical character recognition error correction method based on natural language recognition, and the method comprises the steps: carrying out the fusion of a lexical analysis model and a semantic analysis model, obtaining a fusion model, and obtaining a high-precision optical character recognition result through employing the fusion model. According to the model, the characteristics of Chinese characters in a lexical model are considered, and meanwhile, significant characteristics such as context relations of Chinese syntactic semantics are considered to correct optical character recognition results, so that the model precision is improved.
Owner:SUNYARD SYST ENG CO LTD

Medical institution dialysis level assessment method, device and equipment and storage medium

ActiveCN109637642ADetermine authenticityThe true reflection of dialysis levelSemantic analysisHealthcare resources and facilitiesData contentIntensive care medicine
The invention discloses a medical institution dialysis level assessment method and device based on semantic recognition, equipment and a storage medium. The method comprises: a dialysis effect feedback instruction triggered by a dialysis patient is received, dialysis data, information of a medical institution and a dialysis sufficiency evaluation report are obtained from a dialysis data managementserver according to the dialysis effect feedback instruction, and the dialysis data and the dialysis sufficiency evaluation report are displayed on a user interface; receiving dialysis effect evaluation information made by the dialysis patient; extracting at least one keyword from the dialysis effect evaluation information; determining dialysis data contents corresponding to the keywords in the dialysis data; and sending each keyword, the dialysis data content corresponding to each keyword, the information of the medical institution and the dialysis sufficiency assessment report to a medicalinstitution assessment server. In this way, the patient can assist relevant departments in examining the dialysis level of the medical institution, and it is guaranteed that the examination result cantruly reflect the dialysis level of the medical institution.
Owner:深圳平安医疗健康科技服务有限公司

Paragraph merging method, device, storage medium and electronic equipment

ActiveCN110362832AImprove accuracyRealize the mergerSemantic analysisSpecial data processing applicationsHidden layerSemantic vector
The invention provides a paragraph merging method, a paragraph merging device, a storage medium and electronic equipment. The method comprises the steps: determining a position vector and a semantic vector of text data; sequentially selecting a plurality of target text data from the document content; determining a hidden layer vector of the target text data; judging whether the target text data and other target text data belong to the same paragraph or not according to the hidden layer vector of the target text data; then sequentially selecting the target text data again, and repeating the process until all text data in the document content is traversed; and counting all judgment results, and combining all text data belonging to the same paragraph into one paragraph according to a positionsequence. According to the paragraph merging method, the paragraph merging device, the storage medium and the electronic equipment provided by the embodiment of the invention, the judgment basis comprises the position vector and the semantic vector. The context semantic information in a larger range can be considered. The judgment result is more accurate, so that the paragraph merging accuracy can be optimized.
Owner:BEIJING SHANNON HUIYU TECH CO LTD

Managing and control system based on sensitive content perception

InactiveCN105868905ASolve the problems of low processing efficiency and low customer satisfactionGood qualitySemantic analysisResourcesService flowInformation analysis
The invention relates to a managing and control system based on sensitive content perception. The system includes a sensitive content processing system and a cooperation service processing system, wherein the sensitive content processing system further includes a sensitive information acquisition unit, a sensitive information analysis unit and a sensitive service early warning analysis unit. The sensitive content processing system conducts analysis and selection on service flow data and service work order content data and conducts early warning mark output on corresponding content work order. The cooperation service processing system starts a corresponding cooperation service flow and defines the started cooperation service flow is composed of which specific service steps. According to the invention, based on the sensitive content perception, the system establishes quality management and control based on sensitive content perception, addresses the problems of inefficient cooperation service processing and low client satisfaction, conducts qualitative and quantitative analysis on sensitive information, forms real-time online early warning of sensitive service and guarantees ability of processing sensitive service.
Owner:STATE GRID TIANJIN ELECTRIC POWER +1

CN-DBpedia-based entity identification and linking system and method

ActiveCN108491375AExtended vocabulary spaceReasonable calculationSemantic analysisSpecial data processing applicationsEntity linkingLearning based
The present invention discloses a CN-DBpedia-based entity identification and linking system and method. The system comprises an entity linking module and an entity identification module; the entity linking module comprises a synonym matching unit and an entity linking unit; and the entity identification module comprises a tokenizer, a word probability calculation unit, and an entity discriminatingunit. According to the technical scheme of the present invention, a semantic relationship between an entity and a word is constructed, so that the relationship with the entity can be mined in a few of context; a machine learning-based entity recognition algorithm is combined with an unsupervised word segmentation algorithm, the rationality of entity name division is considered from the perspective of globality, the vocabulary space of word segmentation is expanded, and the word formation probability of entity words can be calculated by using a more reasonable algorithm; and with a linking first and then identification manner, the semantic information of the text is fully utilized in the entity identification, and better word segmentation and entity identification are realized.
Owner:FUDAN UNIV

Public contribution combination request repeatability detecting method based on hybrid similarity

ActiveCN108182181AImprove review efficiencyAvoid Duplicate Review EffortsSemantic analysisSpecial data processing applicationsData setRepeatability
The invention belongs to the field of software coordinative development, and discloses a public contribution combination request repeatability detecting method based on hybrid similarity. The method includes the steps of calculating the text similarity between a newly-submitted public contribution combination request and a historical public contribution combination request, calculating the variation similarity between the newly-submitted public contribution combination request and the historical public contribution, searching a public coordinative development platform for a historical repeatedcontribution data set, combining the text similarity and the variation similarity by means of a weight calculating method based on a greedy search strategy under the training of the data set to calculate the hybrid similarity of public contributions, and finally obtaining a list of historical public contribution combination requests the most probably repeated with a given public contribution combination request according to the value of the hybrid similarity. The public contribution repeatability can be detected in time, repeated artificial code inspection work is avoided, and the public contribution inspection efficiency is improved.
Owner:NAT UNIV OF DEFENSE TECH

Empathy fostering based on behavioral pattern mismatch

ActiveUS20190102696A1Semantic analysisMachine learningBehavioral patternBioinformatics
A cognitive system collects online behaviors of a user and an affinity group of users who are related (e.g. by relationship, or behavioral similarities) to the user. A knowledge base of behavior and sentiment patterns is produced and maintained. If real-time data for the user shifts in behavior and / or sentiment and significantly deviates from established patterns, the system looks for a similar behavior and / or sentiment pattern shift among members of the affinity group. If the affinity group patterns shift in a manner similar to the first user's pattern shift, the cognitive system, in response, updates the knowledge base with information related to the shift, thereby adding knowledge to the long-term patterns. If the cognitive system finds that the user's behavior and / or sentiment pattern shift differs significantly from the affinity group, the system generates an empathy fostering alert message and sends it to one or more recipients.
Owner:IBM CORP

Topic classification method and device and computer equipment

PendingCN112036485AAvoid interferenceReduce the effects of noiseSemantic analysisCharacter and pattern recognitionInformation repositoryBag-of-words model
The invention relates to a big data technology, and discloses a topic classification method, which comprises the steps of obtaining a bag-of-words model corresponding to each article in an informationbase, the bag-of-words model being a subject term combination formed after stop words and part-of-speech screening, and the bag-of-words model comprising subject terms and occurrence frequencies corresponding to the subject terms; taking the bag-of-words models corresponding to the articles as topics of the articles in a one-to-one correspondence manner, and inputting the bag-of-words models intoan LDA topic model; judging whether the iterative training process of topic classification of the topics of the articles is converged or not by the LDA topic model according to a preset topic number;and if yes, obtaining classification information of topic classification corresponding to each output article when LDA topic model training converges. Bag-of-words models respectively corresponding to articles are respectively formed after stop words and part-of-speech screening and serve as feature input of the LDA topic model, interference of words without content value and appearing at high frequency is avoided, and noise influence in the topic classification process is eliminated.
Owner:PING AN TECH (SHENZHEN) CO LTD

Multi-language machine translation method and device, electronic equipment and storage medium

PendingCN113239710AImprove translationImprove generalization abilityNatural language translationSemantic analysisSemantic alignmentSemantic representation
The invention provides a multi-language machine translation method and device, electronic equipment and a storage medium. The method comprises the steps of: determining a to-be-translated source language text; and inputting the source language text into a multi-language translation model to obtain a target language text output by the multi-language translation model, wherein the multi-language translation model is constructed based on a pre-training encoder, and the pre-training encoder is obtained through training by taking a unified coding result obtained by coding a multi-language parallel sentence pair as a target. According to the method, the device, the electronic equipment and the storage medium provided by the invention, unified semantic representation of sentences in different languages can be learned through the pre-training encoder, and then the multi-language translation model is obtained based on the pre-training encoder, so that the machine translation model can learn the semantic alignment relationship more easily, and therefore, the multi-language machine translation effect can be improved, and the generalization performance of the multi-language machine translation model is improved.
Owner:合肥讯飞数码科技有限公司

Annual report risk mining system and method based on phrase vector construction

PendingCN114492392AAvoid subjectivityImprove accuracyFinanceSemantic analysisCosine similarityEngineering
The invention relates to an annual report risk mining system and method based on phrase vector construction. The system comprises an annual report risk information extraction module, a risk factor mining module, a risk phrase vector construction module and an automatic statistics and visualization module. According to the method, related annual reports can be automatically downloaded according to input listed company stock codes and year ranges, risk part texts in the annual reports are extracted, risk phrases are mined from the risk part texts, and a phrase knowledge base is constructed for further mining risk factors; the method comprises the following steps: performing model training on a phrase knowledge base by a Doc2Vector algorithm, and reasoning representative vectors of risk phrases and risk factors; according to the method, the cosine similarity between representative vectors is calculated, and the display information and co-occurrence information of risk factors in annual reports are automatically counted. Compared with the prior art, the method has the advantages that subjectivity of manual judgment is avoided, meanwhile, a large amount of manual operation is saved, and the method is suitable for carrying out risk mining on a large batch of annual reports.
Owner:SHANGHAI INST OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products