Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

7 results about "Speech identification" patented technology

Speech ID helps protect the authenticity and accuracy of speech by helping the hearing aid to prioritize conversation in difficult listening situations. This feature works to isolate the unique frequencies associated with various letters and words and helps maintain the crispness of conversation. Acuity Directionality.

Speech Recognition with Parallel Recognition Tasks

ActiveUS20100004930A1Improve accuracyImprove optimizationSpeech recognitionConfidence intervalSubject matter
The subject matter of this specification can be embodied in, among other things, a method that includes receiving an audio signal and initiating speech recognition tasks by a plurality of speech recognition systems (SRS's). Each SRS is configured to generate a recognition result specifying possible speech included in the audio signal and a confidence value indicating a confidence in a correctness of the speech result. The method also includes completing a portion of the speech recognition tasks including generating one or more recognition results and one or more confidence values for the one or more recognition results, determining whether the one or more confidence values meets a confidence threshold, aborting a remaining portion of the speech recognition tasks for SRS's that have not completed generating a recognition result, and outputting a final recognition result based on at least one of the generated one or more speech results.
Owner:GOOGLE LLC

Self-adaptive endpoint detection method and self-adaptive endpoint detection system for isolate word speech recognition

InactiveCN103366739AGood effectWith noise immunitySpeech recognitionSpeech identificationZero-crossing rate
The invention discloses a self-adaptive endpoint detection method and a self-adaptive endpoint detection system for isolate word speech recognition. The self-adaptive endpoint detection method for isolate word speech recognition comprises the following steps: a, a voice input step, wherein a voice signal containing an isolate word to be recognized is input; b, a voice preprocessing step, wherein the voice signal is subjected to amplitude translation and normalization and framing processing operation, and short time average energy and a short time average zero-crossing rate of each frame of voice are calculated; c, an isolate word endpoint rough detection step, wherein isolate word endpoints are roughly estimated through utilization of the short time average energy and the short time average zero-crossing rate of each frame of the voice signal and constraint on the shortest length of continuous voice frames before and after the end points, d, a detection threshold self-adaptive adjustment and accurate endpoint detection step, wherein through utilization of constraint on the smallest time duration and the largest time duration of the isolate word, the detection threshold is subjected to dynamic adjustment operation, the voice endpoints are subjected to front and back fine adjustment, and accurate isolate word endpoints are obtained; e, an isolate word endpoint output and isolate word voice recognition step, wherein the accurate isolate word endpoints are output and isolate word recognition is realized by using voice recognizing technologies.
Owner:ZHENGZHOU SCI TECH INFORMATION RESINST

Human-computer interaction method and system based on multi-modal historical response result

ActiveCN106205611ASpeech recognitionStatistical modelSpeech identification
The invention provides a human-computer interaction method and system based on a multi-modal historical response result. The human-computer interaction method based on the multi-modal historical response result includes the steps of receiving a voice instruction of a user; performing voice recognition on the voice instruction to obtain a plurality of response results; acquiring a plurality of pieces of input feature information, and calculating the joint probability of each various response result according to a probability model and the plurality of pieces of input feature information; and determining the response result having the largest joint probability to respond to the voice instruction, wherein the probability model is a stochastic mathematical model established on the basis of the historical response results, and wherein each joint probability is the product of the probabilities of the response result under the plurality of pieces of input feature information. The embodiment of the invention enriches the dimension of the feature information inputted in determining the response result, comprehensively considers the influence of various factors on the true intention of the user, improves the accuracy of the response result, and improves the user experience in the human-computer interaction process.
Owner:北京如布科技有限公司

Voice language recognition method and system based on confidence degree

ActiveCN108172212AImprove accuracyImprove Speech Recognition EfficiencySpeech recognitionMulti languageSpeech identification
The invention provides a voice language recognition method and system based on the confidence degree, aiming to solve the problem that in the existing voice recognition, the language recognition efficiency is low. The method comprises the following steps: S1, extracting a voice segment from each voice segment as a preset voice segment, comparing with a preset language database, acquiring the language information matched with the preset voice segment; S2, acquiring the language confidence degree and the confidence degree average value of each voice segment according to the language information,judging whether the confidence degree mean value is larger than a preset credibility threshold value or not, if yes, the current language is used as the default language of the voice information; S3,if not, screening all the voice segments through a preset screening condition, until the mean value of the language confidence degree is larger than the preset threshold value, and acquiring the voice fragment obtained by the screening, and turning to the step S1. By adopting the voice language recognition method and device, the voice recognition efficiency is improved, and meanwhile, the recognition accuracy of the multi-language voice information is further improved.
Owner:HENGQIN INT INTPROP EXCHANGE CENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products