Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

120results about "Speech recognition" patented technology

Speech Recognition with Parallel Recognition Tasks

ActiveUS20100004930A1Improve accuracyImprove optimizationSpeech recognitionConfidence intervalSubject matter
The subject matter of this specification can be embodied in, among other things, a method that includes receiving an audio signal and initiating speech recognition tasks by a plurality of speech recognition systems (SRS's). Each SRS is configured to generate a recognition result specifying possible speech included in the audio signal and a confidence value indicating a confidence in a correctness of the speech result. The method also includes completing a portion of the speech recognition tasks including generating one or more recognition results and one or more confidence values for the one or more recognition results, determining whether the one or more confidence values meets a confidence threshold, aborting a remaining portion of the speech recognition tasks for SRS's that have not completed generating a recognition result, and outputting a final recognition result based on at least one of the generated one or more speech results.
Owner:GOOGLE LLC

Method and apparatus for processing an input speech signal during presentation of an output audio signal

InactiveUS6937977B2Accurately establishedTwo-way loud-speaking telephone systemsAutomatic call-answering/message-recording/conversation-recordingStart timeCommunications system
A start of an input speech signal is detected during presentation of an output audio signal and an input start time, relative to the output audio signal, is determined. The input start time is then provided for use in responding to the input speech signal. In another embodiment, the output audio signal has a corresponding identification. When the input speech signal is detected during presentation of the output audio signal, the identification of the output audio signal is provided for use in responding to the input speech signal. Information signals comprising data and / or control signals are provided in response to at least the contextual information provided, i.e., the input start time and / or the identification of the output audio signal. In this manner, the present invention accurately establishes a context of an input speech signal relative to an output audio signal regardless of the delay characteristics of the underlying communication system.
Owner:AUVO TECH +1

Speech interactive training system and speech interactive training method

ActiveCN102063903AImprove training effectImprove the level ofSpeech recognitionEvaluation resultSpeech training
The invention relates to a speech interactive training system and a speech interactive training method. The system comprises a user selection module, a speech interactive training module, a user feedback module, a speech evaluation module and a result feedback module, wherein the user selection module is used for acquiring training contents selected by a user; the speech interactive training module is used for displaying the training contents to the user in a multimode guiding mode to guide the user to perform a speech training; the user feedback module is used for collecting a fed-back speech and a lip video corresponding to the speech; the speech evaluation module is used for receiving the speech fed back by the user and the lip video corresponding to the speech, and automatically evaluating the speech training of the user and giving an evaluation result; and the result feedback module is used for feeding the evaluation result back to the user so that the user can correct and adjust the speech training. The speech interactive training system is used for automatically evaluating the speech training of the user, giving the evaluation result and feeding the evaluation result back to the user, and then the user finds out the level of the personal speech training according to the evaluation result and corrects and adjusts the personal speech training to further improve the speech level, so the rehabitation training effect of a speech impediment is greatly enhanced.
Owner:SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI

Client-server architecture for automatic speech recognition applications

ActiveUS20150120290A1Speech recognitionEngineeringMessage oriented middleware
A client-server architecture for Automatic Speech Recognition (ASR) applications, includes: (a) a client-side including: a client being part of distributed front end for converting acoustic waves to feature vectors; VAD for separating between speech and non-speech acoustic signals; adaptor for WebSockets; and (b) a server side including: a web layer utilizing HTTP protocols and including a Web Server having a Servlet Container; an intermediate layer for transport based on Message-Oriented Middleware being a message broker; a recognition server and an adaptation server both connected to said intermediate layer; a Speech processing server; a Recognition Server for instantiation of a recognition channel per client; an Adaptation Server for adaptation acoustic and linguistic models for each speaker; a Bidirectional communication channel between a Speech processing server and client side; and a Persistent layer for storing a Language Knowledge Base connected to said Speech processing server.
Owner:DIXILANG

Intelligent voice interaction realization method and device, computer equipment and storage medium

InactiveCN108597509AVoice interaction sensibilityVoice interaction anthropomorphismSpeech recognitionResponse strategySpeech sound
The invention discloses an intelligent voice interaction realization method and device, computer equipment and a storage medium. The intelligent voice interaction realization method comprises the following steps that a user query from an intelligent voice device is acquired, wherein the query is input in the process of voice interaction between the user and the intelligent voice device; a conversation scene corresponding to the query is determined; and a response voice is generated and returned to the intelligent voice device for playing according to a scene conversation response strategy corresponding to the conversation scene. According to the scheme of the intelligent voice interaction realization method, the conversation scenes are distinguished, and different scene dialogue response strategies are correspondingly used according to the different conversation scenes so as to express appropriate voice personality to enable voice interaction to be more perceptual, anthropomorphic andintelligent, and interactive experience more in line with human conversation habits is brought for users.
Owner:BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Methods and apparatus for improving voice quality in an environment with noise

ActiveUS7260209B2Two-way loud-speaking telephone systemsGain controlEnvironmental noiseNoise level
A method for improving a downlink signal received by a listener on a phone is disclosed. The method includes calculating an environment noise level of the listener and filtering and adjusting gain of the downlink signal based on the environment noise level.
Owner:TELLABS OPERATIONS

Self-adaptive endpoint detection method and self-adaptive endpoint detection system for isolate word speech recognition

InactiveCN103366739AGood effectWith noise immunitySpeech recognitionSpeech identificationZero-crossing rate
The invention discloses a self-adaptive endpoint detection method and a self-adaptive endpoint detection system for isolate word speech recognition. The self-adaptive endpoint detection method for isolate word speech recognition comprises the following steps: a, a voice input step, wherein a voice signal containing an isolate word to be recognized is input; b, a voice preprocessing step, wherein the voice signal is subjected to amplitude translation and normalization and framing processing operation, and short time average energy and a short time average zero-crossing rate of each frame of voice are calculated; c, an isolate word endpoint rough detection step, wherein isolate word endpoints are roughly estimated through utilization of the short time average energy and the short time average zero-crossing rate of each frame of the voice signal and constraint on the shortest length of continuous voice frames before and after the end points, d, a detection threshold self-adaptive adjustment and accurate endpoint detection step, wherein through utilization of constraint on the smallest time duration and the largest time duration of the isolate word, the detection threshold is subjected to dynamic adjustment operation, the voice endpoints are subjected to front and back fine adjustment, and accurate isolate word endpoints are obtained; e, an isolate word endpoint output and isolate word voice recognition step, wherein the accurate isolate word endpoints are output and isolate word recognition is realized by using voice recognizing technologies.
Owner:ZHENGZHOU SCI TECH INFORMATION RESINST

Information processing apparatus and information processing method

ActiveUS20080140423A1Speech recognitionInformation processingExecution unit
An information processing apparatus performs a process in accordance with a command. The information processing apparatus includes a first selection unit configured to refer to a storage unit that stores a plurality of recognition commands for inputting the command by speech, recognize input speech and select a command based on the recognized input speech, and a second selection unit configured to sequentially select a plurality of commands that correspond to a plurality of recognition commands stored in the storage unit. The information processing apparatus further includes a process determination unit configured to select either the first selection unit or the second selection unit based on an operation performed on a predetermined operation unit, and an execution unit configured to execute a command which is selected by one of the selection units that is selected by the process determination unit.
Owner:CANON KK

Voice taxi calling method, voice taxi calling device and voice taxi calling system

The invention belongs to the technical field of mobile terminals, and discloses a voice taxi calling method comprising the steps that voice information of a user is detected in real time; when the mobile terminal responds to preset awakening information included in the voice information of the user under the standby state, a taxi calling software client side is awakened; and when the taxi calling software client side responds to destination information included in the voice information of the user, current position information of the mobile terminal is acquired, and the current position information and the destination information are transmitted to a taxi calling software server so that the taxi calling software server is enabled to start the taxi calling flow. The voice information of the user is identified, the awakening information and the destination information are acquired from the voice information of the user, the taxi calling software client side is awakened according to the awakening information and the current position information of the mobile terminal is acquired, and the destination information and the current position information are transmitted to the taxi calling software server so as to start the taxi calling flow. The taxi calling service can be realized by inputting the destination information for one time through the voice information.
Owner:LETV HLDG BEIJING CO LTD +1
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products