Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

149 results about "Speech sound" patented technology

Speech sound. noun. 1 : any one of the smallest recurrent recognizably same constituents of spoken language produced by movement or movement and configuration of a varying number of the organs of speech in an act of ear-directed communication.

Speech Recognition with Parallel Recognition Tasks

ActiveUS20100004930A1Improve accuracyImprove optimizationSpeech recognitionConfidence intervalSubject matter
The subject matter of this specification can be embodied in, among other things, a method that includes receiving an audio signal and initiating speech recognition tasks by a plurality of speech recognition systems (SRS's). Each SRS is configured to generate a recognition result specifying possible speech included in the audio signal and a confidence value indicating a confidence in a correctness of the speech result. The method also includes completing a portion of the speech recognition tasks including generating one or more recognition results and one or more confidence values for the one or more recognition results, determining whether the one or more confidence values meets a confidence threshold, aborting a remaining portion of the speech recognition tasks for SRS's that have not completed generating a recognition result, and outputting a final recognition result based on at least one of the generated one or more speech results.
Owner:GOOGLE LLC

Method and apparatus for processing an input speech signal during presentation of an output audio signal

InactiveUS6937977B2Accurately establishedTwo-way loud-speaking telephone systemsAutomatic call-answering/message-recording/conversation-recordingStart timeCommunications system
A start of an input speech signal is detected during presentation of an output audio signal and an input start time, relative to the output audio signal, is determined. The input start time is then provided for use in responding to the input speech signal. In another embodiment, the output audio signal has a corresponding identification. When the input speech signal is detected during presentation of the output audio signal, the identification of the output audio signal is provided for use in responding to the input speech signal. Information signals comprising data and / or control signals are provided in response to at least the contextual information provided, i.e., the input start time and / or the identification of the output audio signal. In this manner, the present invention accurately establishes a context of an input speech signal relative to an output audio signal regardless of the delay characteristics of the underlying communication system.
Owner:AUVO TECH +1

Classroom behavior monitoring system and method based on face and voice recognition

ActiveCN106851216AImprove developmentImprove learning effectSpeech analysisClosed circuit television systemsFacial expressionSpeech sound
The invention discloses a classroom behavior monitoring system and method based on face and voice recognition. The method comprises the following steps: a camera acquires the video information of students and teachers in classrooms; voice recording equipment acquires the voice information of the students and teachers in the classrooms; a main control processor preprocesses the received video information of the students and teachers and extracts the facial expression features and behavior features of the students and teachers; the main control processor processes the received voice information of the students and extracts the voice features of the students; and the main control processor processes the received voice information of the teachers, extracts the voice features of the teachers, calculates the scores of the teaching effect of the teachers, evaluates the teaching effect of the teachers according to the scores, and provides guidance suggestions. According to the classroom behavior monitoring system and method disclosed by the invention, the classroom behaviors of the teachers and students in the classrooms are observed, and thus the accuracy and objectivity of the evaluation can be increased, the teaching methods can be improved, and the teaching quality can be increased.
Owner:SHANDONG NORMAL UNIV

Systems for monitoring proximity to prevent loss or to assist recovery

A portable proximity alarm apparatus comprising a Bluetooth system and an alarm monitors the presence of a portable electronic device equipped with a compatible transceiver within range and alarms when that device leaves its range. On detecting disconnection, the proximity alarm automatically tries to reconnect. A portable proximity alarm apparatus with an optional voice mode allows to additionally use the unit as a headset when an earpiece is folded. A portable proximity alarm apparatus with relay functionality allows using a Bluetooth headset and proximity alarm functions unobtrusively on most mobile phones.
Owner:OPTIMA DIRECT LLC

Speech interactive training system and speech interactive training method

ActiveCN102063903AImprove training effectImprove the level ofSpeech recognitionEvaluation resultSpeech training
The invention relates to a speech interactive training system and a speech interactive training method. The system comprises a user selection module, a speech interactive training module, a user feedback module, a speech evaluation module and a result feedback module, wherein the user selection module is used for acquiring training contents selected by a user; the speech interactive training module is used for displaying the training contents to the user in a multimode guiding mode to guide the user to perform a speech training; the user feedback module is used for collecting a fed-back speech and a lip video corresponding to the speech; the speech evaluation module is used for receiving the speech fed back by the user and the lip video corresponding to the speech, and automatically evaluating the speech training of the user and giving an evaluation result; and the result feedback module is used for feeding the evaluation result back to the user so that the user can correct and adjust the speech training. The speech interactive training system is used for automatically evaluating the speech training of the user, giving the evaluation result and feeding the evaluation result back to the user, and then the user finds out the level of the personal speech training according to the evaluation result and corrects and adjusts the personal speech training to further improve the speech level, so the rehabitation training effect of a speech impediment is greatly enhanced.
Owner:SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI

Client-server architecture for automatic speech recognition applications

ActiveUS20150120290A1Speech recognitionEngineeringMessage oriented middleware
A client-server architecture for Automatic Speech Recognition (ASR) applications, includes: (a) a client-side including: a client being part of distributed front end for converting acoustic waves to feature vectors; VAD for separating between speech and non-speech acoustic signals; adaptor for WebSockets; and (b) a server side including: a web layer utilizing HTTP protocols and including a Web Server having a Servlet Container; an intermediate layer for transport based on Message-Oriented Middleware being a message broker; a recognition server and an adaptation server both connected to said intermediate layer; a Speech processing server; a Recognition Server for instantiation of a recognition channel per client; an Adaptation Server for adaptation acoustic and linguistic models for each speaker; a Bidirectional communication channel between a Speech processing server and client side; and a Persistent layer for storing a Language Knowledge Base connected to said Speech processing server.
Owner:DIXILANG

Intelligent voice interaction realization method and device, computer equipment and storage medium

InactiveCN108597509AVoice interaction sensibilityVoice interaction anthropomorphismSpeech recognitionResponse strategySpeech sound
The invention discloses an intelligent voice interaction realization method and device, computer equipment and a storage medium. The intelligent voice interaction realization method comprises the following steps that a user query from an intelligent voice device is acquired, wherein the query is input in the process of voice interaction between the user and the intelligent voice device; a conversation scene corresponding to the query is determined; and a response voice is generated and returned to the intelligent voice device for playing according to a scene conversation response strategy corresponding to the conversation scene. According to the scheme of the intelligent voice interaction realization method, the conversation scenes are distinguished, and different scene dialogue response strategies are correspondingly used according to the different conversation scenes so as to express appropriate voice personality to enable voice interaction to be more perceptual, anthropomorphic andintelligent, and interactive experience more in line with human conversation habits is brought for users.
Owner:BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Airport passenger registered luggage real-time tracking system and usage method

InactiveCN106327121AFlexible timeFlexible scheduleCo-operative working arrangementsNavigation instrumentsInformatizationMonitoring system
The invention provides an airport passenger registered luggage real-time tracking system. The airport passenger registered luggage real time tracking system is characterized by comprising a radio frequency label adhered to the passenger registered luggage, an identification and reading device, a background monitoring system, a database and a personal mobile terminal APP; the background monitoring system can realize functions of query, display, statistics and abnormal alarm on the registered luggage of the passenger; the database is used for storing information data of passengers and registered luggage thereof; the personal mobile terminal APP can realize real-time query and can display positions and state information of the registered luggage of the passengers, and can perform luggage abnormal state broadcasting through text information and a voice prompt; and the personal mobile terminal is in interconnection and intercommunication with an airport passenger registered luggage real time tracking system through an airport mobile communication network. The airport passenger registered luggage real time tracking system and the usage method use the informatization mean to manage and monitor the safety check and the transport state of the registered luggage, and can make the passenger quickly master the real-time position and state information of the registered luggage.
Owner:贾鹏

Eye guard with voice indication

InactiveUS20080082179A1Less interferenceIncrease power consumptionComputer controlElectric controllersElectricityDisplay device
An eye guard with voice indication is provided, and it includes an eye guard body, a filter, a display device and a voice output device. The display device is mounted inside the eye guard body to display a circumstance value or operating condition information. Furthermore, the voice output device is electrically connected with the display device and includes a main controller, a power supply, a sound output controller, a sensor circuit and a drive circuit. The sensor circuit has at least one sensor exposed on an external surface of the eye guard body and defined in a supposed centerline of the filter to detect the surrounding circumstance. Thus, the circumstance value and the operation condition information are noticed in the form of voice and vision simultaneously for the operators.
Owner:YANG YEA CHYI

Self-adaptive endpoint detection method and self-adaptive endpoint detection system for isolate word speech recognition

InactiveCN103366739AGood effectWith noise immunitySpeech recognitionSpeech identificationZero-crossing rate
The invention discloses a self-adaptive endpoint detection method and a self-adaptive endpoint detection system for isolate word speech recognition. The self-adaptive endpoint detection method for isolate word speech recognition comprises the following steps: a, a voice input step, wherein a voice signal containing an isolate word to be recognized is input; b, a voice preprocessing step, wherein the voice signal is subjected to amplitude translation and normalization and framing processing operation, and short time average energy and a short time average zero-crossing rate of each frame of voice are calculated; c, an isolate word endpoint rough detection step, wherein isolate word endpoints are roughly estimated through utilization of the short time average energy and the short time average zero-crossing rate of each frame of the voice signal and constraint on the shortest length of continuous voice frames before and after the end points, d, a detection threshold self-adaptive adjustment and accurate endpoint detection step, wherein through utilization of constraint on the smallest time duration and the largest time duration of the isolate word, the detection threshold is subjected to dynamic adjustment operation, the voice endpoints are subjected to front and back fine adjustment, and accurate isolate word endpoints are obtained; e, an isolate word endpoint output and isolate word voice recognition step, wherein the accurate isolate word endpoints are output and isolate word recognition is realized by using voice recognizing technologies.
Owner:ZHENGZHOU SCI TECH INFORMATION RESINST

Devices, Systems and Methods for Proactive Call Context, Call Screening and Prioritization

InactiveUS20100151839A1Automatic call-answering/message-recording/conversation-recordingSpecial service for subscribersSpeech soundOn demand
Devices, systems and methods are disclosed which enable a user of a communications device to receive a notification of a priority and a context of a call before answering the call. A caller records a short phrase to be played on the callee's device either in place of ringing or following a predetermined amount of rings controlled by the callee's preferences. Among the callee's preferences is a selection of priority, such as among low, high, and critical. When callee's communications device is in any of these modes, every incoming call is intercepted giving the caller the opportunity to provide a short phrase for the context of the call and choose a priority level. This short phrase is played on the callee's communications device if the specified priority of the call matches or exceeds the current mode set on the callee's communications device, and the callee may answer. If the specified priority of the call falls below the current mode set, then the short phrase and priority are saved, but the call is ignored. Optional device configuration allows a special tone, vibration pattern, or beep to occur on the callee's communications device indicating a call has been ignored. The recorded phrase can then be reviewed on demand. The above solution can also be extended to provide a quick glimpse into the voicemail, either in place of the current tone indicating a voicemail has arrived, or immediately following.
Owner:AT&T INTPROP I L P
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products