Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

66results about "Speech analysis" patented technology

Classroom behavior monitoring system and method based on face and voice recognition

ActiveCN106851216AImprove developmentImprove learning effectSpeech analysisClosed circuit television systemsFacial expressionSpeech sound
The invention discloses a classroom behavior monitoring system and method based on face and voice recognition. The method comprises the following steps: a camera acquires the video information of students and teachers in classrooms; voice recording equipment acquires the voice information of the students and teachers in the classrooms; a main control processor preprocesses the received video information of the students and teachers and extracts the facial expression features and behavior features of the students and teachers; the main control processor processes the received voice information of the students and extracts the voice features of the students; and the main control processor processes the received voice information of the teachers, extracts the voice features of the teachers, calculates the scores of the teaching effect of the teachers, evaluates the teaching effect of the teachers according to the scores, and provides guidance suggestions. According to the classroom behavior monitoring system and method disclosed by the invention, the classroom behaviors of the teachers and students in the classrooms are observed, and thus the accuracy and objectivity of the evaluation can be increased, the teaching methods can be improved, and the teaching quality can be increased.
Owner:SHANDONG NORMAL UNIV

Payment authentication method, device thereof and system thereof

InactiveCN103679452ASpeech analysisIndividual entry/exit registersText messagingOperational costs
The invention discloses a payment authentication method, a device thereof and a system thereof, belonging to the computer technical field. The method comprises the following steps of receiving a payment authentication request sent by a terminal, detecting whether the identification information in the payment authentication request is same with the prestored identification information, extracting a current voice characteristic if the identification information in the payment authentication request is same with the prestored identification information, matching the current voice characteristic with a prestored voiceprint model, and sending authentication reply information for allowing a payment operation to the terminal if the current voice characteristic is successfully matched with the prestored voiceprint model. According to the payment authentication method, the device thereof and the system thereof, a current voice signal is confirmed through the voiceprint model, after the confirmation is successful, the subsequent payment operation is allowed, a problem that a server needs to send an authentication message and the operation cost is increased in the payment operation process in the prior art is solved, an effect that the payment security can be greatly raised by only using the voiceprint identification of the voice signal is achieved, and the operation cost brought by message authentication is greatly reduced.
Owner:TENCENT TECH (SHENZHEN) CO LTD

Sound source recording apparatus and method adaptable to operating environment

ActiveUS20110103617A1Increase sound source recognition capabilityGain controlElectronic editing digitised analogue information signalsSound sourcesEngineering
Disclosed herein is a sound source recording apparatus and method adaptable to an operating environment, which can record a target sound source at a predetermined level without being affected by characteristics of the sound source or ambient noise. A target sound source is separated from a sound source signal received through an array of microphones and a recording sound pressure level and a gain are estimated using a reference sound pressure level and a reference distance for the target sound source, thereby controlling or adjusting the gain of the microphones.
Owner:SAMSUNG ELECTRONICS CO LTD

Method, device and equipment for recording synchronization

The invention discloses a method for recording synchronization, a device for the recording synchronization and equipment for the recording synchronization and belongs to the field of terminal equipment. The method comprises the steps that an audio data stream is obtained in the process of recording; the obtained audio data stream is coded according to a preset coded format and data generated by coding are written in an audio file; the data are read from the audio file which involves in the process of writing; the data which are read each time are uploaded to a server, so that the synchronization is conducted by the server according to the received data. The device comprises an audio data stream obtaining module, a writing module, a data reading module and an uploading module. Real-time recording synchronization is achieved, the effects that storage can be conducted at the moment of recording and uploading can be conducted at the moment of the storage are achieved, recording content can be recovered to the maximum extent according to the conducted synchronization on the server when the recording is not finished and interrupted by accident or deleted by accident, safety of the audio file is protected, the stability of the audio file is improved and the purpose of recovering the data is achieved.
Owner:XIAOMI INC

Video data fraud detection method and device, computer equipment and storage medium

PendingCN110781916AImprove accuracyIncrease diversitySpeech analysisAcquiring/recognising facial featuresFeature vectorData set
The invention relates to a fraud detection method and device for video data, computer equipment and a storage medium. The method comprises the following steps: acquiring to-be-detected video data; extracting image data of each video frame from the to-be-detected video data, and dividing the image data into a plurality of image data sets according to the time sequence of each video frame, the imagedata sets which comprises image data corresponding to continuous video frames; inputting each image data set into a pre-trained image feature extraction model to obtain an image feature vector; extracting voice data from the to-be-detected video data, and obtaining a voice feature vector of the voice data; performing cascade splicing on the image feature vector and the voice feature vector to obtain a multi-modal feature vector; and inputting the multi-modal feature vector into a pre-trained fraud detection model to obtain a fraud detection result corresponding to the to-be-detected video data output by the fraud detection model. By adopting the method, the characteristic information amount can be increased, the comprehensiveness and diversity of the characteristic information are improved, and the accuracy of video data fraud detection is effectively improved.
Owner:PING AN TECH (SHENZHEN) CO LTD

Method and device for reducing echoes, and communication equipment

ActiveCN105810202ATwo-way loud-speaking telephone systemsSpeech analysisAdaptive filterVIT signals
The invention discloses a method and device for reducing echoes, and communication equipment. The communication equipment is provided with at least two microphones, wherein the distance between the first microphone and a loudspeaker is greater than the distance between the second microphone and the loudspeaker. The method comprises the steps: carrying out the filtering of a first echo signal s1(t) and/or a second echo signal s2(t); obtaining a target signal d(t) corresponding to the first echo signal s1(t), and a reference signal r(t) corresponding to the second echo signal s2(t), wherein the first echo signal s1(t) is an echo signal generated when the loudspeaker plays a downlink signal x(t) picked by the first microphone, and the second echo signal s2(t) is an echo signal generated when the loudspeaker plays the downlink signal x(t) picked by the second microphone; carrying out the filtering of the reference signal r(t) through employing an adaptive filter, and obtaining a filtering signal y(t); solving the difference between the filtering signal y(t) and the target signal d(t), and obtaining and outputting a residual signal e(t). The method makes the most of the signal picked by the microphone closer to the loudspeaker. Compared with the prior art, the method is better in echo inhibition effect.
Owner:芯鑫融资租赁(天津)有限责任公司

Method and device for calling

InactiveCN105448300AAvoid additional operations such as deselectionSave computing resourcesSpeech analysisPersonalizationVoice transformation
The invention relates to a method and a device for calling. The method comprises the steps of acquiring a first voice signal of the local party when talking to the other party, transforming the first voice signal by use of a preset voice model to get a second voice signal, and transmitting the second voice signal to the other party. As the voice signal transmitted in the call process is the voice signal after voice transformation, the other party gets the voice after voice transformation. Therefore, the call effect desired by users is achieved, the personalized need of the caller for call voice is satisfied, and the user experience is enhanced.
Owner:XIAOMI INC

Echo cancellation method based on convex combination for M-estimaion proportional affine projection

InactiveCN109102794AFast convergenceSmall steady state errorSpeech analysisSound producing devicesSignal cancellationAffine projection
The invention relates to an echo cancellation method based on convex combination for M-estimation proportional affine projection. The method comprises steps of A of far-end signal sampling; B of convex combination, a large-step filter value y1(n) and a small-step filter value y2(n) at the current time n are subjected to convex combination through a large-step filter weight lambda(n) of the currenttime n to obtain a combination filter value y(n) of the current time n; C of echo signal elimination, an echoed near-end signal d(n) picked up by a near-end microphone is subtracted with an output value y(n) of an adaptive filter and then returned to a far end, a return signal is a residual signal e(n), and e(n)=d(n)-y(n); D of updating a filter tap weight vector, the method of M-estimation proportional affine projection based on convex combination is utilized to calculate update of an adaptive filter tap weight vector; E of updating a filter weight; F of limiting the filter weight; G, let n=n+1, and the steps A, B, C, D, E, F, G are repeated till the end of iteration.
Owner:SOUTHWEST JIAOTONG UNIV

Method and system for multimedia data recognition, and method for multimedia customization which uses the method for multimedia data recognition

System and method for multimedia data recognition and method for multimedia customization which uses the method for multimedia data recognition are disclosed. Wherein the system includes a data capturing unit, a data recognition unit, and a waveform feature database. In which, the data capturing unit is for capturing a set of multimedia data to be recognized. The data recognition unit has a sound waveform conversion unit, a waveform feature capturing unit, and a waveform feature comparison unit, which are respectively used for converting sound data into waveform data, capturing waveform feature from waveform data, and comparing the captured waveform feature with at least a known waveform feature. By analyzing the sound data of the multimedia data, the multimedia data can be recognized.
Owner:IPEER MULTIMEDIA INT

Objective examination method of flat-tongue sound and cacuminal in standard Chinese

InactiveCN101546553AGood distinctionSpeech analysisSpeech soundExamination method
The invention discloses an objective examination method of flat-tongue sound and cacuminal in standard Chinese, including the steps: receiving input voice; syncopating the input voice; distilling distinguishing characteristics; giving a mark according to an evaluating model and obtaining a pronunciation score. By applying the objective examination method and adopting the distinguishing characteristics which can better reflect the pronunciation essential to distinguish the flat-tongue sound and the cacuminal, the better distinguishing performance can be obtained.
Owner:INST OF ACOUSTICS CHINESE ACAD OF SCI +1

Equalization of an Audio Signal

ActiveUS20130195286A1MicrophonesLoudspeakersPhase shiftedEqualization
A method of processing an input audio signal, the method comprising forming a plurality of output audio signals from the input audio signal, wherein each output audio signal is formed by performing respective processing on the input audio signal, wherein for a first output audio signal there is a target audio equalization operation comprising a target filter twice, wherein for the first output 10 audio signal, the respective processing comprises a first audio equalization operation, the first audio equalization operation being the target audio equalization operation modified to compensate for phase shifts that correspond to zeros of the transfer function of the target audio equalization operation, wherein for each output audio signal other than the first output audio signal, the respective 15 processing comprises a compensation filter that compensates for phase shifts that correspond to poles of the transfer function of the target audio equalization operation. 20
Owner:OXFORD DIGITAL

Abnormal sound detection method based on edge cloud intelligent architecture

InactiveCN110544489AReasonable configurationRelieve pressureSpeech analysisTransmissionPattern recognitionSound detection
An abnormal sound detection method based on an edge cloud intelligent architecture is provided. The method comprises the following steps: collecting audio data on an edge end; deploying tasks that canbe processed by the edge end to an edge device as much as possible; using the Docker container technology to perform encapsulation on task processing operators on a cloud end to realize migration ofcomputing tasks, and storing an audio detection result; using a deep neural network model to perform abnormal sound determination; and performing message communication between different devices through the MTZT protocol. According to the method provided by the present invention, the pressures on the cloud computing center and the network bandwidth are alleviated, the system real-time performance and responsiveness are improved, and data security can be better protected.
Owner:JIANGSU HUIZHONG DATA TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products