Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

25 results about "Speech recognition" patented technology

Speech recognition is a interdisciplinary subfield of computational linguistics that develops methodologies and technologies that enables the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). It incorporates knowledge and research in the linguistics, computer science, and electrical engineering fields.

Method and device for playing audio file

InactiveCN102881305AAchieving processing powerAchieve playbackDigital recording/reproducing24-bitQuantitative accuracy
The invention relates to the technical field of communication electron, in particular to a method and a device for playing an audio file. The method comprises the following steps: decoding the audio file to be played to acquire a target audio file; judging whether the quantitative accuracy value of the target audio file is more than the preset value or not; when determining that the quantitative accuracy value of the target audio file is more than the preset value, establishing an audio track for the target audio file; and adding the audio track to a corresponding thread and playing the audio file to be played by using the thread. Through the method, loss of tone quality when the audio file with the quantitative accuracy value of 24 bit or 32 bit is played in the prior art is avoided.
Owner:INGENIC SEMICON CO LTD

Equalization of an Audio Signal

ActiveUS20130195286A1MicrophonesLoudspeakersPhase shiftedEqualization
A method of processing an input audio signal, the method comprising forming a plurality of output audio signals from the input audio signal, wherein each output audio signal is formed by performing respective processing on the input audio signal, wherein for a first output audio signal there is a target audio equalization operation comprising a target filter twice, wherein for the first output 10 audio signal, the respective processing comprises a first audio equalization operation, the first audio equalization operation being the target audio equalization operation modified to compensate for phase shifts that correspond to zeros of the transfer function of the target audio equalization operation, wherein for each output audio signal other than the first output audio signal, the respective 15 processing comprises a compensation filter that compensates for phase shifts that correspond to poles of the transfer function of the target audio equalization operation. 20
Owner:OXFORD DIGITAL

Method and system for off-line publishing of internet voice frequency content with literal label

InactiveCN102143180AInterconnection arrangementsTransmissionVoice frequencyInternet servers
The invention relates to a method and a system for off-line publishing of internet voice frequency content with a literal label. By utilizing the method, the voice frequency content transcribed by the voice frequency module and the word content received by a short message module are combined by adopting a combination manner of transcribing the voice frequency content through dialing and adding the word label through sending short messages, and the word content utilized as the word label is published on an internet server. The system provided by the invention comprises a cellphone, a voice frequency transcription module, a short message module, a combination publishing module and the internet server, wherein, the voice frequency transcription module and the short message module are respectively connected with the cellphone; the combination publishing module is respectively connected with the voice frequency transcription module and the short message module; and the internet server is connected with the combination publishing module. By utilizing the method and system provided by the invention, users are enabled to publish the word information of the voice frequency content in an off-line mode by utilizing the off-line advantages of the communication network, and meanwhile, the word information can be previewed or searched.
Owner:北京蓝珀通信技术有限公司

Audio adjustment based on dynamic and static rules

ActiveUS10171054B1Manually-operated gain controlSpeech analysisSpeech recognitionAudio frequency
An approach is provided that compares inputs received at a system to a set of rules. The rules include both static rules as well as dynamic rules. The approach retrieves audio adjustments based on the comparison of inputs to the rules. The approach then automatically adjusts an output of an audio system based on the retrieved audio adjustment.
Owner:IBM CORP

Method and device for recognizing speaker in video in real time

ActiveCN114819110AWill not cause lossReal-time processingCharacter and pattern recognitionNeural architecturesPattern recognitionInformation repository
The invention discloses a method and a device for recognizing a speaker in a video in real time. The method comprises the following steps: acquiring an image sequence and an audio sequence which start at the same moment and are continuous; detecting and tracking a face according to the latest frame of image in the image sequence, and updating an existing face sequence information base; inputting the face sequence information in the face sequence information base and the audio sequence into a trained speaker detection network, detecting a speaking state, and updating a speaking state database; and according to the speaking state database, obtaining the current state of all people so as to identify possible speakers in the video.
Owner:ZHEJIANG LAB

Word entry retrieval and wrong word detection methods and systems for pinyin input method

ActiveCN105653061AImprove error correction performanceHigh degree of intelligenceInput/output processes for data processingSequence processingUser input
The invention discloses word entry retrieval and wrong word detection methods and systems for a pinyin input method. The retrieval method comprises the steps of detecting whether a reference word entry exists before or after a currently input pinyin string; if the front or back reference word entry exists, performing matching in a word bank according to the front or back reference word entry and the current input pinyin string to obtain at least one back or front word entry; and if no reference word entry exists before or after the pinyin string, performing matching in the word bank according to the current pinyin string to obtain corresponding word entries or a word entry list. The wrong word detection method comprises the steps of obtaining at least one front or back word entry according to the reference word entry of the pinyin string currently input by a user; constructing a word graph; performing forward and reverse comparison, wrong key processing or wrong sequence processing; and judging whether the processed pinyin string meets a pinyin rule or not, and if yes, returning the word entry corresponding to the word graph to the user. According to the methods and systems, the word entry recommendation accuracy and the user input correction capability in the input method are improved.
Owner:BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1

Audio noise reduction method and device, electronic equipment and storage medium

PendingCN114708877AImprove noise reductionSpeech analysisNoiseFeature extraction
The embodiment of the invention discloses an audio noise reduction method and device, electronic equipment and a storage medium. The method comprises the following steps: acquiring an audio to be denoised; obtaining voiceprint features of a user corresponding to the to-be-denoised audio, wherein the voiceprint features are obtained after feature extraction is performed on registered audio of the user; and inputting the to-be-denoised audio and the corresponding voiceprint features into a denoising model to obtain a denoised audio corresponding to the user, the denoising model being used for obtaining the denoised audio corresponding to the user from the to-be-denoised audio according to the voiceprint features. And the noise reduction model finds the audio corresponding to the voiceprint feature from the audio to be subjected to noise reduction in combination with the voiceprint feature of the user, and outputs the audio corresponding to the voiceprint feature, thereby completing noise reduction of the audio to be subjected to noise reduction. The noise-reduced audio of the user corresponding to the voiceprint feature and not containing interference audio of other users and background noise is obtained, and the noise reduction effect on the audio to be subjected to noise reduction of the user is improved.
Owner:MASHANG CONSUMER FINANCE CO LTD

Chinese character input method

ActiveCN111796692ALow repetition rateReduce learning difficultyInput/output processes for data processingChinese charactersEngineering
The invention relates to an input method, in particular to a Chinese character input method. The Chinese character input method is characterized in that: input of a single Chinese character is achieved by sequentially inputting an initial code representing a Chinese character initial consonant, a final code representing a Chinese character final and a structure code representing a Chinese character structure; input of words or phrases is achieved by continuously inputting initial codes and final codes of single Chinese characters, the structural codes are represented by letters I, U and V, whether the single Chinese characters, words or phrases are input is judged according to whether the structural codes are followed after the initial codes and the final codes, if so, the single Chinese characters are judged to be input, and otherwise the words or phrases are judged to be input. According to the Chinese character input method , the defects that the input speed is low, the repeated code rate is high and the learning period is long in the prior art can be effectively overcome.
Owner:HEFEI EMMET INFORMATION & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products