Method and device for voiceprint recognition,Advanced conference drop,Methods for transmitting and managing voice frames, computer program product, means of storage and corresponding devices,Video data fraud detection method and device, computer equipment and storage medium,Voice-controlled display device and method of voice control of display device

Patents

Literature

Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.

Hiro

15 results about "Voice data" patented technology

Filter

Efficacy Topic

Property

Owner

Technical Advancement

Application Domain

Technology Topic

Technology Field Word

Patent Country/Region

Patent Type

Patent Status

Application Year

Inventor

Method and device for voiceprint recognition

ActiveUS20140214417A1Reduce noise disturbanceFunction increaseSpeech recognitionSpeaker verificationLoudspeaker

A method and device for voiceprint recognition, include: establishing a first-level Deep Neural Network (DNN) model based on unlabeled speech data, the unlabeled speech data containing no speaker labels and the first-level DNN model specifying a plurality of basic voiceprint features for the unlabeled speech data; obtaining a plurality of high-level voiceprint features by tuning the first-level DNN model based on labeled speech data, the labeled speech data containing speech samples with respective speaker labels, and the tuning producing a second-level DNN model specifying the plurality of high-level voiceprint features; based on the second-level DNN model, registering a respective high-level voiceprint feature sequence for a user based on a registration speech sample received from the user; and performing speaker verification for the user based on the respective high-level voiceprint feature sequence registered for the user.

View all

Owner:TENCENT TECH (SHENZHEN) CO LTD

Advanced conference drop

ActiveUS7085364B1Multiplex system selection arrangementsSpecial service provision for substationVoice dataAudio signal

A network telephone system is provided with a distributed network, a network call processor, with the call processor connected to the network. A telephone line network interface is connected to a telephone line and is connected to the network for receiving packets from the network and sending packets to the network including packets with telephone voice data. A plurality of network telephones are part of the network telephone system with each network telephone connected to the network. Each network telephone has a display for displaying information and each network telephone is capable of engaging in a concurrent telephonic communication. Each network telephone has an I/O device in electrical communication with the network for receiving and sending packets to other devices connected to the network, an input device for producing audio signals from an input local to the device and a packet controller in electrical communication with the I/O device and the input device. The packet controller generates packets from the audio signals received by the input device, forwards the generated packets to the I/O device for transmission to the network and combines packets received by the I/O device to produce an audio signal with the combined packets and the audio signals from the local input device. Each network telephone displays information on the display corresponding to an identity of a source of packets combined whereby the identity may be selected for dropping a source form a concurrent telephonic communication.

View all

Owner:VALTRUS INNOVATIONS LTD +1

Methods for transmitting and managing voice frames, computer program product, means of storage and corresponding devices

ActiveUS20100099400A1Accurate locationError preventionFrequency-division multiplex detailsComputer hardwareTransmission channel

A method of transmitting voice frames, via a transmission channel reserved for voice data, by a transmitting terminal generating voice frames using a voice signal is proposed. Such a method includes steps of: obtaining non-voice data; selecting voice data from the voice frame according to configuration data obtained beforehand relative to the transmission of non-voice data on the reserved channel; constructing of a degraded voice frame by replacing selected voice data with non-voice data; transmitting the degraded voice frame via the reserved channel to a receiving terminal. On the receiving terminal side, a method of managing voice frames coming from the transmitting terminal is proposed including the steps of: detecting a non-voice data header included in the voice frame; extracting, from the voice frame, of non-voice data, according to configuration data read in the header; transmitting of extracted non-voice data to a processor of non-voice data.

Methods for transmitting and managing voice frames, computer program product, means of storage and corresponding devices

View all

Owner:SIERRA WIRELESS

Video data fraud detection method and device, computer equipment and storage medium

PendingCN110781916AImprove accuracyIncrease diversitySpeech analysisAcquiring/recognising facial featuresFeature vectorData set

The invention relates to a fraud detection method and device for video data, computer equipment and a storage medium. The method comprises the following steps: acquiring to-be-detected video data; extracting image data of each video frame from the to-be-detected video data, and dividing the image data into a plurality of image data sets according to the time sequence of each video frame, the imagedata sets which comprises image data corresponding to continuous video frames; inputting each image data set into a pre-trained image feature extraction model to obtain an image feature vector; extracting voice data from the to-be-detected video data, and obtaining a voice feature vector of the voice data; performing cascade splicing on the image feature vector and the voice feature vector to obtain a multi-modal feature vector; and inputting the multi-modal feature vector into a pre-trained fraud detection model to obtain a fraud detection result corresponding to the to-be-detected video data output by the fraud detection model. By adopting the method, the characteristic information amount can be increased, the comprehensiveness and diversity of the characteristic information are improved, and the accuracy of video data fraud detection is effectively improved.

Video data fraud detection method and device, computer equipment and storage medium

View all

Owner:PING AN TECH (SHENZHEN) CO LTD

Voice-controlled display device and method of voice control of display device

InactiveUS20160139877A1Speech recognitionSound input/outputUser needsDisplay device

The present invention is to provide a voice-controlled display device configured such that the inputted user's speech is compared with the identification voice data assigned to each of the execution unit areas on a screen displayed through a display unit and, if there exists identification voice data corresponding to the user's speech, an execution signal is generated to the execution unit area to which the identification voice data is assigned to resolve the inconvenience that the user needs to learn the voice commands stored in the database and to apply the convenience and intuitive simplicity of user experience (UX) of the conventional touchscreen control to the voice control, and a method of voice control of the above display device

Voice-controlled display device and method of voice control of display device

View all

Owner:PARK NAM TAE

Children smartwatch and wechat client talkback method

InactiveCN105681167AData switching networksClient-sideVoice data

The invention discloses a children smartwatch and wechat client talkback method, comprising a wechat server and a transfer platform; the transfer platform comprises a transfer server; the talkback for a children smartwatch and a wechat client comprises the following steps: for downlink voice, the wechat client stores downlink voice data to the wechat server after recording; the wechat server sends a voice message notification to the transfer server; and the transfer server obtains the downlink voice data from the wechat server and then sends the downlink voice data to the corresponding children smartwatch; and for uplink voice, the children smartwatch uploads uplink voice data to the transfer server after recording; the transfer server pushes the uplink voice data to the wechat server; and then the wechat server sends the uplink voice data to the corresponding wechat client. By utilizing the method disclosed by the invention, parents can communicate with children very conveniently via wechat.

View all

Owner:深圳市泰比特科技有限公司

Data simplifying and merging method for a voice decoding memory system

InactiveUS20050096919A1Save processor resourcesImprove decoding efficiencyData processing applicationsCode conversionLogical operationsSpeech sound

A data simplifying and merging method for a voice decoding memory system is disclosed. The method includes the steps of: reading a voice data from a non-volatile memory in a memory system; performing logic operation on the voice data in order to obtain an index; fetching corresponding decoded voice data in a table of the memory system in accordance with the index; and adding the decoded voice data to the voice data in order to obtain an original voice data.

View all

Owner:SUNPLUS TECH CO LTD

Speech recognition method and related device

PendingCN114360510AImprove fault tolerancePrecise Syllable Probability DistributionSpeech recognitionVoice dataSpeech sound

The embodiment of the invention discloses a speech recognition method and a related device, and at least relates to a speech recognition technology in artificial intelligence, speech data to be recognized are used as input data of a time delay neural network in an acoustic model, and an output layer of the time delay neural network comprises acoustic modeling units corresponding to a plurality of syllables respectively, so that the speech recognition efficiency is improved. And the syllable probability distribution corresponding to the voice frames included in the voice data can be obtained by taking the syllables as the recognition granularity through the time delay neural network. When syllable recognition is carried out through the output layer, auxiliary judgment can be carried out on the syllables to which the voice frames belong on the basis of pronunciation rules in combination with front and back syllable information of the voice frames, so that more accurate syllable probability distribution is output. Moreover, since the syllables are generally composed of one or more phonemes, the method has higher fault-tolerant capability, not only can more accurately determine the speech recognition result based on the probability distribution of the syllables, but also has low requirements for the quality of the speech data to be recognized, and effectively expands the application scenarios of the speech recognition technology.