Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

15 results about "Voice data" patented technology

Advanced conference drop

A network telephone system is provided with a distributed network, a network call processor, with the call processor connected to the network. A telephone line network interface is connected to a telephone line and is connected to the network for receiving packets from the network and sending packets to the network including packets with telephone voice data. A plurality of network telephones are part of the network telephone system with each network telephone connected to the network. Each network telephone has a display for displaying information and each network telephone is capable of engaging in a concurrent telephonic communication. Each network telephone has an I/O device in electrical communication with the network for receiving and sending packets to other devices connected to the network, an input device for producing audio signals from an input local to the device and a packet controller in electrical communication with the I/O device and the input device. The packet controller generates packets from the audio signals received by the input device, forwards the generated packets to the I/O device for transmission to the network and combines packets received by the I/O device to produce an audio signal with the combined packets and the audio signals from the local input device. Each network telephone displays information on the display corresponding to an identity of a source of packets combined whereby the identity may be selected for dropping a source form a concurrent telephonic communication.
Owner:VALTRUS INNOVATIONS LTD +1

Video data fraud detection method and device, computer equipment and storage medium

PendingCN110781916AImprove accuracyIncrease diversitySpeech analysisAcquiring/recognising facial featuresFeature vectorData set
The invention relates to a fraud detection method and device for video data, computer equipment and a storage medium. The method comprises the following steps: acquiring to-be-detected video data; extracting image data of each video frame from the to-be-detected video data, and dividing the image data into a plurality of image data sets according to the time sequence of each video frame, the imagedata sets which comprises image data corresponding to continuous video frames; inputting each image data set into a pre-trained image feature extraction model to obtain an image feature vector; extracting voice data from the to-be-detected video data, and obtaining a voice feature vector of the voice data; performing cascade splicing on the image feature vector and the voice feature vector to obtain a multi-modal feature vector; and inputting the multi-modal feature vector into a pre-trained fraud detection model to obtain a fraud detection result corresponding to the to-be-detected video data output by the fraud detection model. By adopting the method, the characteristic information amount can be increased, the comprehensiveness and diversity of the characteristic information are improved, and the accuracy of video data fraud detection is effectively improved.
Owner:PING AN TECH (SHENZHEN) CO LTD

Speech recognition method and related device

PendingCN114360510AImprove fault tolerancePrecise Syllable Probability DistributionSpeech recognitionVoice dataSpeech sound
The embodiment of the invention discloses a speech recognition method and a related device, and at least relates to a speech recognition technology in artificial intelligence, speech data to be recognized are used as input data of a time delay neural network in an acoustic model, and an output layer of the time delay neural network comprises acoustic modeling units corresponding to a plurality of syllables respectively, so that the speech recognition efficiency is improved. And the syllable probability distribution corresponding to the voice frames included in the voice data can be obtained by taking the syllables as the recognition granularity through the time delay neural network. When syllable recognition is carried out through the output layer, auxiliary judgment can be carried out on the syllables to which the voice frames belong on the basis of pronunciation rules in combination with front and back syllable information of the voice frames, so that more accurate syllable probability distribution is output. Moreover, since the syllables are generally composed of one or more phonemes, the method has higher fault-tolerant capability, not only can more accurately determine the speech recognition result based on the probability distribution of the syllables, but also has low requirements for the quality of the speech data to be recognized, and effectively expands the application scenarios of the speech recognition technology.
Owner:TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products