Twin network voiceprint recognition method based on 3D convolution
A twin network, voiceprint recognition technology, applied in biological neural network models, neural learning methods, speech analysis, etc., can solve the problems of low recognition rate, ignore the spatial and temporal characteristics of speech information, etc., to improve the accuracy rate, Relevance-enhancing effects
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Example Embodiment
[0034] The present invention will be described in detail below with reference to the drawings and specific embodiments.
[0035] The embodiment of the invention discloses a twin network based on 3D convolution for voiceprint recognition. The Siamese-Net network is abbreviated as Sia-Net network, and includes: a feature extraction unit: used to convert audio data into a three-dimensional tensor, The three-dimensional tensor is the MFLC feature.
[0036] Sia-Net network: used to process the MFLC features, shorten the feature distance of data between the same speaker, and increase the feature distance of data between different speakers. This distance is the Euclidean distance. CNN network: used to build a model library for each speaker. Prediction unit: used to test and determine the speaker identity of audio data.
[0037] The Sia-Net network: There are two, each of the Sia-Net networks includes: three 3D convolutional layers, one pooling layer, four 3D convolutional layers, one connec
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap