Training method and system for mixed speech recognition model of mandarin and Sichuan speech

A technology of mixing speech and recognition models, applied in speech recognition, speech analysis, neural learning methods, etc., can solve the problems of resource occupation, poor dialect performance recognition, high resource occupation and deployment cost, and achieve small resource occupation and engineering realization. Simple, convenient and robust effects

Active Publication Date: 2020-10-30
AISPEECH CO LTD
View PDF6 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

This patented technology allows for better simulation of various scenarios by simulating multiple environmental conditions or channel changes without repeating all previous simulations. It uses both models called phonetic part modelling (POM) units and Chinese Character Models (CAM), making these tasks easier than traditional techniques involving multitaskers trained on individual languages. Additionally, POM units help improve speech quality while CAM modules make calculations faster compared to other approaches like LSTM networks. Overall, this approach improves efficiency and accuracy in spoken word processing systems such as SiChu Language Learning System's English version.

Problems solved by technology

This patented technical problem addressed by this patents relates to improving spoken word recognition (SWT) accuracy when supporting various types of linguistics due to errors caused during speech recognition processing. Current methods involve adding or removing models depending upon whether certain parts have been previously identified correctly. Additionally, current techniques require acquiring specific dataset sizes for trained networks, making them less efficient in handling mixed datasets containing both English and German words.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Training method and system for mixed speech recognition model of mandarin and Sichuan speech
  • Training method and system for mixed speech recognition model of mandarin and Sichuan speech
  • Training method and system for mixed speech recognition model of mandarin and Sichuan speech

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] In order to make the objectives, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be described clearly and completely in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of the embodiments of the present invention, not all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

[0024] Such as figure 1 Shown is a flowchart of a training method for a Mandarin and Sichuanese hybrid speech recognition model provided by an embodiment of the present invention, including the following steps:

[0025] S11: Perform data enhancement on the mixed training audio data with text annotations, determine the features of the data-

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a training method for a mixed speech recognition model of mandarin and Sichuan. The method comprises the steps of determining features of mixed training audiodata, and determining the features as input data for training based on phoneme-based data alignment and Chinese character-based data alignment; inputting to N common intermediate layers, calculating afirst loss function by a first task layer, and calculating a second loss function by a second task layer; training N layers of first task layers based on the first loss function, training N layers ofsecond task layers based on the second loss function, performing multi-task training based on the trained first neural network parameters and the trained second neural network parameters, and training N layers of common intermediate layers. The embodiment of the invention further provides a training system for the mixed speech recognition model of mandarin and Sichuan. According to the embodimentof the invention, phonemes and Chinese characters are used as tasks of multi-task joint training, so that the recognition performance of mandarin and Sichuan talks is improved, and the resource occupation is reduced.

Description

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Owner AISPEECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products