Stereo coding method and device

A stereo and encoding technology, applied in the multimedia field, can solve the problems of inability to achieve realistic restoration, uncomfortable listening experience of the listener, and inability to meet the recovery requirements, and achieve the effect of improving encoding efficiency and enhancing sound field effects.

Inactive Publication Date: 2013-10-23
HUAWEI TECH CO LTD
View PDF0 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

ILD is a ubiquitous signal characteristic parameter that reflects the sound field signal. ILD can better reflect the energy of the sound field. However, stereo sound often has background space and sound fields in the left and right directions. It is not enough to restore the original stereo sound only by transmitting ILD to restore the stereo sound. Signal requirements, so a scheme to transmit more parameters to better restore the stereo signal was proposed. In addition to extracting the most basic ILD parameters, it also proposed to transmit the phase difference between the left and right channels (IPD: InterChannel Phase Difference) and the left and right channels. Cross-correlation ICC parameters, sometimes including the phase difference (OPD) parameters of the left channel and the downmix signal, these parameters reflecting the background space of the stereo signal and the sound field information in the left and right directions and the ILD parameters are encoded as side information and sent to Decoder to restore stereo signal
[0003] Coding bit rate is one of the important evaluation factors of multimedia signal coding performance. Th

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0031] Example 1:

[0032] figure 1 A schematic diagram of the implementation of a stereo encoding method, including:

[0033] Step 101: Transform the left channel signal and the right channel signal of the time domain stereo to the frequency domain to form the left channel signal and the right channel signal in the frequency domain.

[0034] Step 102: The left channel frequency domain signal and the right channel frequency domain signal in the frequency domain are down-mixed to generate a mono downmix signal (DMX), the bits after encoding and quantization of the DMX signal are transmitted, and the extracted frequency domain The spatial parameters of the upper left channel signal and the right channel signal are quantized and encoded.

[0035] Spatial parameters are parameters representing spatial characteristics of stereo signals, such as ILD parameters.

[0036] Step 103: Use the left and right channel signals in the frequency domain to estimate the group delay (Group Delay)

Example Embodiment

[0041] Embodiment 2:

[0042] figure 2 It is a schematic diagram of another stereo coding method embodiment, including:

[0043] Step 201, transform the time domain stereo left channel signal and the right channel signal to the frequency domain to form a stereo left channel signal X in the frequency domain 1 (k) and the right channel signal X 2 (k), where k is the index value of the frequency point of the frequency signal.

[0044] Step 202, performing a downmix operation on the left channel signal and the right channel signal in the frequency domain, encoding and quantizing the downmix signal and transmitting, and encoding stereo spatial parameters, quantizing to form side information and transmitting, which may include the following steps:

[0045] Step 2021, the left channel signal and the right channel signal in the frequency domain are downmixed to generate a synthesized mono downmix signal DMX.

[0046] Step 2022, encode the quantized mono downmix signal DMX, and trans

Example Embodiment

[0076] Combined with the above-mentioned preprocessing of the cross-correlation function time-domain signal, the group delay and group phase of the stereo signal are estimated in step 2033 using the same estimation method, or can be estimated separately. Specifically, at least the following estimated group phase and Implementation of group delay:

[0077] Step 2033 Embodiment 1, such as Figure 4a shown:

[0078] According to the cross-correlation function time-domain signal or the index corresponding to the value with the largest amplitude in the processed cross-correlation function time-domain signal, the group delay is estimated, and the phase angle corresponding to the cross-correlation function corresponding to the group delay is obtained, and the group phase is estimated. , including the following steps:

[0079] Determine the relationship between the index corresponding to the value with the largest amplitude in the cross-correlation function of the time-domain signal an

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention relates to a stereo coding method, which comprises the following steps of transforming a left channel signal and a right channel signal of stereo in a time domain into a frequency domain to form a left channel signal and a right channel signal in the frequency domain; performing down-mixing on the left channel signal and the right channel signal in the frequency domain to generate a single-channel down-mixed signal, and transmitting bits of the coded and quantized down-mixed signal; extracting spatial parameters of the left channel signal and the right channel signal in the frequency domain; estimating a group delay and a group phase between the left and right channels of the stereo by utilizing the left channel signal and the right channel signal in the frequency domain; and quantitatively coding the group delay, the group phase and the spatial parameters to achieve high stereo coding performance under a low code rate.

Description

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products