HDfpga: Digital Audio Compression Algorithms

Audio data compression is widely applied in multimedia devices including video conference system.

The human ear can nominally hear sounds in the range 20 Hz to 20,000 Hz (20 kHz). Peak sensitivity is between 2 kHz and 4 kHz. According to the Nyquist theory (the minimum sampling rate required to avoid aliasing, equal to twice the highest frequency contained within the signal), digital audio data is usually sampled from 8 kHz to 48 kHz, covering from 4 kHz to 24 kHz which is bigger than human hearing dynamic range.

Similar to image data compression, digital audio data compression often utilizes data quantization, entropy coding, transformation (A-law algorithm and μ-law algorithm), prediction, and frequency domain coding using filter bank bands, such as PQMF and PQF for MDCT. Similar to taking advantage of human visual system model, digital audio compression takes advantage of Psychoacoustics which outlines human hearing limits:

High frequency limit

Absolute threshold of hearing

Temporal masking

Simultaneous masking

An audio compression algorithm can assign a lower priority to sounds outside the range of human hearing. WMA is an example.

Echo cancellation is an essential technique for telephony and video conference. The following book chapter describe the principle of echo cancellation:

http://users.ece.gatech.edu/~barry/digital/supp/20echo.pdf

PEAQ (Perceptual Evaluation of Audio Quality) is a standardized algorithm for objectively measuring perceived audio quality.

For more see

Tutorials:

http://www.ics.uci.edu/~dan/class/267/notes/uci/z8.pdf

http://www.ee.columbia.edu/~dpwe/e6820/papers/Pan95-mpega.pdf

http://www.cs.princeton.edu/courses/archive/spr06/cos579/CookSoundSIG98.pdf

http://www.cs.ucf.edu/courses/cap5015/Image%20compression%20and%20video%20compression%202004%20notes%20-%206%20Audio%20compression.pdf

Perceptual audio coding algorithms:

http://ocw.mit.edu/courses/health-sciences-and-technology/hst-723-neural-coding-and-perception-of-sound-spring-2005/labs/fmntlprcptlaudio.pdf

http://www.mp3-tech.org/programmer/docs/audiopaper1.pdf

http://dsp-book.narod.ru/DSPMW/42.PDF

http://www.mp3-tech.org/programmer/docs/CaveT2002.pdf

MP3:

http://www.ece.umd.edu/class/enee408f.F2001/Projects/mp3.pdf

Coding and Standards:

http://eeweb.poly.edu/~yao/EE3414/audio_coding.pdf

MPEG4 Audio Algorithm:

http://www.itu.int/dms_pub/itu-r/oth/0A/07/R0A0700001F0001PDFE.pdf

http://www.iis.fraunhofer.de/en/Images/AES6183_MPEG-4_Scalable_to_Lossless_Audio_Coding_tcm183-51663.pdf

http://www.eurasip.org/proceedings/ext/isccsp2006/defevent/papers/cr1351.pdf

http://www.ebu.ch/fr/technical/trev/trev_305-moser.pdf

Advances in Linear Prediction Techniques:

http://alexandria.tue.nl/extra2/200710483.pdf

AC3:

http://www.dolby.com/uploadedFiles/English_(US)/Professional/Technical_Library/Technologies/Dolby_Digital_(AC-3)/37_ac3-flex.pdf

HD AAC:

http://www.iis.fraunhofer.de/bf/amm/download/HD-AAC_final_low_2011.pdf

Intel High Definition Audio:

http://www.intel.com/design/chipsets/hdaudio.htm

HD DVD Audio:

http://www.highdefdigest.com/news/show/1064

Audio quality test equipment:

http://www.tek.com/products/video-test/

1 comment:

Office Furniture in West Palm Beach said...: before comperssing we have to thing regarding the echo cancellation.
While designing an Echo cancellation system , we have to keep two things in mind. those are:-

first recognizing the originally transmitted signal that re-appears, with some delay, in the transmitted or received signal.
secondly you have to choose the best method for designing that .some of methods are echo suppressors or echo cancellers.

To implement the system we generally use the digital signal processor technique.
You have a great idea regarding
echo cancellation .; June 18, 2012 at 9:47 AM

HDfpga

Saturday, April 16, 2011

Digital Audio Compression Algorithms

1 comment:

Followers

Blog Archive

About Me