Analysis of the Capability of MPEG Spatial Audio Object Coding for Multichannel Audio Reproduction

Amirul Luthfi, Ikhwana Elfitri


MPEG Spatial Audio Object Coding (MPEG SAOC) is the first object-based audio coding standard released by the International Organization for Standardization (ISO). MPEG SAOC can reproduce multichannel audio at the decoder side. In this paper, the performance of MPEG SAOC in multichannel audio reproduction is analyzed and compared with that of MPEG Surround (MPS). The results show that both MPEG SAOC and MPS achieve an Objective Difference Grade (ODG) at the imperceptible level. However, the MPEG SAOC encoder supports only a single multichannel audio object as input.

Keywords: Spatial Audio Object Coding, Object-based Audio, Multichannel Audio





Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.



This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.