Pengenalan Bentuk Benda Berdasarkan Sinyal Suara dengan Transducer Mikrofon dan Teknologi Kinect

Siska Aulia; Lifwarda Lifwarda; Yustini Yustini

doi:10.25077/jnte.v7n3.600.2018

Siska Aulia

Jurusan Teknik Elektro, Politeknik Negeri Padang

Lifwarda Lifwarda

Jurusan Teknik Elektro, Politeknik Negeri Padang

Yustini Yustini

Jurusan Teknik Elektro, Politeknik Negeri Padang

Keywords

Abstract

Voice processing or speech recognition is growing rapidly hence it can be used for various applications such as moving a system or motion control and multimedia-based learning media. Implementation of speech recognition and image detection in this study using microphone transducer and kinect technology. This study aims to produce a system that can identify and recognize an object with word commands, such as circles, triangles, rectangles and many. In sound processing, sound feature extraction is carried out with Mel-Frequency Cepstrum Coeffecient (MFCC). Word modeling was done using statistical modeling, namely the Hidden Markov Model (HMM). HMM is able to provide an efficient mechanism for statistically modeling diversity in words or words. Data were collected with offline and online microphone transducers. This study matches the pattern of words through training and testing process. The output of this system is a recognizable word based on the highest probability and displaying the object shape based on the recognized word, namely circle, triangle and quadrilateral. Test results with mirofon tranducers, for 85% trained sources, 81.5% untrained sources, and 84% untrained Kinect source testing hence that word recognition systems can be implemented with Kinect technology.

Keywords : speech processing, HMM, MFCC, Kinect

Abstrak

Pengolahan suara atau pengenalan kata berkembang pesat sehingga dapat digunakan untuk berbagai aplikasi seperti menggerakan suatu sistem atau kontrol gerak dan media pembelajaran berbasis multimedia. Implementasi pengenalan suara dan deteksi citra pada penelitian ini menggunakan transducer mikrofon dan teknologi kinect. Penelitian ini bertujuan untuk menghasilkan sistem yang dapat mengidentifikasi dan mengenali suatu objek dengan perintah kata, seperti lingkaran, segitiga, segiempat dan segibanyak. Dalam pengolahan suara dilakukan ekstraksi ciri suara dengan Mel-Frequency Cepstrum Coeffecient (MFCC). Pemodelan kata dilakukan dengan menggunakan pemodelan statistik yaitu Hidden Markov Model (HMM). HMM mampu memberikan mekanisme yang efisien untuk memodelkan secara statistik keragaman dalam ucapan atau kata. Pengambilan data sampel dengan transducer mikrofon secara offline dan online. Pada penelitian ini pencocokan pola kata melalui proses pelatihan dan pengujian kata. Keluaran sistem ini berupa kata yang dikenali berdasarkan probabilitas tertinggi dan menampilkan bentuk benda berdasarkan kata yang dikenali. Prosesnya setelah kata dikenali, sistem akan mentracking citra benda berdasarkan bentuk benda kemudian menampilkan bentuk benda yaitu lingkaran, segitiga, segiempat dan segibanyak. Hasil pengujian dengan tranducer mirofon, untuk sumber terlatih 85%, sumber tidak terlatih 81,5%, dan pengujian dengan Kinect sumber tidak terlatih 84% sehingga sistem pengenalan kata dapat diimplementasikan dengan teknologi Kinect.

Kata Kunci : speech processing, HMM, MFCC, kinect

References

Aulia, Siska, "Implementasi Pengenalan Kata Dengan Metode Mel Frequency Cepstrum Coeffecient Dan Hidden Markov Model Untuk Mengontrol Gerak Robot Mobil Penjejak Identifikasi Warna", Tugas Akhir. Padang: Teknik Elektro Universitas Andalas. 2011.

Hardiyanti, Margareta, "Rancang Bangun Aplikasi Pembelajaran Pengucapan bagi Penderita Tunarungu Menggunakan Teknologi Kinect", Institut Teknologi Sepuluh Nopember, Surabaya, 2013.

Cahyarini Ratri, "Rancang Bangun Modul Pengenalan Suara Menggunakan Teknologi Kinect", Jurnal Teknik Pomits Vol. 2, No. 1, ISSN: 2337-3539. 2013.

Kinect, 2018. [Online]. Available: http://en.wikipedia.org/wiki/Kinect.

Fitrilina, Kurnia Rahmadi, Aulia Siska, "Pengenalan Ucapan Metoda MFCC-HMM untuk Perintah Gerak Robot Mobil Penjejak Identifikasi Warna", Jurnal Nasional Teknik Elektro, Vol 2 No.1 Maret 2013.

Rabiner, "A Tutorial on Hidden Markov Model and Selected Aplication in Speech Recognition", Proceedings of the IEEE, vol 77, No 2, 1989.

L. R. Rabiner and B-H Juang, Fundamentals of Speech Recognition, Prentice Hall, Englewood Cliffs, New Jersey, 1993, chapter 6.

S Aulia, Lifwardaa dan V. Veronica, "The Implementation of Speech Recognition Using Kinect Technology”, International Conference of Applied Science on Engineering, Business, Linguistics and Information Technology, pp. 375-383, 2017.

A. Zunita, Peningkatan Pemahaman Bentuk Geometri Melalui Pembelajaran Berbasis Multimedia Pada Anak Kelompok B TK KKLKMD Kuwon Bambanglipuro Bantul, Yogyakarta: Fakultas Ilmu Pendidikan Universitas Negeri Yogyakarta. 2013.

P. S. Sawai, "Gesture & Speech Recognition using Kinect Device –A Review", International Conference on Science and Technology for Sustainable Development, Kuala Lumpur, May, 2016 .

G. Archana , "Dynamic Hand Gesture Recognition using Hidden Markov Model by Microsoft Kinect Sensor", International Journal of Computer Applications, vol.150, no.5, September 2016.

https://www.electronicstutorials.ws/ io/io_8.html

D. W. J. Stein, "Detection of Random and Sinusoidal Signals in Hidden Markov Noise", Procedding of the 30th IEEE Asilomar Conference on Signal Systems and Computers, 464-468. 1997.

M. J. Landau, "Simulating Kinect Infrared and Depth Images", IEEE Transactions On Cybernetics, Vol. 46, No. 12, December 2016.

X. Chai, G Li, M. Zhou, "Sign Language Recognition and Translation with Kinect". In Proceedings of IEEE International Conference on Automatic Face and Gesture Recognition, Shanghai, China, April 2013.

PDF

Published

Nov 19, 2018

https://doi.org/10.25077/jnte.v7n3.600.2018

How to Cite

Aulia, S., Lifwarda, L., & Yustini, Y. (2018). Pengenalan Bentuk Benda Berdasarkan Sinyal Suara dengan Transducer Mikrofon dan Teknologi Kinect. Jurnal Nasional Teknik Elektro, 7(3), 191–202. https://doi.org/10.25077/jnte.v7n3.600.2018

Issue

Vol 7, No 3: November 2018

Section

Telecommunication

Authors who publish with this journal agree to the following terms:

Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).

The authors declare that:

1. This paper has not been published in the same form elsewhere.

2. It will not be submitted anywhere else for publication prior to acceptance/rejection by this Journal.

3. A copyright permission is obtained for materials published elsewhere and which require this permission for reproduction.

Main Article Content

Keywords

Abstract

References

Article Sidebar

Article Details

Most read articles by the same author(s)