Analysis of Speech Features for Gender Identification in Tai Language

Authors

  • Kankana Dutta Dibrugarh University
  • Dr. Rizwan Rehman Dibrugarh University
  • ANKUMON SARMAH Dibrugarh University

Abstract

The vast number of information packed into the human speech signal makes analysis a tough undertaking. This intricacy is notably noticeable in tasks like speaker recognition, especially when it comes to gender distinction. In this paper, we address this issue by conducting a thorough examination of the effectiveness of various speech features, namely Pitch, Formant Frequency, MFCC (Mel Frequency Cepstral Coefficients), and Chroma, in the context of gender identification in the Tai Language, which is spoken by the Tai people of Assam. In this study, we use machine learning (SVM, KNN, Decision Tree, Neural Network) to analyze speech features (Pitch, Formant Frequency, MFCC, Chroma) for gender identification in the Tai language spoken by the Tai people of Assam. Our results show that MFCC consistently outperforms other features, delivering highest accuracy rates across all approaches. This demonstrates MFCC's ability to extract gender information from Tai Language speech signals, suggesting more accurate gender identification systems. Beyond gender identification, our study extends voice analysis in linguistics and improves the application of spoken language data, allowing for improved communication systems and linguistic insights. In summary, our findings highlight the critical significance of MFCC in gender identification in the Tai language, with ramifications that extend far beyond its local context, promising advances in voice analysis and improving our understanding of language and human communication.

Keywords: Machine Learning methods, Neural Networks, Gender Identification, MFCC, Pitch, Formant Frequency, Chroma.

Downloads

Download data is not yet available.

References

A. A. Alnuaim, M. Zakariah, C. Shashidhar, W. A. Hatamleh, H. Tarazi, P. K. Shukla, R. Ratna, ”Speaker Gender Recognition Based on Deep Neural Networks and ResNet”, Hindawi Wireless Communications and Mobile Computing, Volume 2022

A. A. D. S. S. N. V. D. S. G. K. Pravin Bhaskar Ramteke, “Gender Identification From Children’s Speech,” Proceedings of 2018 Eleventh International Conference on Contemporary Computing (IC3), 2018.

V. K. M. Esther Ramdinmawii, “Gender Identification from Speech Signal by Examining the Speech Production Characteristics,” International Conference on Signal Processing and Communication (ICSC), IEEE, 2016.

A. G. Ngipethon Gohain, Tai Bhashar Kathopakathan (Prathamik Path), Dibrugarh: Centre for Studies in Language, Dibrugarh University, 2011.

M. Z. S. A. H. T. K. S. R. Abeer Ali Alnuaim, “Speaker Gender Recognition Based on Deep Neural Networks and ResNet50,” Hindawi, Wireless Communications and Mobile Computing, 2022.

T. B. M. Tshephisho Joseph Sefara, “Gender Identification in Sepedi Speech Corpus,” DOI: 10.1109/icABCD51485.2021.9519308, 2021

R. K. P. M. S. H. M. B. Mohammad Amaz Uddin, “Gender and region detection from human voice using the three-layer feature extraction method with 1D CNN,” JOURNAL OF INFORMATION AND TELECOMMUNICATION, VOL. 6, NO. 1, 2022.

R. G. U. R. Héctor A. Sánchez Hevia, “Age group classification and gender recognition from speech with temporal convolutional neural networks,” Multimedia Tools and Applications,https://doi.org/10.1007/s11042-021-11614-4, 2022.

M. S. Seema Khanum, “Speech-based Gender Identification using Feed Forward Neural Networks,” International Journal of Computer Applications, 2015.

R. W. S. L. R. Rabiner, “Introduction to Digital Speech Processing,” Now publishers Inc., 2007.

B. Medhi, “Analysis of formant frequency F1, F2, and F3 in Assamese vowel phonemes using LPC Model,” International Journal of Engineering Research & Technology (IJERT), vol. Vol. 6, no. Issue 05, 2017.

M. M. H. M. Z. H. Md. Rakibul Hasan, “How many Mel-frequency cepstral coefficients to be utilized in speech recognition? A study with the Bengali language,” The Journal of Engineering, Wiley, DOI: 10.1049/tje2.12082, no. 817– 827 (2021), 2021.

R. S. Alkhawaldeh, “DGR: Gender Recognition of Human Speech Using One-Dimensional Conventional Neural Network,” Hindawi, Scientific Programming, 2019.

S. K. B. Sayantani Ghosh, “SVM Classifier for Human Gender Classification,” International Journal of Applied Research on Information Technology and Computing, vol. 7, no. 2, pp. 100-105, 2016.

F. A. Issam Dagher, “Improving the SVM gender classification accuracy using clustering and incremental learning,” Exper Systems, 2019.

V. V. D. V. V. Y. A. P. G. Assim Ara Abdulsatar, “Age and gender recognition from speech signals,” Journal of Physics: Conference Series, 2019.

R. S. K. P. V. V. A Raahul, “Voice based gender classification using machine learning,” IOP Conf. Series: Materials Science and Engineering, pp. doi:10.1088/1757-899X/263/4/042083, 2017.

Y. L. J. W. B. Q. C. L. W. W. J. Z. Biliang Zhong, “Gender Recognition of Speech based on Decision Tree Model,” Advances in Computer Science Research (ACSR), vol. 90, 2019.

A. T. G. T. N. M. Orken Mamyrbayev, “Neural architectures for gender detection and Speaker Identification,” Cogent Engineering, 2020.

S. Sharma, A. Shukla, P Mishra, “ Speaker and Gender Identification on Indian Languages using Multilingual Speech”, International Journal of Innovative Science, Engineering & Technology, Vol. 1 Issue 4, June 2014.

D.S. Deiv, M. Bhattacharya, Gaurav, “Automatic Gender Identification for Hindi Speech Recognition”, International Journal of Computer Applications (0975 – 8887) Volume 31– No.5, October 2011

M. R. Hasan, M. M. Hasan, M. Z. Hossain,”How many Mel-frequency cepstral coefficients to be utilized in speech recognition? A study with the Bengali language”, Willey, 2021

Published

24-04-2024

How to Cite

Dutta, K., REHMAN, R., & SARMAH, A. (2024). Analysis of Speech Features for Gender Identification in Tai Language. Communications in Mathematics and Applications, 15(1). Retrieved from http://www.rgnpublications.com/journals/index.php/cma/article/view/2450

Issue

Section

Research Article