Analysis of Speech Features for Gender Identification in Tai Language
Abstract
The vast number of information packed into the human speech signal makes analysis a tough undertaking. This intricacy is notably noticeable in tasks like speaker recognition, especially when it comes to gender distinction. In this paper, we address this issue by conducting a thorough examination of the effectiveness of various speech features, namely Pitch, Formant Frequency, MFCC (Mel Frequency Cepstral Coefficients), and Chroma, in the context of gender identification in the Tai Language, which is spoken by the Tai people of Assam. In this study, we use machine learning (SVM, KNN, Decision Tree, Neural Network) to analyze speech features (Pitch, Formant Frequency, MFCC, Chroma) for gender identification in the Tai language spoken by the Tai people of Assam. Our results show that MFCC consistently outperforms other features, delivering highest accuracy rates across all approaches. This demonstrates MFCC's ability to extract gender information from Tai Language speech signals, suggesting more accurate gender identification systems. Beyond gender identification, our study extends voice analysis in linguistics and improves the application of spoken language data, allowing for improved communication systems and linguistic insights. In summary, our findings highlight the critical significance of MFCC in gender identification in the Tai language, with ramifications that extend far beyond its local context, promising advances in voice analysis and improving our understanding of language and human communication.
Keywords: Machine Learning methods, Neural Networks, Gender Identification, MFCC, Pitch, Formant Frequency, Chroma.
Downloads
References
A. A. Alnuaim, M. Zakariah, C. Shashidhar, W. A. Hatamleh, H. Tarazi, P. K. Shukla, R. Ratna, ”Speaker Gender Recognition Based on Deep Neural Networks and ResNet”, Hindawi Wireless Communications and Mobile Computing, Volume 2022
A. A. D. S. S. N. V. D. S. G. K. Pravin Bhaskar Ramteke, “Gender Identification From Children’s Speech,” Proceedings of 2018 Eleventh International Conference on Contemporary Computing (IC3), 2018.
V. K. M. Esther Ramdinmawii, “Gender Identification from Speech Signal by Examining the Speech Production Characteristics,” International Conference on Signal Processing and Communication (ICSC), IEEE, 2016.
A. G. Ngipethon Gohain, Tai Bhashar Kathopakathan (Prathamik Path), Dibrugarh: Centre for Studies in Language, Dibrugarh University, 2011.
M. Z. S. A. H. T. K. S. R. Abeer Ali Alnuaim, “Speaker Gender Recognition Based on Deep Neural Networks and ResNet50,” Hindawi, Wireless Communications and Mobile Computing, 2022.
T. B. M. Tshephisho Joseph Sefara, “Gender Identification in Sepedi Speech Corpus,” DOI: 10.1109/icABCD51485.2021.9519308, 2021
R. K. P. M. S. H. M. B. Mohammad Amaz Uddin, “Gender and region detection from human voice using the three-layer feature extraction method with 1D CNN,” JOURNAL OF INFORMATION AND TELECOMMUNICATION, VOL. 6, NO. 1, 2022.
R. G. U. R. Héctor A. Sánchez Hevia, “Age group classification and gender recognition from speech with temporal convolutional neural networks,” Multimedia Tools and Applications,https://doi.org/10.1007/s11042-021-11614-4, 2022.
M. S. Seema Khanum, “Speech-based Gender Identification using Feed Forward Neural Networks,” International Journal of Computer Applications, 2015.
R. W. S. L. R. Rabiner, “Introduction to Digital Speech Processing,” Now publishers Inc., 2007.
B. Medhi, “Analysis of formant frequency F1, F2, and F3 in Assamese vowel phonemes using LPC Model,” International Journal of Engineering Research & Technology (IJERT), vol. Vol. 6, no. Issue 05, 2017.
M. M. H. M. Z. H. Md. Rakibul Hasan, “How many Mel-frequency cepstral coefficients to be utilized in speech recognition? A study with the Bengali language,” The Journal of Engineering, Wiley, DOI: 10.1049/tje2.12082, no. 817– 827 (2021), 2021.
R. S. Alkhawaldeh, “DGR: Gender Recognition of Human Speech Using One-Dimensional Conventional Neural Network,” Hindawi, Scientific Programming, 2019.
S. K. B. Sayantani Ghosh, “SVM Classifier for Human Gender Classification,” International Journal of Applied Research on Information Technology and Computing, vol. 7, no. 2, pp. 100-105, 2016.
F. A. Issam Dagher, “Improving the SVM gender classification accuracy using clustering and incremental learning,” Exper Systems, 2019.
V. V. D. V. V. Y. A. P. G. Assim Ara Abdulsatar, “Age and gender recognition from speech signals,” Journal of Physics: Conference Series, 2019.
R. S. K. P. V. V. A Raahul, “Voice based gender classification using machine learning,” IOP Conf. Series: Materials Science and Engineering, pp. doi:10.1088/1757-899X/263/4/042083, 2017.
Y. L. J. W. B. Q. C. L. W. W. J. Z. Biliang Zhong, “Gender Recognition of Speech based on Decision Tree Model,” Advances in Computer Science Research (ACSR), vol. 90, 2019.
A. T. G. T. N. M. Orken Mamyrbayev, “Neural architectures for gender detection and Speaker Identification,” Cogent Engineering, 2020.
S. Sharma, A. Shukla, P Mishra, “ Speaker and Gender Identification on Indian Languages using Multilingual Speech”, International Journal of Innovative Science, Engineering & Technology, Vol. 1 Issue 4, June 2014.
D.S. Deiv, M. Bhattacharya, Gaurav, “Automatic Gender Identification for Hindi Speech Recognition”, International Journal of Computer Applications (0975 – 8887) Volume 31– No.5, October 2011
M. R. Hasan, M. M. Hasan, M. Z. Hossain,”How many Mel-frequency cepstral coefficients to be utilized in speech recognition? A study with the Bengali language”, Willey, 2021
Published
How to Cite
Issue
Section
License
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a CCAL that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.