Volume 10 , Issue 1 , PP: 128-142, 2023 | Cite this article as | XML | Html | PDF | Full Length Article
Hayder Mahmood Salman 1 * , Vian S. Al-Doori 2 , Hayder sharif 3 , Wasfi Hameed4 4 , Rusul S. Bader 5
Doi: https://doi.org/10.54216/FPA.100108
To enhance the performance of Chinese language pronunciation evaluation and speech recognition systems, researchers are focusing on developing intelligent techniques for multilevel fusion processing of data, features, and decisions using deep learning-based computer-aided systems. With a combination of score level, rank level, and hybrid level fusion, as well as fusion optimization and fusion score improvement, these systems can effectively combine multiple models and sensors to improve the accuracy of information fusion. Additionally, intelligent systems for information fusion, including those used in robotics and decision-making, can benefit from techniques such as multimedia data fusion and machine learning for data fusion. Furthermore, optimization algorithms and fuzzy approaches can be applied to data fusion applications in cloud environments and e-systems, while spatial data fusion can be used to enhance the quality of image and feature data In this paper, a new approach has been presented to identify the tonal language in continuous speech. This study proposes the Machine learning-assisted automatic speech recognition framework (ML-ASRF) for Chinese character and language prediction. Our focus is on extracting highly robust features and combining various speech signal sequences of deep models. The experimental results demonstrated that the machine learning neural network recognition rate is considerably higher than that of the conventional speech recognition algorithm, which performs more accurate human-computer interaction and increases the efficiency of determining Chinese language pronunciation accuracy.
Automatic Speech Recognition , Machine Learning , Language detection , Language Pronunciation , Fusion processing.
[1] Zhang, L., Li, Y., Zhou, H., Zhang, Y., & Shu, H. (2020). Sentence Context Differentially Modulates Contributions of Fundamental Frequency Contours to Word Recognition in Chinese-Speaking Children With and Without Dyslexia. Frontiers in Psychology, 11, 3337.
[2] Zhang, Q., & Reilly, R. G. (2020). What are regions of Chinese characters crucial for recognition? A web-based study. Journal of Chinese Writing Systems, 2513850220950020.
[3] Ren, Z., Yang, G., & Xu, S. (2019). Two-Stage Training for Chinese Dialect Recognition. arXiv preprint arXiv:1908.02284.
[4] Guan, C. Q., Fraundorf, S. H., & Perfetti, C. A. (2020). Character and child factors contribute to character recognition development among excellent and poor Chinese readers from grade 1 to 6. Annals of dyslexia, 70(2), 220-242.
[5] Hatim Abdelhak Dida, DSK Chakravarthy, & Fazle Rabbi. (2023). ChatGPT and Big Data: Enhancing Text-to-Speech Conversion. Mesopotamian Journal of Big Data, 2023, 33–37. https://doi.org/10.58496/MJBD/2023/005 [6] Jaber, M.M., Ali, M.H., Abd, S.K., Jassim, M.M., Alkhayyat, A., Alreda, B.A., Alkhuwaylidee, A.R. and Alyousif, S., 2022. A Machine Learning-Based Semantic Pattern Matching Model for Remote Sensing Data Registration. Journal of the Indian Society of Remote Sensing, pp.1-14.
[7] Yu, C., Chen, Y., Li, Y., Kang, M., Xu, S., & Liu, X. (2019). Cross-language end-to-end speech recognition research based on transfer learning for the low-resource Tujia language. Symmetry, 11(2), 179.
[8] Chen, L., Lei, J., & Gong, H. (2018). The effect of hearing status on speechreading performance of Chinese adolescents. Clinical linguistics & phonetics, 32(12), 1090-1102.
[9] Li, L., Wang, H. C., Castles, A., Hsieh, M. L., & Marinus, E. (2018). Phonetic radicals, not phonological coding systems, support orthographic learning via self-teaching in Chinese. Cognition, 176, 184-194.
[10] Tung, C. H., & Lin, Y. G. (2020). Off-line handwritten Chinese character recognition by using support vector machines. Journal of Information and Optimization Sciences, 1-20.
[11] Lin, J., Li, W., Gao, Y., Xie, Y., Chen, N. F., Siniscalchi, S. M., ... & Lee, C. H. (2018). Improving Mandarin tone recognition based on DNN by combining acoustic and articulatory features using extended recognition networks. Journal of Signal Processing Systems, 90(7), 1077-1087.
[12] Qin, Z., Tremblay, A., & Zhang, J. (2019). Influence of within-category tonal information in recognition of Mandarin-Chinese words by native and non-native listeners: An eye-tracking study. Journal of Phonetics, 73, 144-157.
[13] McLaughlin, D. J., & Van Engen, K. J. (2020). Task-evoked pupil response for accurately recognized accented speech. The Journal of the Acoustical Society of America, 147(2), EL151-EL156.
[14] Li, M. F., Gao, X., Y., & Wu, J. T. (2020). Neighbourhood effects in Chinese character recognition: Going beyond phonological perspectives to explain a possible underlying mechanism. Reading and Writing, 33(3), 547-570. [15] Jaber, M.M., Ali, M.H., Abd, S.K., Jassim, M.M., Alkhayyat, A., Kadhim, E.H., Alkhuwaylidee, A.R. and Alyousif, S., 2023. AHI: a hybrid machine learning model for complex industrial information systems. Journal of Combinatorial Optimization, 45(2), p.58.
[16] Liao, C. C. (2018). Double-Sided Occluded Chinese Character Recognition Accuracy and Response Time for Design and Nondesign Educational Background. SAGE Open, 8(4), 2158244018810065.
[17] Lim, R. Y., Yap, M. J., & Tse, C. S. (2020). Individual differences in Cantonese Chinese word recognition: Insights from the Chinese Lexicon Project. Quarterly Journal of Experimental Psychology, 73(4), 504-518.
[18] Gao, S., Kong, D., Yu, Z., Luo, Y., Guo, J., & Xian, Y. (2019). Chinese question speech recognition integrated with domain characteristics. International Journal of Computational Science and Engineering, 19(3), 325-333.
[19] Zhang, J., Chen, B. B., Hodges-Simeon, C., Albert, G., Gaulin, S. J., & Reid, S. A. (2020). High recognition accuracy for low-pitched male voices in men with higher threat potential: Further evidence for humans' retaliation-cost model. Evolution and Human Behavior.
[20] Chronaki, G., Wigelsworth, M., Pell, M. D., & Kotz, S. A. (2018). The development of cross-cultural recognition of vocal emotion during childhood and adolescence. Scientific reports, 8(1), 1-17.
[21] Yang, P. P. (2020). How amplitude influences Mandarin Chinese tone recognition in a whisper. Working Papers of the Linguistics Circle, 30(1), 42-52.
[22] Yang, J., Qian, J., Chen, X., Kuehnel, V., Rehmann, J., von Buol, A., ... & Xu, L. (2018). Effects of nonlinear frequency compression on the acoustic properties and recognition of speech sounds in Mandarin Chinese. The Journal of the Acoustical Society of America, 143(3), 1578-1590.
[23] Tong, X., Shen, W., Li, Z., Xu, M., Pan, L., & Tong, S. X. (2020). Phonological, not semantic, activation dominates Chinese character recognition: Evidence from a visual world eye-tracking study. Quarterly Journal of Experimental Psychology, 73(4), 617-628.
[24] Guan, C. Q., & Fraundorf, S. H. (2020). Cross-linguistic word recognition development among Chinese children: A multilevel linear mixed-effects modelling approach. Frontiers in psychology, 11.
[25] Xu, C., & Xiao, X. (2019, May). A Novel Information Integration Algorithm for Speech Recognition System: Basing on Adaptive Clustering and Supervised State of Acoustic Feature. In Journal of Physics: Conference Series (Vol. 1229, No. 1, p. 012073). IOP Publishing.