USING DEEP LEARNING TO IMPROVE SPEAKING SKILLS: A BIBLIOMETRIC REVIEW OF ACADEMIC TRENDS AND RESEARCH DIRECTIONS (2010–2024)

Main Article Content

Abdurakhimova Janar Seyda-Axmetovna

Abstract

This bibliometric review examines the incorporation of deep learning methodologies in enhancing speaking skills within language instruction. This study evaluates 437 peer-reviewed articles published between 2010 and 2024, utilising data from Scopus and Web of Science. The review used VOSviewer and Bibliometrix (R) to analyse publishing trends, identify leading authors and journals, and delineate significant research clusters and citation effects. The results indicate a notable increase in research activity from 2018 onwards, with substantial contributions from China, the United States, and Europe. Despite the increasing importance of studies utilising convolutional neural networks (CNNs), recurrent neural networks (RNNs), and transformers, deficiencies persist in pedagogical alignment, learner effect, and real-time feedback mechanisms. This review provides a framework for future research at the convergence of AI and language acquisition, emphasising customised and scalable speaking skill development.

Article Details

Section
Articles

References

Amodei, D., et al. (2016). Deep Speech 2. ICML.

Bengio, Y. (2009). Deep architectures for AI. Foundations and Trends in Machine Learning.

Li, X., Wang, E., & Liu, H. (2018). Deep pronunciation modeling for ESL learners. Computer Assisted Language Learning, 31(4), 421–436.

Park, J., Kim, H., & Lee, S. (2019). Intelligent AI tutor for ESL speaking practice. IEEE Transactions on Learning Technologies, 12(2), 122–135.

Povey, D., et al. (2011). Kaldi toolkit. IEEE ASRU.

Wang, L., et al. (2018). Automated scoring for spontaneous speech. Speech Communication.

Xiong, W., et al. (2019). Microsoft 2019 speech recognition system. ICASSP.

Zhang, X., et al. (2021). Speech recognition in AI-enhanced education: A deep learning approach. Language Learning & Technology, 25(1), 14–33.

Zhao, Y., & Wang, M. (2020). Emotion-aware spoken language training using deep learning. Speech Communication, 112, 52–60.