Enhancement of Learning using Speech Recognition and Lecture Transcription: A Survey

Research Article

Published in July 2014 by Ashwini B V, Laxmi B Rananavare

International Conference on Information and Communication Technologies

Foundation of Computer Science USA

ICICT - Number 6

July 2014

Authors: Ashwini B V, Laxmi B Rananavare

Ashwini B V, Laxmi B Rananavare. Enhancement of Learning using Speech Recognition and Lecture Transcription: A Survey. International Conference on Information and Communication Technologies. ICICT, 6 (July 2014), 6-11.

@article{
author = { Ashwini B V and Laxmi B Rananavare },
title = { Enhancement of Learning using Speech Recognition and Lecture Transcription: A Survey },
journal = { International Conference on Information and Communication Technologies },
issue_date = { July 2014 },
volume = { ICICT },
number = { 6 },
month = { July },
year = { 2014 },
issn = { 0975-8887 },
pages = { 6-11 },
numpages = { 6 },
url = { /proceedings/icict/number6/18004-1461/ },
publisher = { Foundation of Computer Science (FCS), NY, USA },
address = { New York, USA }
}

%0 Proceeding Article
%1 International Conference on Information and Communication Technologies
%A Ashwini B V
%A Laxmi B Rananavare
%T Enhancement of Learning using Speech Recognition and Lecture Transcription: A Survey
%J International Conference on Information and Communication Technologies
%@ 0975-8887
%V ICICT
%N 6
%P 6-11
%D 2014
%I International Journal of Computer Applications

Abstract

Speech recognition (SR) technologies were evaluated in different classroom environments to help students automatically convert oral lectures into text. Two distinct methods of SR-mediated lecture acquisition (SR-mLA), real-time captioning (RTC) and post-lecture transcription (PLT), have been developed to increase word recognition accuracy. The two methods have been compared on the technical feasibility and reliability of classroom implementation, instructors' experiences, word recognition accuracy, and student class performance. RTC provided a near-instantaneous display of the instructor's speech for students during class. PLT employed a user-independent SR algorithm to generate multimedia class notes, with lecture transcripts synchronized to the instructor's audio, for students to access online after class. PLT was found to provide higher word recognition accuracy than RTC. The potential benefits of SR-mLA for students who have difficulty taking notes accurately and independently are discussed, particularly for non-native English speakers and students with disabilities.
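The abstract compares RTC and PLT on word recognition accuracy but does not say how that figure is computed. A minimal sketch follows, assuming the common convention of accuracy = 1 - WER, where the word error rate (WER) is the word-level edit distance between a reference transcript and the SR output, divided by the number of reference words. The function name `word_accuracy` and the sample sentences are illustrative only and are not taken from the paper.

```python
def word_accuracy(reference: str, hypothesis: str) -> float:
    """Return 1 - WER, with WER = word-level edit distance / reference length.

    Assumption: this is one common definition; the surveyed studies may
    have measured accuracy differently.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,           # deletion
                            curr[j - 1] + 1,       # insertion
                            prev[j - 1] + cost))   # substitution or match
        prev = curr
    return 1.0 - prev[-1] / len(ref)


# Hypothetical example: two substituted words out of eight.
spoken = "the cell membrane controls what enters the cell"
recognized = "the cell membrane control what enters a cell"
print(word_accuracy(spoken, recognized))  # 0.75
```

Under this definition the value can fall below zero when the recognizer inserts many spurious words, so reported accuracies are only comparable when the same reference transcripts and text normalization are used for both methods.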

Index Terms

Computer Science

Information Sciences

Keywords

Educational Technology, Electronic Learning, Multimedia Systems, Notetaking, Speech Recognition