Leveraging Side Information for Transcription of Spoken Tutorials
Published by
IIT Bombay
Summary
Master's thesis focusing on improving Automatic Speech Recognition Systems through the integration of side information and an Attention-Based Encoder-Decoder model. Highlights: - Formulated and designed a system utilizing web text data and lecture slide data to enhance the accuracy of ASR for spoken-tutorial transcription. - Designed an Attention-Based Encoder-Decoder model specifically to address the complexities of code-switching in speech.