Real-time transcription, keyword spotting, archival and retrieval for Telugu TV news using ASR

Authors
  • Pala, Mythilisharan (1)
  • Parayitam, Laxminarayana (1)
  • Appala, Venkataramana (1)
  • (1) Osmania University, Research and Training Unit for Navigational Electronics, Hyderabad, India
Type
Published Article
Journal
International Journal of Speech Technology
Publisher
Springer US
Publication Date
May 03, 2019
Volume
22
Issue
2
Pages
433–439
Identifiers
DOI: 10.1007/s10772-019-09598-6
Source
Springer Nature
License
Yellow

Abstract

This paper describes a system for transcription, keyword spotting and alerting, archival, and retrieval of broadcast Telugu TV news. A real-time automatic speech recognition (ASR) system searches for given keywords in the transcribed text of the broadcast news. The user sets the required keywords, and the system continuously tracks the audio and alerts the user in real time whenever a keyword appears in it. The system transcribes the audio and indexes every word with its time. When a user wants to retrieve data containing a particular word, the system searches the transcribed text in the archives and plays the corresponding audio and video along with the scrolling transcribed text. The system has a full GUI that accepts keywords either through a microphone or from a keyboard by transliteration into Telugu. The ASR system is developed using an annotated speech corpus of 65 h duration with two hundred thousand unique words. The acoustic models are developed using subspace Gaussian mixture models. The language models are developed with Witten-Bell and Kneser–Ney smoothing techniques.
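The abstract's word-level time indexing and keyword alerting can be illustrated with a minimal sketch. This is not the authors' implementation: the names `TimedWord`, `TranscriptIndex`, and `demo`, the romanized sample words, and the timestamps are all hypothetical, and the ASR output is assumed to arrive as a stream of (word, start time) pairs.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class TimedWord:
    """One recognized word with its start time (hypothetical ASR output format)."""
    word: str
    start: float  # seconds from the start of the recording


class TranscriptIndex:
    """Inverted index from each word to the times at which it was spoken."""

    def __init__(self, keywords=()):
        self.keywords = set(keywords)      # words the user wants alerts for
        self.postings = defaultdict(list)  # word -> list of start times

    def add(self, timed_word):
        """Index one word; return True if it is a watched keyword."""
        self.postings[timed_word.word].append(timed_word.start)
        return timed_word.word in self.keywords

    def lookup(self, word):
        """Return every start time at which `word` occurs in the archive."""
        return list(self.postings.get(word, []))


def demo():
    # Illustrative romanized tokens and timestamps, not data from the paper.
    stream = [
        TimedWord("vartalu", 0.4),
        TimedWord("hyderabad", 1.1),
        TimedWord("varsham", 1.9),
        TimedWord("hyderabad", 7.5),
    ]
    index = TranscriptIndex(keywords={"hyderabad"})
    # Alerting: collect the time of each keyword hit as the stream arrives.
    alerts = [w.start for w in stream if index.add(w)]
    # Retrieval: look up archived occurrences of a word, present or absent.
    return alerts, index.lookup("hyderabad"), index.lookup("cricket")
```

In a sketch like this, the times returned by `lookup` would be the seek positions from which playback of the archived audio and video could start.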
