Design issues in developing speech corpus for Indian languages — A survey
Abstract
Any spoken language system, whether a speech synthesis or a speech recognition system, starts with building a speech corpus. We give a detailed survey of the issues in building a speech corpus for Indian languages. To begin with, an appropriate text file is selected for building the corpus. A corresponding speech file is then generated and stored. This speech file is the phonetic representation of the selected text file. The speech file is processed at different levels, viz., paragraphs, sentences, phrases, words, syllables and phones. These are called the speech units of the file. Research has been carried out taking each of these units as the basic unit for processing. This paper analyses the research done using phones, diphones, triphones, syllables and polysyllables as the basic unit for speech synthesis. Concatenative speech synthesis involves concatenating these basic units to synthesize natural-sounding speech. Each speech unit is annotated with additional relevant information, either manually or automatically by an algorithm. The database consisting of the units along with their associated information is called the speech corpus. Techniques applied to the database to improve the intelligibility of the synthesized speech in speech synthesis systems are also surveyed.
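The idea of a speech corpus as a database of annotated units that are concatenated at synthesis time can be illustrated with a minimal Python sketch. It is not the method of any system covered by the survey. The class names `SpeechUnit` and `SpeechCorpus`, the `info` keys, and the toy sample values are all assumptions made for illustration, and unit selection is reduced to taking the first stored candidate.

```python
from dataclasses import dataclass, field


@dataclass
class SpeechUnit:
    label: str      # unit identity, e.g. the syllable "ka"
    level: str      # "phone", "diphone", "triphone", "syllable", ...
    samples: list   # waveform samples for this unit (toy values here)
    info: dict = field(default_factory=dict)  # extra annotations (hypothetical keys)


class SpeechCorpus:
    """Toy database of speech units indexed by (level, label)."""

    def __init__(self):
        self._units = {}

    def add(self, unit):
        # Several recordings of the same unit may be stored side by side.
        self._units.setdefault((unit.level, unit.label), []).append(unit)

    def lookup(self, level, label):
        candidates = self._units.get((level, label), [])
        if not candidates:
            raise KeyError(f"no {level} unit labelled {label!r}")
        # Naive selection: first stored candidate. A real system would
        # rank candidates using the associated information.
        return candidates[0]

    def synthesize(self, level, labels):
        # Concatenative synthesis in its simplest form: join unit waveforms.
        out = []
        for label in labels:
            out.extend(self.lookup(level, label).samples)
        return out


corpus = SpeechCorpus()
corpus.add(SpeechUnit("ka", "syllable", [0.1, 0.2], {"position": "initial"}))
corpus.add(SpeechUnit("ma", "syllable", [0.3], {"position": "final"}))
waveform = corpus.synthesize("syllable", ["ka", "ma"])
```

Indexing by level and label keeps the same structure usable for any of the basic units discussed above; only the segmentation that fills the database changes.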