Speech recognition system for Tamil language using Cmusphinx

dc.contributor.authorFANOON, A. R. F. S.
dc.contributor.authorUWANTHIKA, G. A. I.
dc.date.accessioned2019-12-14T09:05:34Z
dc.date.available2019-12-14T09:05:34Z
dc.date.issued2019-11-27
dc.description.abstractSpeech recognition is the process of converting an audio signal into sequences of words paving the way for enabling a rich human-computer interaction in many emerging applications. It is a very difficult and a complex task to recognize speech using a computer though several approaches have been made to develop an accurate speech recognition system. Most of the researches have been done for English, Chinese, mandarin and Arabic languages while a very few have been done for Tamil language. Therefore, this research intends to develop a continuous speech recognition system for Tamil language. Tamil language has 12 vowels and 18 consonants. This paper is focused on building a Tamil speech recognition system using Cmusphinx toolkit. The system was developed for several desktop applications with the aim that in the future, this system could be integrated with those applications and executes those applications through Tamil speech input rather than typing and Clickin. Audio recordings from several people were recorded in different environment which were later converted to wav file format. A text corpus, transcription file and fileids file for both training and testing database were prepared accordingly. Three models namely acoustic model, language model and lexicon model have been developed. Hidden Markov model was employed for building acoustic model. The performance of the system over various speaker subsets of different sex, age and dialect was examined. The word error rate was calculated in order to measure the performance of the system while the accuracy of the system was calculated using another formula. The results from both the measurements showed a satisfactory performance.en_US
dc.identifier.citation9th International Symposium 2019 on “Promoting Multidisciplinary Academic Research and Innovation”. 27th - 28th November 2019. South Eastern University of Sri Lanka, University Park, Oluvil, Sri Lanka. pp. 666-674.en_US
dc.identifier.isbn978-955-627-189-8
dc.identifier.urihttp://ir.lib.seu.ac.lk/handle/123456789/4094
dc.language.isoen_USen_US
dc.publisherSouth Eastern University of Sri Lanka, University Park, Oluvil, Sri Lanka.en_US
dc.subjectHuman-computer interactionen_US
dc.subjectHidden Markov Modelen_US
dc.subjectAcoustic modelen_US
dc.subjectWord error rateen_US
dc.titleSpeech recognition system for Tamil language using Cmusphinxen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Final Proceedings - Page 685-693.pdf
Size:
691.9 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: