UCSY's Research Repository

Time Delay Neural Network for Myanmar Automatic Speech Recognition

Show simple item record

dc.contributor.author Aung, Myat Aye Aye
dc.contributor.author Pa, Win Pa
dc.date.accessioned 2021-01-31T13:57:38Z
dc.date.available 2021-01-31T13:57:38Z
dc.date.issued 2020-02-28
dc.identifier.uri https://onlineresource.ucsy.edu.mm/handle/123456789/2580
dc.description.abstract Time Delay Neural Network (TDNN) contains in neural network architectures. In Automatic Speech Recognition, TDNN is strong possibility in context modeling and recognizes phonemes and acoustic features, independent of position in time. There are many techniques have been applied for improving Myanmar speech processing. TDNN based acoustic model for Myanmar ASR in this paper. Myanmar language is a low resource language and no precollected data is available. A larger dataset and lexicon than our previous work are applied in this experiment. The speech corpus contains three domains: Names, Web News data and Daily conversational data. The size of the corpus is 77 Hrs and 2 Mins and 11 Secs and include 233 female speakers and 97 male speakers. The performance of TDNN for Myanmar ASR is shown by comparing with Gaussian Mixture Model (GMM) as a baseline system, Deep Neural Network (DNN) and Convolutional Neural Network (CNN). Experiments evaluation is used 2 test data: TestSet1, web news and TestSet2, recorded conversational data. The experimental results show that TDNN outperforms GMM-HMM, DNN and CNN. en_US
dc.language.iso en en_US
dc.publisher Proceedings of the Eighteenth International Conference On Computer Applications (ICCA 2020) en_US
dc.subject GMM-HMM en_US
dc.subject DNN en_US
dc.subject CNN en_US
dc.subject TDNN en_US
dc.subject acoustic modelling en_US
dc.title Time Delay Neural Network for Myanmar Automatic Speech Recognition en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository



Browse

My Account

Statistics