Time Delay Neural Network for Myanmar Automatic Speech Recognition

Aung, Myat Aye Aye; Pa, Win Pa

UCSYRR Home
/
Conferences
/
International Conference on Computer Applications (ICCA)
/
Eighteenth International Conference On Computer Applications (ICCA 2020)
/
View Item

dc.contributor.author	Aung, Myat Aye Aye
dc.contributor.author	Pa, Win Pa
dc.date.accessioned	2021-01-31T13:57:38Z
dc.date.available	2021-01-31T13:57:38Z
dc.date.issued	2020-02-28
dc.identifier.uri	https://onlineresource.ucsy.edu.mm/handle/123456789/2580
dc.description.abstract	Time Delay Neural Network (TDNN) contains in neural network architectures. In Automatic Speech Recognition, TDNN is strong possibility in context modeling and recognizes phonemes and acoustic features, independent of position in time. There are many techniques have been applied for improving Myanmar speech processing. TDNN based acoustic model for Myanmar ASR in this paper. Myanmar language is a low resource language and no precollected data is available. A larger dataset and lexicon than our previous work are applied in this experiment. The speech corpus contains three domains: Names, Web News data and Daily conversational data. The size of the corpus is 77 Hrs and 2 Mins and 11 Secs and include 233 female speakers and 97 male speakers. The performance of TDNN for Myanmar ASR is shown by comparing with Gaussian Mixture Model (GMM) as a baseline system, Deep Neural Network (DNN) and Convolutional Neural Network (CNN). Experiments evaluation is used 2 test data: TestSet1, web news and TestSet2, recorded conversational data. The experimental results show that TDNN outperforms GMM-HMM, DNN and CNN.	en_US
dc.language.iso	en	en_US
dc.publisher	Proceedings of the Eighteenth International Conference On Computer Applications (ICCA 2020)	en_US
dc.subject	GMM-HMM	en_US
dc.subject	DNN	en_US
dc.subject	CNN	en_US
dc.subject	TDNN	en_US
dc.subject	acoustic modelling	en_US
dc.title	Time Delay Neural Network for Myanmar Automatic Speech Recognition	en_US
dc.type	Article	en_US