A Survey of Recent DNN Architectures on the TIMIT Phone Recognition Task
June 19, 2018 ยท The Cartographer ยท ๐ International Conference on Text, Speech and Dialogue
"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Survey of Recent DNN Architectures on the TIMIT Phone Recognition Task"
Evidence collected by the PWNC Scanner
Authors
Josef Michalek, Jan Vanek
arXiv ID
1806.07974
Category
cs.CL: Computation & Language
Cross-listed
cs.HC
Citations
16
Venue
International Conference on Text, Speech and Dialogue
Last Checked
2 days ago
Abstract
In this survey paper, we have evaluated several recent deep neural network (DNN) architectures on a TIMIT phone recognition task. We chose the TIMIT corpus due to its popularity and broad availability in the community. It also simulates a low-resource scenario that is helpful in minor languages. Also, we prefer the phone recognition task because it is much more sensitive to an acoustic model quality than a large vocabulary continuous speech recognition (LVCSR) task. In recent years, many DNN published papers reported results on TIMIT. However, the reported phone error rates (PERs) were often much higher than a PER of a simple feed-forward (FF) DNN. That was the main motivation of this paper: To provide a baseline DNNs with open-source scripts to easily replicate the baseline results for future papers with lowest possible PERs. According to our knowledge, the best-achieved PER of this survey is better than the best-published PER to date.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Computation & Language
๐
๐
Old Age
๐
๐
Old Age
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
๐
๐
Old Age
XLNet: Generalized Autoregressive Pretraining for Language Understanding
๐ฎ
๐ฎ
The Ethereal
Effective Approaches to Attention-based Neural Machine Translation
๐
๐
Old Age
A large annotated corpus for learning natural language inference
๐
๐
Old Age