Empirical investigation of neural network acoustic models for speech recognition.
Evaluation of DNNs up to ten times larger than those used in previous work.
Comparison of densely connected, convolutional, and locally-connected untied neural networks.
Results on Switchboard and a combined 2,100-hour corpus.
Explanation of the training recipe for the combined-corpus baseline system, which is now part of the Kaldi speech toolkit.