人工智能求问 各位有推荐的有关语音识别的具体研究方向吗 有推荐的相关论文吗
声学建模:包括语音信号的前端处理、特征提取、声学模型的设计和训练等。
语言建模:包括语言模型的设计和训练、语音识别中的上下文信息处理等。
模型融合:包括声学模型和语言模型的融合、多模态信息融合等。
端到端模型:包括基于神经网络的端到端语音识别模型的设计和优化等。
关于论文推荐,以下是一些经典的语音识别论文:
Hinton, G. E., Deng, L., Yu, D., Dahl, G. E., Mohamed, A. R., Jaitly, N., ... & Kingsbury, B. (2012). Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. IEEE Signal Processing Magazine, 29(6), 82-97.
Graves, A., Mohamed, A. R., & Hinton, G. (2013). Speech recognition with deep recurrent neural networks. In IEEE International Conference on Acoustics, Speech and Signal Processing (pp. 6645-6649).
Amodei, D., Ananthanarayanan, S., Anubhai, R., Bai, J., Battenberg, E., Case, C., ... & Ranzato, M. (2016). Deep speech 2: End-to-end speech recognition in English and Mandarin. In International Conference on Machine Learning (pp. 173-182).
Chorowski, J. K., Bahdanau, D., Serdyuk, D., Cho, K., & Bengio, Y. (2015). Attention-based models for speech recognition. In Advances in Neural Information Processing Systems (pp. 577-585).
Kim, Y., Jernite, Y., Sontag, D., & Rush, A. M. (2016). Character-aware neural language models. In Thirtieth AAAI Conference on Artificial Intelligence.
希望对您有帮助。
不知道你这个问题是否已经解决, 如果还没有解决的话: