Follow
Kaisheng Yao
Kaisheng Yao
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
14902023
Recent advances in deep learning for speech research at Microsoft
L Deng, J Li, JT Huang, K Yao, D Yu, F Seide, M Seltzer, G Zweig, X He, ...
2013 IEEE international conference on acoustics, speech and signal …, 2013
10472013
Using recurrent neural networks for slot filling in spoken language understanding
G Mesnil, Y Dauphin, K Yao, Y Bengio, L Deng, D Hakkani-Tur, X He, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing 23 (3), 530-539, 2014
7682014
CNTK: Microsoft's open-source deep-learning toolkit
F Seide, A Agarwal
Proceedings of the 22nd ACM SIGKDD international conference on knowledge …, 2016
6432016
KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition
D Yu, K Yao, H Su, G Li, F Seide
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
5242013
An introduction to computational networks and the computational network toolkit
D Yu, A Eversole, M Seltzer, K Yao, Z Huang, B Guenter, O Kuchaiev, ...
Microsoft Technical Report MSR-TR-2014–112, 2014
4752014
Recurrent neural networks for language understanding.
K Yao, G Zweig, MY Hwang, Y Shi, D Yu
Interspeech, 2524-2528, 2013
4202013
Spoken language understanding using long short-term memory neural networks
K Yao, B Peng, Y Zhang, D Yu, G Zweig, Y Shi
2014 IEEE spoken language technology workshop (SLT), 189-194, 2014
4062014
Highway long short-term memory rnns for distant speech recognition
Y Zhang, G Chen, D Yu, K Yao, S Khudanpur, J Glass
2016 IEEE international conference on acoustics, speech and signal …, 2016
3642016
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, 2024
3582024
Assignment of semantic labels to a sequence of words using neural network architectures
A Deoras, K Yao, X He, L Deng, GG Zweig, R Sarikaya, D Yu, MY Hwang, ...
US Patent 10,867,597, 2020
2942020
Adaptation of context-dependent deep neural networks for automatic speech recognition
K Yao, D Yu, F Seide, H Su, L Deng, Y Gong
2012 IEEE Spoken Language Technology Workshop (SLT), 366-369, 2012
2572012
Incorporating structural alignment biases into an attentional neural translation model
T Cohn, CDV Hoang, E Vymolova, K Yao, C Dyer, G Haffari
arXiv preprint arXiv:1601.01085, 2016
1992016
Sequence-to-sequence neural net models for grapheme-to-phoneme conversion
K Yao, G Zweig
arXiv preprint arXiv:1506.00196, 2015
1992015
System and method for text-to-phoneme mapping with prior knowledge
K Yao
US Patent App. 11/278,497, 2007
1712007
Recurrent conditional random field for language understanding
K Yao, B Peng, G Zweig, D Yu, X Li, F Gao
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
1652014
Hyper-structure recurrent neural networks for text-to-speech
P Zhao, M Leung, K Yao, B Yan, S Zhao, FA Alleva
US Patent 10,127,901, 2018
1482018
Attention with intention for a neural network conversation model
K Yao, G Zweig, B Peng
arXiv preprint arXiv:1510.08565, 2015
1452015
Depth-gated LSTM
K Yao, T Cohn, K Vylomova, K Duh, C Dyer
arXiv preprint arXiv:1508.03790, 2015
1272015
Depth-gated recurrent neural networks
K Yao, T Cohn, K Vylomova, K Duh, C Dyer
arXiv preprint arXiv:1508.03790 9, 98, 2015
1132015
The system can't perform the operation now. Try again later.
Articles 1–20