Follow
Weizhu Chen
Weizhu Chen
Verified email at microsoft.com - Homepage
Title
Cited by
Cited by
Year
On the variance of the adaptive learning rate and beyond
L Liu, H Jiang, P He, W Chen, X Liu, J Gao, J Han
arXiv preprint arXiv:1908.03265, 2019
10602019
Multi-task deep neural networks for natural language understanding
X Liu, P He, W Chen, J Gao
arXiv preprint arXiv:1901.11504, 2019
8842019
Deberta: Decoding-enhanced bert with disentangled attention
P He, X Liu, J Gao, W Chen
arXiv preprint arXiv:2006.03654, 2020
4582020
Reasonet: Learning to stop reading in machine comprehension
Y Shen, PS Huang, J Gao, W Chen
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge …, 2017
2912017
Short text conceptualization using a probabilistic knowledgebase
Y Song, H Wang, Z Wang, H Li, W Chen
Twenty-second international joint conference on artificial intelligence, 2011
2642011
Fusionnet: Fusing via fully-aware attention with application to machine comprehension
HY Huang, C Zhu, Y Shen, W Chen
arXiv preprint arXiv:1711.07341, 2017
1742017
Smart: Robust and efficient fine-tuning for pre-trained natural language models through principled regularized optimization
H Jiang, P He, W Chen, X Liu, J Gao, T Zhao
arXiv preprint arXiv:1911.03437, 2019
1672019
Document transformation for multi-label feature selection in text categorization
W Chen, J Yan, B Zhang, Z Chen, Q Yang
Seventh IEEE International Conference on Data Mining (ICDM 2007), 451-456, 2007
1502007
Improving multi-task deep neural networks via knowledge distillation for natural language understanding
X Liu, P He, W Chen, J Gao
arXiv preprint arXiv:1904.09482, 2019
1292019
A novel click model and its applications to online advertising
ZA Zhu, W Chen, T Minka, C Zhu, Z Chen
Proceedings of the third ACM international conference on Web search and data …, 2010
1202010
User-click modeling for understanding and predicting search-behavior
Y Zhang, W Chen, D Wang, Q Yang
Proceedings of the 17th ACM SIGKDD international conference on Knowledge …, 2011
1152011
Understanding the difficulty of training transformers
L Liu, X Liu, J Gao, W Chen, J Han
arXiv preprint arXiv:2004.08249, 2020
932020
P-packSVM: Parallel primal gradient descent kernel SVM
AZ Zeyuan, C Weizhu, W Gang, Z Chenguang, C Zheng
2009 Ninth IEEE International Conference on Data Mining, 677-686, 2009
932009
Characterizing search intent diversity into click models
B Hu, Y Zhang, W Chen, G Wang, Q Yang
Proceedings of the 20th international conference on World wide web, 17-26, 2011
842011
Personalized click model through collaborative filtering
S Shen, B Hu, W Chen, Q Yang
Proceedings of the fifth ACM international conference on Web search and data …, 2012
812012
Internet visualization system and related user interfaces
M Wang, W Chen, B Zhang, Z Chen, J Wang
US Patent 7,873,904, 2011
762011
Adversarial training for large neural language models
X Liu, H Cheng, P He, W Chen, Y Wang, H Poon, J Gao
arXiv preprint arXiv:2004.08994, 2020
732020
Large-scale L-BFGS using MapReduce
W Chen, Z Wang, J Zhou
Advances in neural information processing systems 27, 2014
692014
Beyond ten blue links: enabling user click modeling in federated web search
D Chen, W Chen, H Wang, Z Chen, Q Yang
Proceedings of the fifth ACM international conference on Web search and data …, 2012
692012
What Makes Good In-Context Examples for GPT-?
J Liu, D Shen, Y Zhang, B Dolan, L Carin, W Chen
arXiv preprint arXiv:2101.06804, 2021
682021
The system can't perform the operation now. Try again later.
Articles 1–20