Wencong Xiao

Citado por

	Todos	Desde 2019
Citações	2478	2351
Índice h	15	15
Índice i10	19	19

600

300

150

450

2016201720182019202020212022202320249 21 90 184 311 414 515 585 339

Acesso público

Ver tudo

5 artigos

4 artigos

disponível

não disponível

Com base em autorizações de financiamento

Coautores

Fan YangMicrosoft ResearchEmail confirmado em microsoft.com
Hanyu ZhaoAlibaba GroupEmail confirmado em alibaba-inc.com
Lidong ZhouMicrosoft ResearchEmail confirmado em microsoft.com
Wei LinAlibabaEmail confirmado em alibaba-inc.com
Ming WuMicrosoft ResearchEmail confirmado em microsoft.com
Chen ZhangShanghai Jiao Tong UniversityEmail confirmado em sjtu.edu.cn
Jilong XueMicrosoft ResearchEmail confirmado em microsoft.com
Lintao ZhangMicrosoft Research AsiaEmail confirmado em microsoft.com
Shijie CaoMicrosoft Research AsiaEmail confirmado em microsoft.com
Zhenhua HANMicrosoft Research AsiaEmail confirmado em microsoft.com
Haoxiang LinMicrosoft Research AsiaEmail confirmado em microsoft.com
Zain Zhenyuan RuanMIT CSAILEmail confirmado em csail.mit.edu
Xianyan JiaAlibabaEmail confirmado em alibaba-inc.com
Yangqing JiaFounder, Lepton AIEmail confirmado em daggerfs.com
Yuanwei LuAI Stealth Startup
Bojie Li

Seguir

Wencong Xiao

Alibaba Group

Email confirmado em alibaba-inc.com - Página inicial

Distributed system Machine learning system Resource management


Título Ordenar por citações Ordenar por ano Ordenar por título	Citado por Citado por	Ano
Gandiva: Introspective cluster scheduling for deep learning W Xiao, R Bhardwaj, R Ramjee, M Sivathanu, N Kwatra, Z Han, P Patel, ... 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2018	514	2018
Analysis of {Large-Scale}{Multi-Tenant}{GPU} clusters for {DNN} training workloads M Jeon, S Venkataraman, A Phanishayee, J Qian, W Xiao, F Yang 2019 USENIX Annual Technical Conference (USENIX ATC 19), 947-960, 2019	353	2019
Kv-direct: High-performance in-memory key-value store with programmable nic B Li, Z Ruan, W Xiao, Y Lu, Y Xiong, A Putnam, E Chen, L Zhang Proceedings of the 26th Symposium on Operating Systems Principles, 137-152, 2017	269	2017
{MLaaS} in the wild: Workload analysis and scheduling in {Large-Scale} heterogeneous {GPU} clusters Q Weng, W Xiao, Y Yu, W Wang, C Wang, J He, Y Li, L Zhang, W Lin, ... 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2022	197	2022
Efficient and effective sparse LSTM on FPGA with bank-balanced sparsity S Cao, C Zhang, Z Yao, W Xiao, L Nie, D Zhan, Y Liu, M Wu, L Zhang Proceedings of the 2019 ACM/SIGDA International Symposium on Field …, 2019	196	2019
{AntMan}: Dynamic scaling on {GPU} clusters for deep learning W Xiao, S Ren, Y Li, Y Zhang, P Hou, Z Li, Y Feng, W Lin, Y Jia 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2020	163	2020
GraM: scaling graph computation to the trillions M Wu, F Yang, J Xue, W Xiao, Y Miao, L Wei, H Lin, Y Dai, L Zhou Proceedings of the Sixth ACM Symposium on Cloud Computing, 408-421, 2015	163	2015
Balanced sparsity for efficient dnn inference on gpu Z Yao, S Cao, W Xiao, C Zhang, L Nie Proceedings of the AAAI conference on artificial intelligence 33 (01), 5676-5683, 2019	120	2019
An empirical study on program failures of deep learning jobs R Zhang, W Xiao, H Zhang, Y Liu, H Lin, M Yang Proceedings of the ACM/IEEE 42nd international conference on software …, 2020	88	2020
Seernet: Predicting convolutional neural network feature-map sparsity through low-bit quantization S Cao, L Ma, W Xiao, C Zhang, Y Liu, L Zhang, L Nie, Z Yang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019	83	2019
{Tux²}: Distributed Graph Computation for Machine Learning W Xiao, J Xue, Y Miao, Z Li, C Chen, M Wu, W Li, L Zhou 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2017	79	2017
Multi-tenant GPU Clusters for Deep Learning Workloads: Analysis and Implications M Jeon, S Venkataraman, A Phanishayee, J Qian, W Xiao, F Yang MSR-TR-2018-13, 2018	75	2018
Zico: Efficient {GPU} memory sharing for concurrent {DNN} training G Lim, J Ahn, W Xiao, Y Kwon, M Jeon 2021 USENIX Annual Technical Conference (USENIX ATC 21), 161-175, 2021	40	2021
Whale: Efficient giant model training over heterogeneous {GPUs} X Jia, L Jiang, A Wang, W Xiao, Z Shi, J Zhang, X Li, L Chen, Y Li, ... 2022 USENIX Annual Technical Conference (USENIX ATC 22), 673-688, 2022	34	2022
Memory efficient loss recovery for hardware-based transport in datacenter Y Lu, G Chen, Z Ruan, W Xiao, B Li, J Zhang, Y Xiong, P Cheng, E Chen Proceedings of the First Asia-Pacific Workshop on Networking, 22-28, 2017	27	2017
Infinite-llm: Efficient llm service for long context with distattention and distributed kvcache B Lin, T Peng, C Zhang, M Sun, L Li, H Zhao, W Xiao, Q Xu, X Qiu, S Li, ... arXiv preprint arXiv:2401.02669, 2024	12	2024
Scheduling CPU for GPU-based deep learning jobs W Xiao, Z Han, H Zhao, X Peng, Q Zhang, F Yang, L Zhou Proceedings of the ACM Symposium on Cloud Computing, 503-503, 2018	11	2018
BeamRaster: a practical fast massive MU-MIMO system with pre-computed precoders M Meng, W Xiao, T He, Y Tao, K Tan, J Zhang, W Wang IEEE Transactions on Mobile Computing 18 (5), 1014-1027, 2018	11	2018
Cognn: efficient scheduling for concurrent gnn training on gpus Q Sun, Y Liu, H Yang, R Zhang, M Dun, M Li, X Liu, W Xiao, Y Li, Z Luan, ... SC22: International Conference for High Performance Computing, Networking …, 2022	10	2022
Distributed graph computation meets machine learning W Xiao, J Xue, Y Miao, Z Li, C Chen, M Wu, W Li, L Zhou IEEE Transactions on Parallel and Distributed Systems 31 (7), 1588-1604, 2020	9	2020

O sistema não pode efectuar a operação agora. Tente mais tarde.

Artigos 1–20

Citações por ano

Citações duplicadas

Citações unidas

Adicionar coautoresCoautores

Seguir

Citado por

Coautores