Wencong Xiao
Alibaba Group
Verified email at alibaba-inc.com
Title
Cited by
Year
Gandiva: Introspective cluster scheduling for deep learning
W Xiao, R Bhardwaj, R Ramjee, M Sivathanu, N Kwatra, Z Han, P Patel, ...
13th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2018
Cited by: 488; Year: 2018
Analysis of Large-Scale Multi-Tenant GPU clusters for DNN training workloads
M Jeon, S Venkataraman, A Phanishayee, J Qian, W Xiao, F Yang
2019 USENIX Annual Technical Conference (USENIX ATC 19), 947-960, 2019
Cited by: 324; Year: 2019
KV-Direct: High-performance in-memory key-value store with programmable NIC
B Li, Z Ruan, W Xiao, Y Lu, Y Xiong, A Putnam, E Chen, L Zhang
Proceedings of the 26th Symposium on Operating Systems Principles, 137-152, 2017
Cited by: 265; Year: 2017
Efficient and effective sparse LSTM on FPGA with bank-balanced sparsity
S Cao, C Zhang, Z Yao, W Xiao, L Nie, D Zhan, Y Liu, M Wu, L Zhang
Proceedings of the 2019 ACM/SIGDA International Symposium on Field …, 2019
Cited by: 189; Year: 2019
GraM: scaling graph computation to the trillions
M Wu, F Yang, J Xue, W Xiao, Y Miao, L Wei, H Lin, Y Dai, L Zhou
Proceedings of the Sixth ACM Symposium on Cloud Computing, 408-421, 2015
Cited by: 162; Year: 2015
MLaaS in the wild: Workload analysis and scheduling in Large-Scale heterogeneous GPU clusters
Q Weng, W Xiao, Y Yu, W Wang, C Wang, J He, Y Li, L Zhang, W Lin, ...
19th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2022
Cited by: 161; Year: 2022
AntMan: Dynamic scaling on GPU clusters for deep learning
W Xiao, S Ren, Y Li, Y Zhang, P Hou, Z Li, Y Feng, W Lin, Y Jia
14th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2020
Cited by: 150; Year: 2020
Balanced sparsity for efficient DNN inference on GPU
Z Yao, S Cao, W Xiao, C Zhang, L Nie
Proceedings of the AAAI conference on artificial intelligence 33 (01), 5676-5683, 2019
Cited by: 118; Year: 2019
An empirical study on program failures of deep learning jobs
R Zhang, W Xiao, H Zhang, Y Liu, H Lin, M Yang
Proceedings of the ACM/IEEE 42nd international conference on software …, 2020
Cited by: 81; Year: 2020
SeerNet: Predicting convolutional neural network feature-map sparsity through low-bit quantization
S Cao, L Ma, W Xiao, C Zhang, Y Liu, L Zhang, L Nie, Z Yang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by: 80; Year: 2019
Tux²: Distributed Graph Computation for Machine Learning
W Xiao, J Xue, Y Miao, Z Li, C Chen, M Wu, W Li, L Zhou
14th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2017
Cited by: 75; Year: 2017
Multi-tenant GPU Clusters for Deep Learning Workloads: Analysis and Implications
M Jeon, S Venkataraman, A Phanishayee, J Qian, W Xiao, F Yang
MSR-TR-2018-13, 2018
Cited by: 73; Year: 2018
Zico: Efficient GPU memory sharing for concurrent DNN training
G Lim, J Ahn, W Xiao, Y Kwon, M Jeon
2021 USENIX Annual Technical Conference (USENIX ATC 21), 161-175, 2021
Cited by: 33; Year: 2021
Whale: Efficient giant model training over heterogeneous GPUs
X Jia, L Jiang, A Wang, W Xiao, Z Shi, J Zhang, X Li, L Chen, Y Li, ...
2022 USENIX Annual Technical Conference (USENIX ATC 22), 673-688, 2022
Cited by: 29; Year: 2022
Memory efficient loss recovery for hardware-based transport in datacenter
Y Lu, G Chen, Z Ruan, W Xiao, B Li, J Zhang, Y Xiong, P Cheng, E Chen
Proceedings of the First Asia-Pacific Workshop on Networking, 22-28, 2017
Cited by: 25; Year: 2017
Scheduling CPU for GPU-based deep learning jobs
W Xiao, Z Han, H Zhao, X Peng, Q Zhang, F Yang, L Zhou
Proceedings of the ACM Symposium on Cloud Computing, 503-503, 2018
Cited by: 11; Year: 2018
BeamRaster: a practical fast massive MU-MIMO system with pre-computed precoders
M Meng, W Xiao, T He, Y Tao, K Tan, J Zhang, W Wang
IEEE Transactions on Mobile Computing 18 (5), 1014-1027, 2018
Cited by: 11; Year: 2018
Distributed graph computation meets machine learning
W Xiao, J Xue, Y Miao, Z Li, C Chen, M Wu, W Li, L Zhou
IEEE Transactions on Parallel and Distributed Systems 31 (7), 1588-1604, 2020
Cited by: 9; Year: 2020
CoGNN: efficient scheduling for concurrent GNN training on GPUs
Q Sun, Y Liu, H Yang, R Zhang, M Dun, M Li, X Liu, W Xiao, Y Li, Z Luan, ...
SC22: International Conference for High Performance Computing, Networking …, 2022
Cited by: 8; Year: 2022
EasyScale: Accuracy-consistent elastic training for deep learning
M Li, W Xiao, B Sun, H Zhao, H Yang, S Ren, Z Luan, X Jia, Y Liu, Y Li, ...
arXiv preprint arXiv:2208.14228, 2022
Cited by: 4; Year: 2022