Follow
Anyi Rao
Anyi Rao
Assistant Professor, Arts and Machine Creativity, HKUST
Verified email at ust.hk - Homepage
Title
Cited by
Cited by
Year
Adding Conditional Control to Text-to-Image Diffusion Models
L Zhang, A Rao, M Agrawala
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
24262023
HotFlip: White-Box Adversarial Examples for Text Classification
J Ebrahimi, A Rao, D Lowd, D Dou
Proceedings of Annual Meeting of the Association for Computational Linguistics, 2018
11822018
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Y Guo, C Yang, A Rao, Z Liang, Y Wang, Y Qiao, M Agrawala, D Lin, ...
International Conference on Learning Representations, 2024
3432024
MovieNet: A Holistic Dataset for Movie Understanding
Q Huang, Y Xiong, A Rao, J Wang, D Lin
European Conference on Computer Vision, 2020
2292020
BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering
Y Xiangli, L Xu, X Pan, N Zhao, A Rao, C Theobalt, B Dai, D Lin
European Conference on Computer Vision, 2022
1942022
A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
A Rao, L Xu, Y Xiong, G Xu, Q Huang, B Zhou, D Lin
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
1402020
A Molecular Multimodal Foundation Model Associating Molecule Graphs with Natural Language
B Su, D Du, Z Yang, Y Zhou, J Li, A Rao, H Sun, Z Lu, JR Wen
arXiv preprint arXiv:2209.05481, 2022
89*2022
A Unified Framework for Shot Type Classification Based on Subject Centric Lens
A Rao, J Wang, L Xu, X Jiang, Q Huang, B Zhou, D Lin
European Conference on Computer Vision, 2020
722020
CityNeRF: Building NeRF at City Scale
Y Xiangli, L Xu, X Pan, N Zhao, A Rao, C Theobalt, B Dai, D Lin
arXiv preprint arXiv:2112.05504, 2021
502021
Online Multi-modal Person Search in Videos
J Xia, A Rao*, Q Huang, L Xu, J Wen, D Lin
European Conference on Computer Vision, 2020
352020
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
Y Guo, C Yang, A Rao, M Agrawala, D Lin, B Dai
European Conference on Computer Vision, 2023
342023
ControlNet
L Zhang, A Rao, M Agrawala
30*2023
Self-supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences
Y Zhou, H Duan, A Rao, B Su, J Wang
Proceedings of the AAAI Conference on Artificial Intelligence, 2023
242023
White-Box Adversarial Examples for NLP
J Ebrahimi, A Rao, D Lowd, D Dou
arXiv preprint arXiv:1712.06751, 2017
20*2017
Dynamic Storyboard Generation in an Engine-based Virtual Environment for Video Production
A Rao, X Jiang, Y Guo, L Xu, L Yang, L Jin, D Lin, B Dai
ACM SIGGRAPH Special Interest Group on Computer Graphics and Interactive …, 2023
142023
BlockPlanner: City Block Generation With Vectorized Graph Representation
L Xu, Y Xiangli, A Rao, N Zhao, B Dai, Z Liu, D Lin
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
142021
Jointly Learning the Attributes and Composition of Shots for Boundary Detection in Videos
X Jiang, L Jin, A Rao*, L Xu, D Lin
IEEE Transactions on Multimedia, 2021
112021
AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation
X Liu, X Xu, A Rao, C Gan, L Yi
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
102022
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers
D Shi, C Tao, A Rao, Z Yang, C Yuan, J Wang
International Conference on Machine Learning, 2024
92024
Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and Maximization
Y Zhou, W Qiang, A Rao, N Lin, B Su, J Wang
Proceedings of the 31st ACM International Conference on Multimedia, 2023
92023
The system can't perform the operation now. Try again later.
Articles 1–20