Zhengxuan Wu
Verified email at stanford.edu
Title
Cited by
Year
Dynabench: Rethinking Benchmarking in NLP
D Kiela, M Bartolo, Y Nie, D Kaushik, A Geiger, Z Wu, B Vidgen, G Prasad, ...
NAACL 2021, 2021
Cited by: 379; Year: 2021
MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
Z Zhong, Z Wu, CD Manning, C Potts, D Chen
EMNLP 2023, 2023
Cited by: 121; Year: 2023
Context-Guided BERT for Targeted Aspect-Based Sentiment Analysis
Z Wu, DC Ong
AAAI 2021, 2020
Cited by: 93; Year: 2020
DynaSent: A Dynamic Benchmark for Sentiment Analysis
C Potts, Z Wu, A Geiger, D Kiela
ACL 2021, 2020
Cited by: 80; Year: 2020
Modeling emotion in complex stories: the Stanford Emotional Narratives Dataset
D Ong, Z Wu, ZX Tan, M Reddan, I Kahhale, A Mattek, J Zaki
IEEE Transactions on Affective Computing 2019, 2019
Cited by: 76; Year: 2019
Inducing causal structure for interpretable neural networks
A Geiger, Z Wu, H Lu, J Rozner, E Kreiss, T Icard, ND Goodman, C Potts
ICML 2022, 2021
Cited by: 73; Year: 2021
Interpretability at scale: Identifying causal mechanisms in Alpaca
Z Wu, A Geiger, C Potts, ND Goodman
NeurIPS 2023, 2023
Cited by: 72; Year: 2023
Rotating online behavior change interventions increases effectiveness but also increases attrition
G Kovacs, Z Wu, MS Bernstein
CSCW 2018, 2018
Cited by: 69; Year: 2018
Finding alignments between interpretable causal variables and distributed neural representations
A Geiger, Z Wu, C Potts, T Icard, N Goodman
Causal Learning and Reasoning, 160-187, 2024
Cited by: 68; Year: 2024
Causal Abstraction: A Theoretical Foundation for Mechanistic Interpretability
A Geiger, D Ibeling, A Zur, M Chaudhary, S Chauhan, J Huang, A Arora, ...
arXiv preprint arXiv:2301.04709, 2024
Cited by: 52*; Year: 2024
CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior
ED Abraham, K D'Oosterlinck, A Feder, YO Gat, A Geiger, C Potts, ...
NeurIPS 2022, 2022
Cited by: 42; Year: 2022
Mapping the increasing use of LLMs in scientific papers
W Liang, Y Zhang, Z Wu, H Lepp, W Ji, X Zhao, H Cao, S Liu, S He, ...
CoLM 2024, 2024
Cited by: 41; Year: 2024
Causal Proxy Models for Concept-Based Model Explanations
Z Wu, K D'Oosterlinck, A Geiger, A Zur, C Potts
ICML 2023, 2022
Cited by: 32; Year: 2022
Conservation of Procrastination: Do Productivity Interventions Save Time or Just Redistribute It?
G Kovacs, DM Gregory, Z Ma, Z Wu, G Emami, J Ray, MS Bernstein
CHI 2019, 2019
Cited by: 32; Year: 2019
On explaining your explanations of BERT: An empirical study with sequence classification
Z Wu, DC Ong
arXiv preprint arXiv:2101.00196, 2021
Cited by: 30; Year: 2021
Rigorously Assessing Natural Language Explanations of Neurons
J Huang, A Geiger, K D'Oosterlinck, Z Wu, C Potts
EMNLP 2023 @BlackboxNLP, 2023
Cited by: 26; Year: 2023
ZeroC: A neuro-symbolic model for zero-shot concept recognition and acquisition at inference time
T Wu, M Tjandrasuwita, Z Wu, X Yang, K Liu, R Sosič, J Leskovec
NeurIPS 2022, 2022
Cited by: 26; Year: 2022
ReFT: Representation finetuning for language models
Z Wu, A Arora, Z Wang, A Geiger, D Jurafsky, CD Manning, C Potts
NeurIPS 2024 spotlight, 2024
Cited by: 25; Year: 2024
Causal Distillation for Language Models
Z Wu, A Geiger, J Rozner, E Kreiss, H Lu, T Icard, C Potts, ND Goodman
NAACL 2022, 2021
Cited by: 22; Year: 2021
Attending to emotional narratives
Z Wu, X Zhang, T Zhi-Xuan, J Zaki, DC Ong
IEEE ACII 2019, 2019
Cited by: 22; Year: 2019