Publications

(2024). Language Model Alignment in Multilingual Trolley Problems. NeurIPS PLuralistic Alignment Workshop 2024 Oral.

Arxiv

(2024). Implicit Personalization in Language Models: A Systematic Study. EMNLP 2024 Findings.

Arxiv

(2024). Synatra: Turning indirect knowledge into direct demonstrations for digital agents at scale. NeurIPS 2024.

Arxiv

(2024). Inducing Elasticity in Foundation Models: Post-Training Techniques for Adaptable Inference. NeurIPS ENLSP Workshop 2024.

(2024). Automatic Generation of Model and Data Cards: A Step Towards Responsible AI. NAACL 2024 Oral.

Arxiv

(2024). Analyzing the Role of Semantic Representations in the Era of Large Language Models. NAACL 2024.

Arxiv

(2023). Can Large Language Models Infer Causation from Correlation?. ICLR 2024.

Arxiv

(2023). Bias Amplification Enhances Minority Group Performance. TMLR 2024.

Arxiv

(2023). Voices of Her: Analyzing Gender Differences in the AI Publication World. Preprint 2023.

Arxiv