📝 Selected Publications
( * indicates equal contribution. Full list of publications)
Preprint.
IEEE SP 2025, DLSP workshop
Position: Adversarial ML Problems Are Getting Harder to Solve and to Evaluate
[IEEE SP 2025, DLSP workshop]
Accepted.
NeruIPS 2025

RealMath: A Continuous Benchmark for Evaluating Language Models on Research-Level Mathematics
[NeruIPS 2025, Dataset $\&$ Benchmark Track]
ICML 2025

SaTML 2025

CCS 2024

ICLR 2025

IEEE SP 2025, DLSP workshop

Blind Baselines Beat Membership Inference Attacks for Foundation Models
[IEEE SP 2025, DLSP workshop]
NeurIPS 2024

AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
[NeurIPS 2024 Dataset $\&$ Benchmark Track]