Avery Yen

Public Blog/Website

View My GitHub Profile

Avery Yen

Hi, I’m Avery. I’m a Research Assistant to Professor David Bau at the National Deep Inference Fabric, where I study interpretability of large language models.

Before that, I spent over 12 years as a software engineer, with my longest tenure at Pivotal Labs.

Research Interests

AI Systems Responsibility & Safety

I believe that understanding and elucidating how AI systems learn and think about the world and make decisions is essential for building technology we can trust and depend on.

I am currently working in how adversarial behaviors in LLM pre- and post-training can be understood, detected, and mitigated.

Selected Projects

Contact

Feel free to find me on LinkedIn.