Marzyeh Ghassemi

Yuxin Xiao

Latest

When Style Breaks Safety: Defending LLMs Against Superficial Style Alignment
KScope: A Framework for Characterizing the Knowledge Status of Language Models
Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions
In the Name of Fairness: Assessing the Bias in Clinical Record De-identification
