Marzyeh Ghassemi
Research
Papers
People
Teaching
ML+Health Seminar 2023
Yuxin Xiao
Latest
When Style Breaks Safety: Defending LLMs Against Superficial Style Alignment
KScope: A Framework for Characterizing the Knowledge Status of Language Models
Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions
In the Name of Fairness: Assessing the Bias in Clinical Record De-identification
Cite
×