Mansi Sakarvadia
  • about
  • news
  • publications
  • blog
  • teaching

Blackboxnlp2024

October 3, 2024

2024

Excited to announce that our work on detoxifying LM outputs, Mind Your Manners: Detoxifying Language Models via Attention Head Intervention, was accepted to BlackboxNLP 2024 as an extended abstract.

© Copyright 2025 Mansi Sakarvadia. Powered by Jekyll with al-folio theme. Hosted by GitHub Pages. Last updated: June 04, 2025.