Blackboxnlp2024
Excited to announce that our work on detoxifying LM outputs, Mind Your Manners: Detoxifying Language Models via Attention Head Intervention, was accepted to BlackboxNLP 2024 as an extended abstract.
Excited to announce that our work on detoxifying LM outputs, Mind Your Manners: Detoxifying Language Models via Attention Head Intervention, was accepted to BlackboxNLP 2024 as an extended abstract.