publications

2024

  1. Preprint
    Mitigating Memorization In Language Models
    Mansi Sakarvadia, Aswathy Ajith, Arham Khan, and 6 more authors
    2024
  2. Preprint
    SoK: On Finding Common Ground in Loss Landscapes Using Deep Model Merging Techniques
    Arham Khan, Todd Nief, Nathaniel Hudson, and 6 more authors
    2024
  3. M.S. Thesis
    Towards Interpreting Language Models: A Case Study in Multi-Hop Reasoning
    Mansi Sakarvadia
    University of Chicago, 2024

2023

  1. BlackboxNLP
    Memory Injections: Correcting Multi-Hop Reasoning Failures during Inference in Transformer-Based Language Models
    Mansi Sakarvadia, Aswathy Ajith, Arham Khan, and 5 more authors
    2023
    Accepted to BlackboxNLP 2023.
  2. ATTRIB
    Attention Lens: A Tool for Mechanistically Interpreting the Attention Head Information Retrieval Mechanism
    Mansi Sakarvadia, Arham Khan, Aswathy Ajith, and 5 more authors
    2023
    Accepted to the Workshop on Attributing Model Behavior at Scale (ATTRIB) @ NeurIPS.
  3. BDCAT
    Trillion Parameter AI Serving Infrastructure for Scientific Discovery: A Survey and Vision
    Nathaniel Hudson, J. Gregory Pauloski, Matt Baughman, and 13 more authors
    In IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT2023), 2023
  4. e-Science
    Lazy Python Dependency Management in Large-Scale Systems
    Alok Kamatar, Mansi Sakarvadia, Valerie Hayot-Sasson, and 2 more authors
    In 2023 IEEE 19th International Conference on e-Science (e-Science), 2023

2020

  1. MICCAI
    Atypical Neonate Extra-axial CSF is Associated with Reduced Cognitive Development at Age 1 year (poster)
    Mansi Sakarvadia, Rui Li, SunHyung Kim, and 6 more authors
    Perinatal Preterm and Pediatric Image Analysis workshop at the Medical Image Computing and Computer Assisted Intervention conference, 2020