Arnab Sen Sharma

PhD Student

Arnab Sen Sharma

Research interests

  • Interpretable AI 

Education

  • BS in Computer Science and Engineering, Shahjalal University of Science and Engineering 

Biography

Arnab Sen Sharma is a PhD student at the Khoury College of Computer Sciences at Northeastern University. He earned his bachelor’s in computer science and engineering from the Shahjalal University of Science and Engineering. He is affiliated with the Interpretable Neural Networks Lab (BauLab).

Sharma’s research area is machine learning, and his research focus is analyzing language model architectures by dissembling them in order to enhance or align these models for practical real-world applications. His faculty advisor is Dr. David Bau. 

Recent publications

  • Language Models use Lookbacks to Track Beliefs

    Citation: Nikhil Prakash, Natalie Shapira, Arnab Sen Sharma, Christoph Riedl, Yonatan Belinkov, Tamar Rott Shaham, David Bau, Atticus Geiger. (2025). Language Models use Lookbacks to Track Beliefs CoRR, abs/2505.14685. https://doi.org/10.48550/arXiv.2505.14685
  • NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals

    Citation: Jaden Fried Fiotto-Kaufman, Alexander Russell Loftus, Eric Todd, Jannik Brinkmann, Koyena Pal, Dmitrii Troitskii, Michael Ripa, Adam Belfki, Can Rager, Caden Juang, Aaron Mueller, Samuel Marks, Arnab Sen Sharma, Francesca Lucchetti, Nikhil Prakash, Carla E. Brodley, Arjun Guha, Jonathan Bell , Byron C. Wallace, David Bau. (2025). NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals ICLR. https://openreview.net/forum?id=MxbEiFRf39
  • Linearity of Relation Decoding in Transformer Language Models

    Citation: Evan Hernandez, Arnab Sen Sharma, Tal Haklay, Kevin Meng, Martin Wattenberg, Jacob Andreas, Yonatan Belinkov, David Bau. (2024). Linearity of Relation Decoding in Transformer Language Models ICLR. https://openreview.net/forum?id=w7LU2s14kE
  • Function Vectors in Large Language Models

    Citation: Eric Todd, Millicent L. Li, Arnab Sen Sharma, Aaron Mueller, Byron C. Wallace, David Bau. (2024). Function Vectors in Large Language Models ICLR. https://openreview.net/forum?id=AwyxtyMwaG

Related news