Arnab Sen Sharma
PhD Student

Research interests
- Interpretable AI
Education
- BS in Computer Science and Engineering, Shahjalal University of Science and Technology
Biography
Arnab Sen Sharma is a PhD student at the Khoury College of Computer Sciences at Northeastern University. He earned his bachelor’s degree in computer science and engineering from Shahjalal University of Science and Technology. He is affiliated with the Interpretable Neural Networks Lab (BauLab).
Sharma’s research area is machine learning. His work focuses on analyzing language models by disassembling them to understand their internal mechanisms, with the goal of enhancing or aligning these models for practical real-world applications. His faculty advisor is Dr. David Bau.
Recent publications
- Language Models use Lookbacks to Track Beliefs
  Citation: Nikhil Prakash, Natalie Shapira, Arnab Sen Sharma, Christoph Riedl, Yonatan Belinkov, Tamar Rott Shaham, David Bau, Atticus Geiger. (2025). Language Models use Lookbacks to Track Beliefs. CoRR, abs/2505.14685. https://doi.org/10.48550/arXiv.2505.14685
- NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals
  Citation: Jaden Fried Fiotto-Kaufman, Alexander Russell Loftus, Eric Todd, Jannik Brinkmann, Koyena Pal, Dmitrii Troitskii, Michael Ripa, Adam Belfki, Can Rager, Caden Juang, Aaron Mueller, Samuel Marks, Arnab Sen Sharma, Francesca Lucchetti, Nikhil Prakash, Carla E. Brodley, Arjun Guha, Jonathan Bell, Byron C. Wallace, David Bau. (2025). NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals. ICLR. https://openreview.net/forum?id=MxbEiFRf39
- Linearity of Relation Decoding in Transformer Language Models
  Citation: Evan Hernandez, Arnab Sen Sharma, Tal Haklay, Kevin Meng, Martin Wattenberg, Jacob Andreas, Yonatan Belinkov, David Bau. (2024). Linearity of Relation Decoding in Transformer Language Models. ICLR. https://openreview.net/forum?id=w7LU2s14kE
- Function Vectors in Large Language Models
  Citation: Eric Todd, Millicent L. Li, Arnab Sen Sharma, Aaron Mueller, Byron C. Wallace, David Bau. (2024). Function Vectors in Large Language Models. ICLR. https://openreview.net/forum?id=AwyxtyMwaG