Arnab Sen Sharma

Interpretable AI

BS in Computer Science and Engineering, Shahjalal University of Science and Technology

Arnab Sen Sharma is a PhD student at the Khoury College of Computer Sciences at Northeastern University. He earned his bachelor’s degree in computer science and engineering from Shahjalal University of Science and Technology. He is affiliated with the Interpretable Neural Networks Lab (BauLab).

Sharma’s research area is machine learning. His work focuses on analyzing language model architectures by disassembling them, with the goal of improving or aligning these models for practical real-world applications. His faculty advisor is Dr. David Bau.

Published: May 20, 2025
Language Models use Lookbacks to Track Beliefs

Citation: Nikhil Prakash, Natalie Shapira, Arnab Sen Sharma, Christoph Riedl, Yonatan Belinkov, Tamar Rott Shaham, David Bau, Atticus Geiger. (2025). Language Models use Lookbacks to Track Beliefs. CoRR, abs/2505.14685. https://doi.org/10.48550/arXiv.2505.14685
Published: January 22, 2025
NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals

Citation: Jaden Fried Fiotto-Kaufman, Alexander Russell Loftus, Eric Todd, Jannik Brinkmann, Koyena Pal, Dmitrii Troitskii, Michael Ripa, Adam Belfki, Can Rager, Caden Juang, Aaron Mueller, Samuel Marks, Arnab Sen Sharma, Francesca Lucchetti, Nikhil Prakash, Carla E. Brodley, Arjun Guha, Jonathan Bell, Byron C. Wallace, David Bau. (2025). NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals. ICLR. https://openreview.net/forum?id=MxbEiFRf39
Published: January 16, 2024
Linearity of Relation Decoding in Transformer Language Models

Citation: Evan Hernandez, Arnab Sen Sharma, Tal Haklay, Kevin Meng, Martin Wattenberg, Jacob Andreas, Yonatan Belinkov, David Bau. (2024). Linearity of Relation Decoding in Transformer Language Models. ICLR. https://openreview.net/forum?id=w7LU2s14kE
Published: January 16, 2024
Function Vectors in Large Language Models

Citation: Eric Todd, Millicent L. Li, Arnab Sen Sharma, Aaron Mueller, Byron C. Wallace, David Bau. (2024). Function Vectors in Large Language Models. ICLR. https://openreview.net/forum?id=AwyxtyMwaG


Related news

MIT News: Large language models use a surprisingly simple mechanism to retrieve some stored knowledge
