CodeSense: a Real-World Benchmark and Dataset for Code Semantic Reasoning PAPER
Published in ICLR, 2026
We propose CodeSense, the first benchmark with a spectrum of fine-grained code reasoning tasks from real-world software projects, revealing a clear performance gap in state-of-the-art LLMs for code semantic reasoning.
Recommended citation: Monoshi Kumar Roy, Simin Chen, Benjamin Steenhoek, Jinjun Peng, Gail Kaiser, Baishakhi Ray, and Wei Le. 2026. CodeSense: a Real-World Benchmark and Dataset for Code Semantic Reasoning. In The Fourteenth International Conference on Learning Representations (ICLR 2026), April 24–28, 2026, Singapore. https://benjijang.com/files/2026-04-23-codesense.pdf
