Curriculum Vitae
Ph.D. student seeking machine learning research internship.
Research interests are the intersection of machine learning and software engineering.
Experience in deep learning, machine learning, program analysis, software development.
Education
PhD Computer Science
Aug 2019 - May 2024 (expected)
Iowa State University, Program Analysis Lab
Ames, IA
- Research interests: deep learning-based vulnerability detection, graph neural networks, ML for SE.
MS Computer Science
Aug 2019 - Dec 2021
Iowa State University, Program Analysis Lab
Ames, IA
BS Computer Science
Aug 2016 - May 2019
Bob Jones University
Greenville, SC
- Magna Cum Laude honors (GPA 3.84/4.00).
Professional Experience
Research Assistant
Aug 2020 - Present
Iowa State University, Program Analysis Lab
Ames, IA
- First author, "An Empirical Study of Deep Learning Models for Vulnerability Detection" (accepted ICSE 2023; 26% acceptance rate).
- First author, "Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability Detection." (accepted ICSE 2024).
- Contributing author, "TRACED: Execution-aware Pre-training for Source Code" (accepted ICSE 2024).
- Contributing author, "Validating static warnings via testing code fragments" (accepted ISSTA 2021; 27% acceptance rate).
- Conducted novel research resulting in 2 first-author conference papers in submission to ICSE24, one accepted and one presently under review.
- Collaborated with ARiSE lab at Columbia University and CERT lab @ Carnegie-Mellon University.
- Improved experiment iteration time by building tools for static analysis (tree-climber) and dynamic analysis and code generation (pal-tools).
- Enabled collaboration on experiments by acquiring and maintaining bug benchmarks for use in experiments.
Research Intern
May 2023 - Dec 2023
- Conducted research on improving large language models such as Codex using reinforcement learning.
Software Developer Intern
2018 - 2021
Ag Leader Technology, AgFiniti team
Ames, IA
- Fall 2021: Democratized public datasets by adding GIS capability for geolocation and remote sensing.
- Summer 2020: Widened customer reach by integrating AgFiniti with John Deere data platform.
- Summer 2019: Improved UX by modernizing satellite mapping interface with Javascript/Vue.
- Summer 2018: Enabled agronomic analysis by maintaining a domain-specific language using Antlr.
Teaching Assistant
Jan 2020 - May 2020
Iowa State University, COM S 227: Object-oriented Programming (Java)
Ames, IA
Freelance Software Developer
Aug 2019 - Aug 2021
Roney Innovations (ecommerce retailer)
Des Moines, IA
- Collaborated with 2 other developers to create Amazon product listing web app using C#, ASP.NET Core, SQL Server, and Azure cloud services.
Publications
Benjamin Steenhoek, Michele Tufano, Neel Sundaresan, and Alexey Svyatkovskiy. 2025. Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation. In 2025 Sixth International Workshop on Deep Learning for Testing and Testing for Deep Learning (DeepTest ’25), April 27–May 3, 2025, Ottawa, Canada.
Benjamin Steenhoek, Siva Sivaraman, Renata Saldivar, Yevhen Mohylevskyy, Roshanak Zilouchian Moghaddam, and Wei Le. 2025. Closing the Gap: A User Study on the Real-world Usefulness of AI-powered Vulnerability Detection & Repair in the IDE. In 2025 IEEE/ACM 46th International Conference on Software Engineering (ICSE ’25), April 27–May 3, 2025, Ottawa, Canada.
Benjamin Steenhoek. 2024. Understanding and improving deep learning models for vulnerability detection (Publication No. 31562057). Available from Dissertations & Theses @ Iowa State University; ProQuest Dissertations & Theses Global.
Benjamin Steenhoek, Md Mahbubur Rahman, Monoshi Kumar Roy, Mirza Sanjida Alam, Hengbo Tong, Swarna Das, Earl T. Barr, and Wei Le. 2024. To Err is Machine: Vulnerability Detection Challenges LLM Reasoning. ArXiv.
Benjamin Steenhoek, Hongyang Gao, and Wei Le. 2024. Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability Detection. In 2024 IEEE/ACM 46th International Conference on Software Engineering (ICSE ’24), April 14–20, 2024, Lisbon, Portugal.
Yangruibo Ding, Benjamin Steenhoek, Kexin Pei, Gail Kaiser, Wei Le, and Baishakhi Ray. 2024. TRACED: Execution-aware Pre-training for Source Code. In 2024 IEEE/ACM 46th International Conference on Software Engineering (ICSE ’24), April 14–20, 2024, Lisbon, Portugal. ACM, New York, NY, USA, 12 pages.
Benjamin Steenhoek, Md Mahbubur Rahman, Shaila Sharmin, & Wei Le. (2023). Do Language Models Learn Semantics of Code? A Case Study in Vulnerability Detection. ArXiv.
Benjamin Steenhoek, Md Mahbubur Rahman, Richard Jiles, and Wei Le. 2023. An Empirical Study of Deep Learning Models for Vulnerability Detection. In Proceedings of the 45th International Conference on Software Engineering (ICSE 2023).
Guo, X., Joshy, A. K., Steenhoek, B., Le, W., & Flynn, L. (2023). A Study of Static Warning Cascading Tools (Experience Paper). ArXiv.
Steenhoek, Benjamin. (2022). Refactoring programs to improve the performance of deep learning for vulnerability detection (Poster). Presented at: Iowa State University 6th Annual Research Day.
Steenhoek, Benjamin. (2021). Refactoring programs to improve the performance of deep learning for vulnerability detection (Publication No. 28648161). Available from Dissertations & Theses @ Iowa State University; ProQuest Dissertations & Theses Global.
Ashwin Kallingal Joshy, Xueyuan Chen, Benjamin Steenhoek, and Wei Le. 2021. Validating static warnings via testing code fragments. In Proceedings of the 30th ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2021). Association for Computing Machinery, New York, NY, USA, 540–552.
My Projects
Primary developer
Team project
- animal-cognitive: Deep reinforcement learning models with embodied animal cognition (PyTorch/rllib).
- precise-interrupts: Reproducing a historical interrupt handling paper in ARM architecture (C++/gem5).
Technical Skills
- Programming Languages: Proficient in Python and C#. Knowledge of C++, Java, JavaScript, SQL.
- Machine Learning & data scraping: PyTorch, rllib, pandas, numpy, Selenium, beautifulsoup.
- Web Development: Vue, ASP.NET Core, .NET Framework, SQL Server, Azure Functions, ACI, VMs, ML Studio.
- Computer architecture and program analysis: Antlr, LLVM, Intel Pin, gem5, abstract interpretation, fuzzing.
- DevOps: Git, Azure DevOps, and CI/CD, Slurm batch processing, Linux server administration.
Leadership
- Science education outreach at Greenville County Juvenile Detention, Fall 2018/Spring 2019
- Vice president of Phi Beta Chi society, Spring 2018