Developed a novel benchmark for complex visual reasoning, inspired by ZeroBench, with an automated question-generation pipeline for training and evaluating vision-language models on challenging multi-step reasoning tasks.
Contributed to a reinforcement learning-based training framework designed to ground models' reasoning in visual inputs, reducing their reliance on textual cues.
Built an evaluation system for next-generation vision-language models, establishing a rigorous benchmark for visual-language understanding.