MetroSense: Vision-Language Assistant for Navigation Aid in Urban Metro Systems
Summary
Developed a web-based vision-language assistance platform that helps visually impaired individuals navigate the complex Delhi Metro system. Fine-tuned a YOLOv11 object detection model on a custom annotated dataset with targeted augmentations, achieving 65.1% mAP@50 for identifying environmental elements from real-time image captures. Integrated the Llama 3.2 90B Vision model for Visual Question Answering, engineered with context-rich few-shot prompting and tuned decoding parameters to reach a BERTScore F1 of 0.85, delivering semantically accurate, context-aware, voice-synthesized responses to user queries for improved safety and autonomy.
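
The detection stage can be reproduced in outline with the Ultralytics API, which supports YOLO11 fine-tuning and reports mAP@50 after validation. The sketch below is illustrative only: the checkpoint size, the dataset config name metro.yaml, and the augmentation values are assumptions, not the project's actual settings.

```python
# Minimal fine-tuning sketch using the Ultralytics YOLO API.
# "metro.yaml" and the augmentation values are illustrative placeholders,
# not the project's actual dataset config or hyperparameters.
from ultralytics import YOLO

# Start from a pretrained YOLO11 checkpoint and fine-tune it on the
# custom annotated metro dataset described above.
model = YOLO("yolo11s.pt")
model.train(
    data="metro.yaml",   # dataset config: class names, train/val image paths
    epochs=100,
    imgsz=640,
    hsv_v=0.4,           # brightness jitter for varied station lighting
    degrees=10.0,        # small rotations for handheld camera tilt
    fliplr=0.5,          # horizontal flips
)

# Evaluate on the validation split; metrics.box.map50 is the mAP@50 figure.
metrics = model.val()
print(f"mAP@50: {metrics.box.map50:.3f}")
```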

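For the VQA stage, a hedged sketch of the few-shot prompting and decoding setup follows, assuming an OpenAI-compatible endpoint serving Llama 3.2 90B Vision. The base URL, model id, example Q&A pair, and decoding values are placeholders, not the project's actual configuration.

```python
# Sketch of the few-shot VQA call against an OpenAI-compatible endpoint
# hosting Llama 3.2 90B Vision. base_url, model id, the example Q&A, and
# the decoding parameters below are illustrative placeholders.
import base64
from openai import OpenAI

client = OpenAI(base_url="https://api.example-provider.com/v1", api_key="...")

def encode_image(path: str) -> str:
    with open(path, "rb") as f:
        return base64.b64encode(f.read()).decode()

# Context-rich system prompt with a few-shot Q&A example grounding the
# model in the metro-navigation domain before the user's live query.
SYSTEM = (
    "You are a navigation assistant for visually impaired riders of the "
    "Delhi Metro. Answer concisely, describing obstacles, signage, and "
    "directions relative to the camera.\n\n"
    "Example:\nQ: Where is the nearest gate?\n"
    "A: An AFC gate is about two meters ahead, slightly to your left."
)

response = client.chat.completions.create(
    model="meta-llama/Llama-3.2-90B-Vision-Instruct",  # provider-specific id
    messages=[
        {"role": "system", "content": SYSTEM},
        {"role": "user", "content": [
            {"type": "text", "text": "What is in front of me?"},
            {"type": "image_url", "image_url": {
                "url": f"data:image/jpeg;base64,{encode_image('frame.jpg')}"}},
        ]},
    ],
    temperature=0.2,   # low temperature for consistent, grounded answers
    top_p=0.9,
    max_tokens=150,
)
print(response.choices[0].message.content)
```

The low temperature and capped token budget favor short, repeatable answers, which matters when the text is spoken aloud to the user.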
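The BERTScore F1 cited in the summary can be computed with the bert-score package; the candidate and reference answers below are illustrative stand-ins for the project's evaluation pairs.

```python
# Sketch of the response-quality evaluation with BERTScore. The
# candidate/reference pairs are illustrative, not project data.
from bert_score import score

candidates = ["The AFC gate is two meters ahead on your left."]        # model answers
references = ["An AFC gate is about two meters in front, left side."]  # ground truth

# F1 is the semantic-similarity score reported in the summary.
P, R, F1 = score(candidates, references, lang="en", verbose=False)
print(f"BERTScore F1: {F1.mean().item():.2f}")
```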