Mani Smaran Nair

Machine Learning Engineer
Mannheim, DE.

About

Highly skilled Machine Learning Engineer and MSc Data & Computer Science candidate with expertise in designing and deploying advanced Large Language Model (LLM) and Natural Language Processing (NLP) solutions. Proven ability to leverage Transformers, RAG, and production-ready ML pipelines (PyTorch, Hugging Face) to deliver scalable, high-impact AI applications. Adept at bridging research and deployment, driving innovation in applied NLP, conversational AI, and speech technology.

Work

DKFZ German Cancer Centre
|

Research Assistant

Heidelberg, Baden-Württemberg, Germany

Summary

Conducted advanced research in deep neural networks and variational inference to improve model generalization for regulatory region tasks at a leading cancer research center.

Highlights

Integrated a Foundational model using Borzoi-generated embeddings to train a deep neural network, significantly improving model generalization across diverse regulatory region tasks.

Developed a modular guide class (AmortizedNormal) to seamlessly combine LLM embeddings with variational inference in NumPyro, streamlining complex model architectures.

Performed scalable Stochastic Variational Inference (SVI) on over 20,000 DNA sequences, boosting ELBO convergence speed by ~30%.

Freudenberg Innovation
|

Machine Learning Engineer – GenAI Systems

Weinheim, Baden-Württemberg, Germany

Summary

Led the development and deployment of GenAI systems, focusing on real-time RAG pipelines and LLM-based solutions for knowledge extraction.

Highlights

Built and deployed a real-time Retrieval-Augmented Generation (RAG) pipeline with FastAPI, Sublime, and LangChain, retrieving enterprise knowledge from over 10,000 records.

Collaborated cross-functionally on internal GenAI initiatives, including LLM-based solutions, to enhance knowledge extraction capabilities.

Conducted user interviews, documented workflows, and facilitated workshops with cross-functional teams to gather requirements and ensure alignment for AI applications.

Developed comprehensive test cases and automated evaluation pipelines for pilot AI applications, ensuring robust performance and reliability.

Synthesized complex technical and business feedback into structured reports to guide strategic deployment and iterative improvements.

Heidelberg University
|

Research Assistant

Heidelberg, Baden-Württemberg, Germany

Summary

Analyzed patient-question interactions and enhanced predictive accuracy through advanced probabilistic modeling and data visualization techniques.

Highlights

Analyzed 15,000 patient-question interactions using advanced probabilistic response models to derive critical insights.

Improved predictive accuracy by 18% compared to baseline cognitive models, demonstrating enhanced model performance.

Utilized Seaborn and Matplotlib visualizations to conduct comprehensive model performance comparisons and present findings effectively.

Central Institute of Mental Health
|

Research Assistant

Mannheim, Baden-Württemberg, Germany

Summary

Managed and automated fMRI data preprocessing pipelines for a large patient cohort, significantly enhancing efficiency and data consistency.

Highlights

Processed fMRI data from over 15,000 patients leveraging BIDS pipelines, ICA, and BET for precise brain extraction.

Automated critical data preprocessing pipelines, reducing data preparation time by 50% and ensuring high consistency for research studies.

Education

Heidelberg University
Heidelberg, Baden-Württemberg, Germany

Master's

Data and Computer Science

Courses

Natural Language Processing

Machine Learning

Deep Learning

Artificial Intelligence

Probabilistic programming languages

Software Engineering

Computer Vision

LLM

Presidency University
Bangalore, Karnataka, India

Bachelor's

Computer Science & Engineering

Courses

Machine learning

DBMS

SQL

Certificates

Getting started AWS machine learning

Issued By

Coursera

Sentiment Analysis using NLTK

Issued By

Coursera

XOBIN Bootcamp

Issued By

XOBIN

Skills

Programming & Automation

Python, R, Shell scripting, REST APIs, Workflow orchestration.

AI/ML Frameworks

PyTorch, TensorFlow, NumPyro, LangChain, LangGraph, JAX, Deep Learning.

AI Systems

RAG pipelines, LLM fine-tuning, Agent-based automation, Predictive modeling.

Data Handling

Pandas, NumPy, SQL, NoSQL, Data preprocessing & visualization, Matplotlib, Seaborn.

Development Practices

Agile/Scrum, CI/CD, Git, Containerized environments (Docker).

Mentoring & Collaboration

Cross-functional training, AI adoption advocacy, Technical workshops.

Interests

Hobbies

Reading books, Football, Trekking.

Projects

Fullstack RAG Chatbot using Langchain

Summary

Developed a fullstack Retrieval-Augmented Generation (RAG) chatbot leveraging Langchain for intelligent document querying.

Food Captioning Using Diffusion-Augmented COCO Dataset

Summary

Implemented a food captioning model using a Diffusion-Augmented COCO Dataset to enhance image understanding and descriptive capabilities.