SURAJ SUVENDU CHATTERJEE

Machine Learning Engineer
Mumbai, IN.

About

Seeking a responsible career opportunity to fully utilize my training and skills, while constantly learning and making a significant contribution to the success of the company. Equipped with astute observational capabilities and the ability to challenge unique problems and hypotheses with an organized mindset through the scientific process.

Work

MONSOON CREDITTECH
|

Data Scientist

Summary

Developed and deployed end-to-end ML models for credit risk scorecards on time series data for Indian Banks & NBFCs (Application Scorecard, Behaviour Scorecard, Collection Scorecard, etc) with a strong focus on predictive modeling, explainability, and automated deployment on AWS. Achieved a ~90% AUC metric for Behavior Scorecard predicting the probability of default in the next 3 months. Optimized and implemented multiprocessing in data processing scripts which increased processing speed by 90% Led and managed a cross-functional team of 5 engineers, overseeing end-to-end delivery of multiple ML and AI projects—from initial planning and architecture design to model development, deployment, and post-launch optimization. Automated data & model pipelines and model scoring systems using AWS Lambda, S3, and SageMaker, improving model turnaround and scaling capabilities. Processed and analyzed large-scale financial and behavioral datasets using Pandas, NumPy, and SQL, enabling robust feature engineering and anomaly detection and showcased the insights via visualizations for client and stakeholder's understanding. Gained deep familiarity and implemented statistical ML models using PyData Stack (Pandas, Numpy, Scikit-Learn), SQL, Bayesian Hyperparameter Tuning (Optuna), Ensemble Boosted Trees (LGBM, XGBoost), Feature Selection (RFE, Boruta) and deployment technologies such as RESTful Django, FastAPI, Flask, Docker & AWS and experimented with TensorFlow and PyTorch for advanced use cases. Led various experiments in deep learning, while continuously optimizing performance through hyperparameter tuning. Authored detailed internal documentation, model explainability reports, and stakeholder presentations showcasing model insights, risk stratification, and performance over time. Contributed to the adoption of AI-first approaches within the organization by identifying use cases for automation and Generative Al experimentation.

UNESCO CHAIR, INSTITUTE OF TECHNOLOGY, TRALEE IRELAND
|

Data Scientist (Freelance)

Summary

Obtained data through Web Scraping using Selenium, Scrapy, and BeautifulSoup from multinational banks' websites Maintained and preprocessed the data to remove anomalies, treat outliers, etc for further analysis using Python Performed Statistical, Contextual (using TFIDF, IDF & Glove), and Sentiment analysis (using AFINN, Vader) through Natural Language Processing (NLP) to see how Multinational Banks are supporting the Sports for development sector Created interesting visualizations on the processed data like Wordclouds, Pie Charts, etc. using Seaborn and Matplotlib libraries Designed a Bitcoin price prediction model using Deep Learning techniques such as LSTM, RNN, etc, and statistical modeling techniques like ARIMA Performed sentiment analysis on Bitcoin news articles using unsupervised Sentiment analysis techniques such as Vader, Textblob, AFINN, etc., and incorporated the same into the Bitcoin deep learning model

COLLINSON
|

Machine Learning Engineer

Summary

Designed and implemented Generative AI solutions, including LLM-based intelligent chatbots using RAG (Retrieval-Augmented Generation) architecture with AWS Bedrock, Llama2, Claude, GPT and vector databases (FAISS). Developed scalable ML pipelines for traditional ML, NLP and GenAI models using Amazon SageMaker, Bedrock, EC2, Lambda, ECS, ECR, S3, Terraform and Snowflake adhering to MLOps best practices for versioning, CI/CD, and model monitoring. Integrated serverless backend APIs for AI services using AWS Lambda and deployed them using API Gateway and Terraform. Contributed to real-time AI monitoring dashboards and performance reporting using Python-based visualization libraries, Streamlit and Amazon CloudWatch logs. Explored and implemented NLP, AI & LLM techniques in NLP tasks like entity resolution, question answering, text summarization, and semantic search, with a focus on scalability, latency, and response accuracy. Embraced MLOps workflows, using Github Actions, Docker and Terraform for containerization and infrastructure provisioning, and monitored deployments through CloudWatch. Regularly presented technical findings and PoC results to stakeholders and cross-functional teams through clear visualizations and demos.

Education

University of Mumbai

Bachelor of Engineering

Information Technology

Grade: 9.28/10

Courses

Data Structures and Algorithms

Database Management System

Artificial Intelligence

Cloud Computing and Services

Big Data Analytics

Internet Programming

Computer Networks

Software Engineering with Project Management

Awards

Winner of the Coding competition held at St. Francis Institute of Technology (SFIT) for IT Colloquium

Awarded By

St. Francis Institute of Technology (SFIT)

Certificates

Oracle Cloud Infrastructure 2024 Generative AI Certified Professional

Issued By

Oracle

Skills

PROGRAMMING

R, Python, SQL, C, C++, Java, HTML5, CSS3, Javascript.

DATABASE SYSTEMS

MySQL, PostgreSQL, Oracle.

TECHNOLOGIES

Django, Docker, Flask, Jupyter Notebook, Excel, Powerpoint, AWS - Bedrock, Sagemaker, Lambda, S3, NLP, Time Series Forecasting and analysis, Terraform, Generative AI (GenAI), LLM, Large Language Model, Snowflake.

GENERAL

Android Development, Web Development, Data Structures & Algorithms, Machine Learning, AI.

Interests

Extra Curricular Activities

Technical Executive at Information Technology Students' Association (ITSA) in SFIT: 2018 - 2019, Participated in Technical & Business Event (Pragati 2019 and 2020) held at SFIT with topics such as Smart Helmet (IOT project for multimedia helmet) & Campus Live (intercampus website for event and job registration) respectively, Participated in Thadomal Shahani Engineering College (TSEC) hackathons in 2019 and 2020 with topics such as Fire Buzzer (web app for prevention and mitigation of fire hazards) and Shopify (an e-commerce website with an advanced recommendation system using NLP of customer reviews and ML) respectively, Participated in Smart India Hackathon (SIH) 2020 with Crime chatbot (an AI chatbot for quick crime registration and processing).

Certification Courses

Deep Learning Specialization – deeplearning.ai, Coursera, Tensorflow in Practice Specialization – deeplearning.ai, Coursera, Data Visualization with python – Cognitive Class, Sentiment Analysis with Deep Learning using BERT – Coursera, Accelerating Deep Learning with GPU – Cognitive Class.

Projects

ADVANCED RECOMMENDATION SYSTEM USING SENTIMENT ANALYSIS - ML, WEB

Summary

Developed an Advanced product recommendation system using Machine Learning clustering techniques like KNN and NLP Processed customer reviews using NLP, generated sentiments of reviews using Vader, trained an ML model to recommend products, and finally used Flask to provide a web application for the same and achieved a precision of 85.75% on the model Wrote a research paper based on the project which is available here: https://easychair.org/publications/preprint/LHFb

HANDWRITTEN DIGIT RECOGNITION - DEEP LEARNING

Summary

An ML application made using python language for recognizing handwritten digits using CNN by using the MNIST dataset

NEURAL STYLE TRANSFER - DEEP LEARNING

Summary

Performing neural style transfer of an artistic image to a regular image using transfer learning by using the VGG-19 model

SENTIMENT CLASSIFIER - DEEP LEARNING

Summary

A project to implement sentiment analysis using the IMDb review dataset with the help of TensorFlow Keras library and Recurrent Neural Network (RNN) and Long Short Term Memory (LSTM)

IMAGE RECOGNITION AND VERIFICATION - DEEP LEARNING

Summary

Created a project to implement face recognition and verification using Convolutional Neural Network (CNN)