Siddharth Prasad

Data Scientist & ML Enthusiast
Delhi, IN.

About

Data Scientist and ML enthusiast with hands-on experience developing and deploying machine learning models for fraud detection, churn prediction, and NLP chatbots. Proficient in Python, SQL, machine learning, and data visualization tools (Power BI, Tableau); adept at turning complex datasets into actionable insights that support data-driven business decisions.

Work

Robustrix AI

NLP Developer (Project work)

Delhi, India

Summary

Developed and integrated NLP solutions for a news aggregation platform, enhancing content processing and real-time data insights.

Highlights

Engineered a news aggregation and sentiment analysis pipeline, processing approximately 5,000 articles daily using APIs and NLP.

Implemented advanced topic tagging and de-duplication algorithms, resulting in a 35% reduction in duplicate content.

Integrated enhanced NLP features into an AI dashboard, providing real-time updates for improved data analysis and decision-making.

Academic Projects (B.Tech Students - Freelance)

Freelance Project Developer

Delhi, India

Summary

Delivered diverse academic projects for B.Tech students, ensuring timely and successful deployment of custom solutions.

Highlights

Developed and delivered over 5 academic projects, including chatbots, voice assistants, games, and websites, for B.Tech students.

Achieved 100% on-time project delivery, ensuring successful deployment for student submissions and demonstrations.

Education

Delhi University
Delhi, India

Bachelor of Arts

Certificates

Data Science

Issued By

National Institute of Electronics and Information Technology

Python for Data Analysis

Issued By

Great Learning

Business Professional Programmer (O level)

Issued By

National Institute of Electronics and Information Technology

CCC

Issued By

National Institute of Electronics and Information Technology

Skills

Programming Languages

Python, SQL.

Machine Learning

Supervised Learning, Unsupervised Learning, Natural Language Processing (NLP), Fraud Detection, Churn Prediction, Recommendation Systems, Classification, Regression, Decision Trees, Random Forest, TF-IDF, Cosine Similarity, Naïve Bayes, Logistic Regression, SMOTE.

Frameworks & Libraries

LangChain, Hugging Face, OpenAI API, Flask, Streamlit, Regex.

Data Visualization

Tableau, Power BI.

Databases

PostgreSQL, MongoDB.

Cloud Platforms

AWS (Basic).

Projects

Credit Card Fraud Detection

Summary

Built an end-to-end pipeline for credit card fraud detection, leveraging Random Forest and SMOTE on a large transaction dataset.

Email Spam Classification

Summary

Developed an email spam classification system using TF-IDF, Naïve Bayes, and Logistic Regression to efficiently filter emails.

Real Estate Price Prediction

Summary

Developed a content-based recommender system for real estate price prediction, delivering personalized property suggestions.

Movie Recommendation System

Summary

Engineered and deployed a content-based movie recommender system using TF-IDF and cosine similarity on a dataset of 5,000 movies.

Crop Recommendation System

Summary

Designed a recommendation system to suggest optimal crops based on environmental parameters, enhancing agricultural decision-making.

AI Doctor Chatbot

Summary

Developed an NLP-based chatbot for medical Q&A using Python and regular-expression (Regex) pattern matching.