Shivam Koli

Data Engineer | Machine Learning Specialist
Pune, India

About

Highly motivated Data Engineer and Machine Learning Specialist with expertise in designing and implementing scalable ETL/ELT pipelines, managing data warehouses, and building robust ML models. Proven ability to optimize big data workflows, improve data quality, and deliver actionable insights through data visualization and business intelligence tools such as Power BI and Tableau, supporting informed decision-making and operational efficiency.

Work

TIBIL COMPUTER SOLUTIONS PRIVATE LIMITED

Intern - Data Engineer

Pune, Maharashtra, India

Summary

As a Data Engineer Intern, designed and implemented scalable ETL/ELT pipelines, optimized big data workflows, and developed Power BI dashboards to deliver real-time operational insights.

Highlights

Engineered and deployed scalable ETL/ELT data pipelines using Python and PySpark, automating data ingestion from diverse structured and unstructured sources into data warehouse and data lake environments and reducing manual effort by 40%.

Optimized big data workflows through advanced data cleaning, transformation, and feature engineering on datasets exceeding 100,000 rows, improving data consistency for analytics and ML pipelines.

Executed complex SQL queries (PostgreSQL, MySQL) for robust data validation, schema integration, and Business Intelligence reporting, supporting critical enterprise-wide analytics use cases.

Developed and deployed interactive Power BI dashboards, incorporating DAX measures, KPIs, and drill-through reports to deliver real-time insights on operational performance and key business trends.

Education

SKN Sinhgad Institute of Technology and Science, Lonavala
Pune, Maharashtra, India

B.E.

Computer Engineering

Languages

English

Certificates

Advanced Data Science with AI-ML

Issued By

upGrad, Pune

Data Analytics Job Simulation

Issued By

Deloitte

Skills

Programming & Databases

Python, SQL, PostgreSQL, MongoDB.

Data Science & Machine Learning

Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn, PySpark, NLP, TensorFlow, PyTorch, EDA, Feature Engineering, ML Models (Random Forest, XGBoost, Gradient Boosting).

Data Engineering & Cloud

ETL/ELT Pipelines, Data Warehousing, Data Lake, Apache Airflow, AWS, Azure, CI/CD Pipelines, Git, GitHub, Postman.

Business Intelligence & Tools

Power BI, Tableau, MS Excel, Power Query, DAX, Data Modeling.

Professional Skills

Analytical Thinking, Problem Solving, Stakeholder Communication, Team Collaboration.

Projects

AI-Powered Learning Assistant Chatbot

Summary

Engineered an intelligent chatbot leveraging LLM-based NLP for personalized learning pathways. The project involved integrating various data sources and APIs to provide dynamic content and automated reporting.

Heart Disease Prediction Using Machine Learning

Summary

A machine learning project focused on predicting heart disease, involving comprehensive data analysis and model development to achieve high accuracy and generalization.

Customer Churn Analysis

Summary

Performed in-depth Exploratory Data Analysis (EDA) on a large customer dataset to identify key drivers of churn and evaluate machine learning models for predictive performance.

Coffee Shop Sales Dashboard

Summary

Developed an interactive Power BI dashboard to monitor and analyze sales, orders, and product trends across multiple retail store locations, facilitating data-driven decision-making.
