Cherishma Cherukuru

Data Engineer | Data Scientist
Srikalahasthi, India

About

Recent B.Tech Data Science graduate with 6+ months of hands-on experience in cloud-based ETL pipelines and backend systems. Proven ability to design scalable data workflows, drive analytics through efficient engineering solutions, and develop robust machine learning models. Eager to apply expertise in Python, SQL, PySpark, Azure Data Factory, and Databricks to innovative data-driven initiatives.

Work

Rockwell Automation

Data Engineer Intern

Bengaluru, Karnataka, India

Summary

Contributed to the Enterprise Data Lake Implementation project, focusing on structured data ingestion, transformation, and governance to enhance data accessibility and integrity.

Highlights

Engineered and deployed an end-to-end dynamic data pipeline for MKPF and MSEG files, facilitating seamless data ingestion and processing.

Automated data pipelines using triggers and Logic Apps, enhancing monitoring for 10+ stakeholders and significantly reducing manual tracking efforts.

Developed and optimized ETL pipelines leveraging Azure Data Factory, Databricks, and Delta Lake, automating critical data workflows.

Authored PySpark scripts to support incremental and full data loads, boosting data freshness and pipeline performance.

Implemented robust data quality, logging, and monitoring best practices, ensuring reliability for enterprise reporting needs.

Education

Madanapalle Institute of Technology & Science
Madanapalle, Andhra Pradesh, India

Bachelor of Technology

Data Science

Courses

Python Programming

Probability & Statistics

Object-Oriented Programming (OOP)

Data Visualization

Database Management Systems

Machine Learning

Awards

1st Place - Decoding Contest

Awarded By

Madanapalle Institute of Technology & Science, CITA Event

Achieved first place in a competitive coding contest, demonstrating strong problem-solving and programming skills.

Participant - Smart India Hackathon (SIH)

Participated in the Smart India Hackathon, contributing to innovative solutions for real-world problems.

Languages

English

Certificates

End-to-End Data Engineering with Azure

Issued By

Udemy

YUKTHI Innovation Challenge 1.0
Data Science

Issued By

IBM

Python for Data Science

Issued By

Internshala

Power BI for Beginners

Issued By

Skill Up

Skills

Programming Languages

Python, SQL, PySpark.

Data Engineering

Azure Data Factory, Databricks, Delta Lake, Logic Apps, ETL Pipelines, Data Ingestion, Data Governance, Data Quality.

Machine Learning

Scikit-learn, TensorFlow, Classification, Regression, Random Forest, Gradient Boosting, Voting Classifier, TF-IDF.

Data Visualization & Analysis

Power BI, Pandas, Matplotlib, Seaborn.

Frameworks & Databases

FastAPI, MongoDB, Motor, Pydantic, Swagger, CRUD Operations.

Professional Skills

Problem-Solving, Communication, Teamwork, Adaptability, System Design, Data Modeling.

Interests

Data Community Engagement

Departmental Symposium Coordination (100+ participants), Data Oracle Club Member.

Projects

Student Management System

Summary

Developed a full-featured backend system to manage student data with comprehensive CRUD operations using Python, FastAPI, and MongoDB.

Heart Disease Prediction

Summary

Developed an ensemble machine learning model to predict heart disease from clinical datasets.

Spam Email Prediction

Summary

Built a spam email prediction model using Random Forest to accurately classify emails as spam or ham.