About
Recent B. Tech Data Science graduate with 6+ months of hands-on experience in cloud-based ETL pipelines and backend systems. Proven ability to design scalable data workflows, drive analytics through efficient engineering solutions, and develop robust machine learning models. Eager to leverage expertise in Python, SQL, PySpark, Azure Data Factory, and Databricks to contribute to innovative data-driven initiatives.
Work
Bengaluru, Karnataka, India
→
Summary
Contributed to the Enterprise Data Lake Implementation project, focusing on structured data ingestion, transformation, and governance to enhance data accessibility and integrity.
Highlights
Engineered and deployed an end-to-end dynamic data pipeline for MKPF and MSEG files, facilitating seamless data ingestion and processing.
Automated data pipelines using triggers and Logic Apps, enhancing monitoring for 10+ stakeholders and significantly reducing manual tracking efforts.
Developed and optimized ETL pipelines leveraging Azure Data Factory, Databricks, and Delta Lake, automating critical data workflows.
Authored PySpark scripts to support incremental and full data loads, boosting data freshness and pipeline performance.
Implemented robust data quality, logging, and monitoring best practices, ensuring reliability for enterprise reporting needs.
Awards
1st Place - Decoding Contest
Awarded By
Madanapalle Institute of Technology & Science, CITA Event
Achieved first place in a competitive coding contest, demonstrating strong problem-solving and programming skills.
Participant - Smart India Hackathon (SIH)
Participated in the Smart India Hackathon, contributing to innovative solutions for real-world problems.
Languages
English
Skills
Programming Languages
Python, SQL, PySpark.
Data Engineering
Azure Data Factory, Databricks, Delta Lake, Logic Apps, ETL Pipelines, Data Ingestion, Data Governance, Data Quality.
Machine Learning
Scikit-learn, TensorFlow, Classification, Regression, Random Forest, Gradient Boosting, Voting Classifier, TF-IDF.
Data Visualization & Analysis
Power BI, Pandas, Matplotlib, Seaborn.
Frameworks & Databases
FastAPI, MongoDB, Motor, Pydantic, Swagger, CRUD Operations.
Professional Skills
Problem-Solving, Communication, Teamwork, Adaptability, System Design, Data Modeling.
Interests
Data Community Engagement
Departmental Symposium Coordination (100+ participants), Data Oracle Club Member.