About
Results-driven Data Engineer with 2+ years of experience in designing and optimizing scalable ETL pipelines and transforming complex unstructured data to power machine learning initiatives. Proficient in Python, SQL, and Azure, with expertise in NLP, feature engineering, and cloud-native data processing. Eager to transition into advanced AI/ML roles, leveraging a strong foundation in data engineering and applied machine learning to deliver innovative solutions.
Work
Tech Mahindra
|Associate Software Engineer
Bengaluru, Karnataka, India
→
Summary
As an Associate Software Engineer, led a team project on customer churn prediction, developing and deploying ML models with 87% accuracy from diverse data sources within the BFSI domain.
Highlights
Led a critical team project to predict customer churn, leveraging diverse structured and unstructured data sources within the BFSI domain.
Executed comprehensive data cleaning and feature engineering for ~50K customer interaction records (audio, text, demographic), enhancing dataset readiness for ML model training.
Performed NLP preprocessing on ~10K chat transcripts and extracted acoustic features from ~100+ call recordings, facilitating advanced textual and audio analysis for model development.
Developed and fine-tuned machine learning models using ensemble methods, achieving a predictive accuracy of ~87% in customer churn prediction.
Collaborated on the successful deployment of the final ML model prototype on Azure cloud, demonstrating its capabilities to key business stakeholders and enabling data-driven decision-making.
Gnanig Technologies
|Data Engineering Intern
Bhubaneswar, Odisha, India
→
Summary
As a Data Engineering Intern, designed and optimized ETL pipelines and Power BI dashboards, reducing manual reporting by 40% and improving ML data consistency by 25%.
Highlights
Engineered robust ETL pipelines using Pandas and SQL to process multi-source raw datasets, ensuring data readiness for both reporting and machine learning initiatives.
Created dynamic reports and dashboards in Power BI, resulting in a ~40% reduction in manual reporting effort and improved data accessibility for stakeholders.
Designed and implemented feature engineering pipelines that enhanced ML model input consistency by ~25%, optimizing data quality and model performance.
Facilitated REST API integration and backend data delivery, providing AI-ready data to power client-facing solutions and improve data-driven product functionality.
Education
Veer Surendra Sai University of Technology
→
Bachelor of Technology
Information Technology
Grade: CGPA: 8.54 out of 10
Languages
English
Certificates
Machine Learning Specialization
Issued By
Stanford Online
Azure AI Fundamentals (AI:900)
Issued By
Microsoft
Azure Fundamentals (AZ:900)
Issued By
Microsoft
Skills
Languages & Libraries
Python, Java, SQL, scikit-learn, XGBoost, Keras, TensorFlow, Pandas, NumPy.
AI/ML & Gen AI
NLP, Feature Engineering, RAG, Transformers, Model Evaluation, Prompt Engineering.
Data Engineering
ETL, Data Modeling, Data Warehousing, Data Pipelines, PySpark, Airflow.
Cloud and Tools
Git, Github, Maven, REST APIs, Docker, Azure.
ML Engineering
Model Deployment, Model Optimization.