Shruti Jain

Data Engineer
Noida, IN.

About

Highly analytical and results-oriented Data Engineer with a Bachelor of Technology in Computer Science, specializing in AI. Proven ability to design, implement, and optimize end-to-end data pipelines leveraging cloud platforms like AWS and Azure Databricks, enhancing data scalability, reliability, and real-time analytics. Seeking to apply expertise in big data technologies, ETL workflows, and data visualization to drive impactful business decisions in a dynamic tech environment.

Work

Koantek Information Systems India Pvt. Ltd.
|

Data Engineer

Remote, Not Applicable, India

Summary

Leading large-scale data migration and ETL workflow optimization initiatives to enhance system scalability and cloud readiness for critical production environments.

Highlights

Led the successful migration of large-scale production data jobs from on-premise HDFS to Azure Databricks, significantly improving system scalability and performance for critical operations.

Optimized ETL workflows by upgrading and deploying processes from Talend 7 to Talend 8, integrating builds into Azure Blob Storage to enhance reliability and cloud readiness.

To The New
|

Data Engineer

Noida, Uttar Pradesh, India

Summary

Designed and implemented robust end-to-end data pipelines for real-time data processing and analytics, delivering actionable insights to business stakeholders.

Highlights

Designed and implemented an end-to-end data pipeline leveraging AWS S3, Glue (PySpark), and Kafka streaming to process both incremental and real-time employee data under the Medallion Architecture framework.

Delivered curated datasets to PostgreSQL and developed SQL-driven Grafana dashboards, enabling real-time monitoring and actionable insights for business stakeholders.

InternQ
|

Data Analyst Intern

Remote, Not Applicable, India

Summary

Conducted comprehensive data analysis and visualization to drive data-driven decision-making and support customer retention strategies.

Highlights

Performed comprehensive Exploratory Data Analysis (EDA), data cleaning, and visualization using Python (pandas, matplotlib) to extract key insights, directly supporting data-driven decision-making.

Executed a Customer Churn Analysis project, applying data preprocessing (encoding, standardization) and visual analytics to identify critical churn drivers and inform strategic retention initiatives.

Education

Teerthanker Mahaveer University
Moradabad, Uttar Pradesh, India

Bachelor of Technology

Computer Science, Major in AI

Grade: CGPA - 9.1

St.R. C Convent School
Shamli, Uttar Pradesh, India

High School Diploma (XII)

General Studies

Grade: 93.8%

St.R. C Convent School
Shamli, Uttar Pradesh, India

High School Diploma (X)

General Studies

Grade: 89.9%

Skills

Programming Languages

Python, SQL.

Big Data & ETL Tools

PySpark, Apache Kafka, Hadoop, Apache Hive, Talend, Apache Airflow.

Cloud Platforms

AWS (S3, Glue, EC2), Azure Databricks.

Databases

PostgreSQL, MySQL.

Visualization Tools

Power BI, Tableau, Grafana, Streamlit.

Projects

Employee Data Management Pipeline

Summary

Developed an end-to-end data pipeline leveraging AWS Glue, PySpark, S3, and Kafka for efficient data management and real-time KPI visualization.

HR Analytics Dashboard

Summary

Designed and developed an interactive HR dashboard in Power BI to track key workforce metrics and enable data-driven decision-making.