SHRIYANKI DUBEY

Data Engineer

Jabalpur, IN.

About

Results-driven Data Engineer with 2 years of experience designing and implementing scalable data solutions across SQL, Apache Spark, Hadoop, HDFS, and AWS. Proven expertise in leveraging Snowflake, Databricks, and the modern data stack to optimize large-scale pipelines, automate ETL workflows, and drive business insights, resulting in significant improvements in efficiency and user engagement. Adept at building cloud-native, secure, and resilient data platforms to deliver high-performance data solutions in fast-paced enterprise environments.

Work

RELIANCE JIO PLATFORM LIMITED

DATA ENGINEER

Mumbai, Maharashtra, India

Dec 2023

→

Present

Summary

As a Data Engineer at Reliance Jio, I orchestrate large-scale data ingestion and processing, optimizing pipelines and delivering actionable insights for network reliability and user engagement.

Highlights

Orchestrated ingestion of 2TB UBR data from Oracle DB to Hive using Sqoop, optimizing pipeline runtime by 93% (from 10 hours to 40 minutes) and enabling faster identification of high-congestion building IDs.

Processed 50GB+ per partition of customer-level network data using Apache Spark (Scala) with structured Hive partitioning, identifying that 12% of cells caused 80% of outages, significantly improving network reliability.

Correlated 50GB+ batch customer ID data with cell names using Spark and Hive to identify high unavailability regions, automating root-cause analysis pipelines and reducing Mean Time to Resolution (MTTR) by 72% (from 18 hours to 5 hours).

Enabled tower health and coverage analytics for 200K+ assets, delivering actionable insights to 500+ cross-functional stakeholders across network, planning, and operations teams using AWS and Snowflake.

Enhanced the Apex DECK dashboarding suite by integrating daily, weekly, and monthly insights from STB, OTT, and Jio Gaming platforms using Snowflake, PySpark, and ADF pipelines, resulting in a 30% uplift in active user engagement.

Processed over 10M daily STB logs using PySpark to compute engagement KPIs, improving data quality by 40% and enabling accurate customer behavior insights.

Developed customer segmentation models from log analytics, revealing a 20% higher adoption rate of OTT platforms among users watching over 5 hours of YouTube per week.

Automated KPI reporting pipelines with Airflow, saving 20+ analyst hours monthly and reducing heatmap navigation time by 30% through optimization.

Cognizant

Software Development Intern (Remote)

Remote, Maharashtra, India

Jul 2023

→

Sep 2023

Summary

As a Software Development Intern at Cognizant, I developed and maintained Spring Boot-based backend services, optimizing database queries and integrating applications into CI/CD pipelines.

Highlights

Developed and maintained Spring Boot-based backend services in Java, implementing REST APIs and optimizing database queries to enhance system throughput.

Built and packaged applications using Maven, ensuring efficient dependency management and seamless integration into CI/CD pipelines.

Applied clean code practices and scalable service design principles to strengthen enterprise backend development.

Ouranos Robotics

Software Engineering Intern (Remote)

Remote, Maharashtra, India

Mar 2023

→

Jun 2023

Summary

As a Software Engineering Intern at Ouranos Robotics, I contributed to frontend development, focusing on UI/UX design and collaboration to enhance web module usability.

Highlights

Contributed to frontend development for robotics applications, focusing on UI/UX design to create user-friendly interfaces for internal dashboards.

Collaborated with cross-functional teams to implement responsive layouts and enhance usability across web modules.

Bridged design principles with functional software implementation through hands-on experience in UI/UX development.

Education

Gyan Ganga Institute of Technology and Sciences

Jabalpur, Madhya Pradesh, India

Aug 2019

→

May 2023

Bachelors of Technology

Electronics and Communication

Grade: 8.63 CGPA

Skills

Big Data Frameworks

Apache Spark, Hadoop, Apache NiFi, YARN.

Monitoring & Dashboarding

Grafana, Cloudera Observability.

Data Storage & Warehousing

Hive, MySQL.

Cloud Platforms

AWS, Azure (Databricks, Blob Storage).

Programming Languages

Java, Scala, Python, SQL.

Streaming & Messaging

Spark Streaming, Apache Kafka.

Projects

End-to-End Data Engineering Pipeline on Azure Databricks

Jan 2023

→

Sep 2023

Summary

A self-directed project focused on building a scalable, enterprise-grade data engineering pipeline on Microsoft Azure, processing both near real-time and batch data to support large-scale telecom use cases.