B. Kishore
Data Engineer
About
Results-driven Data Engineer with 1 year and 7 months of hands-on experience building, optimizing, and automating robust data pipelines. Leverages big data frameworks, ETL tools, and cloud technologies, including AWS and Snowflake, to design and implement efficient, reliable data migration and automation strategies. Proven ability to improve data delivery, reduce manual effort, and safeguard data integrity for critical business operations.
Work
Summary
Currently serving as a Data Operations Analyst, responsible for designing, maintaining, and automating large-scale data pipelines and managing ETL workflows to ensure timely, accurate data delivery.
Highlights
Engineered and automated robust, large-scale data pipelines using Spark, PySpark, Informatica, and AWS Glue, ensuring high data throughput and reliability.
Oversaw end-to-end ETL workflows, delivering timely, accurate data to critical business operations.
Used SQL and Python for data extraction, cleansing, and validation, orchestrating complex workflows with Apache Airflow and AWS Step Functions (see the sketch following this list).
Facilitated seamless data migrations, constructing scalable S3-based data lakes and integrating diverse data sources with Snowflake for advanced analytics.
Spearheaded advanced monitoring and automation solutions, reducing manual operational tasks by 40% and improving system efficiency.
Collaborated cross-functionally with stakeholders to deliver actionable data insights and established comprehensive documentation for best practices, improving team efficiency and knowledge transfer.
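A minimal, illustrative sketch of the Airflow-plus-Glue orchestration pattern referenced above. Every identifier here (DAG id, Glue job name, schedule, task names) is a hypothetical placeholder, not a detail taken from the role itself.

```python
# Illustrative Airflow DAG: runs a daily Glue cleansing job, then a
# Python validation step. All names below are hypothetical placeholders.
from datetime import datetime, timedelta

from airflow import DAG
from airflow.operators.python import PythonOperator
from airflow.providers.amazon.aws.operators.glue import GlueJobOperator


def validate_row_counts(**context):
    # Placeholder: a real pipeline would compare source and target row
    # counts (e.g. via SQL queries) and raise on mismatch, letting
    # Airflow's retry and alerting hooks take over.
    pass


with DAG(
    dag_id="daily_telecom_ingest",          # hypothetical DAG id
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=False,
    default_args={"retries": 2, "retry_delay": timedelta(minutes=5)},
) as dag:
    cleanse = GlueJobOperator(
        task_id="run_glue_cleanse",
        job_name="telecom-cleanse-job",     # hypothetical Glue job name
    )
    validate = PythonOperator(
        task_id="validate_row_counts",
        python_callable=validate_row_counts,
    )
    cleanse >> validate
```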
Summary
Served as a Data Engineer on a telecom-domain project, developing and optimizing pipelines for high-volume cloud data ingestion, transformation, and loading.
Highlights
Developed and optimized robust data pipelines for efficient ingestion, transformation, and loading of high-volume telecom data into cloud environments and Snowflake.
Architected and implemented high-performance ETL workflows using PySpark and AWS Glue.
Directed critical data migration efforts to AWS S3 and Snowflake.
Automated daily data loads and comprehensive health checks with Apache Airflow, significantly improving the reliability and timeliness of KPI delivery.
Partnered with client teams to define stringent data quality rules and executed thorough data validation in SQL and Python, ensuring data integrity (sketched below).
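A minimal PySpark sketch of the validation pattern described above: enforce simple data quality rules before writing clean telecom records back to the lake. The S3 paths, column names, and 1% reject threshold are illustrative assumptions, not project specifics.

```python
# Illustrative data quality gate for a telecom ingestion pipeline.
# Paths, columns, and the threshold are hypothetical placeholders.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("telecom_quality_check").getOrCreate()

raw = spark.read.parquet("s3://example-bucket/raw/telecom/")

# Example rules: subscriber id must be present, usage must be non-negative.
valid = raw.filter(F.col("subscriber_id").isNotNull() & (F.col("usage_mb") >= 0))
rejected = raw.subtract(valid)

# Fail the run loudly if too many rows violate the rules.
reject_rate = rejected.count() / max(raw.count(), 1)
if reject_rate > 0.01:
    raise ValueError(f"Reject rate {reject_rate:.2%} exceeds 1% threshold")

valid.write.mode("overwrite").parquet("s3://example-bucket/clean/telecom/")
```

Failing fast at this step keeps bad records out of Snowflake, so downstream KPI loads only ever see validated data.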
Skills
Programming/Querying
SQL, Python, PySpark.
Big Data
Apache Spark, Hadoop.
ETL Tools
Informatica, AWS Glue.
Orchestration
Apache Airflow, AWS Step Functions.
Cloud Platforms
AWS (Lambda, S3, Glue, Step Functions), Snowflake.
Development Tools
Google Colab, Git, JIRA.
Core Areas
Data Pipelines, Data Migration, Data Processing, Automation.