Kopal Ghosh

Data Engineer
Sharjah, UAE.

About

An accomplished Data Engineer with 10 years of experience, specializing in designing, developing, and optimizing large-scale data pipelines within the banking sector. Proven expertise in Hadoop, Spark, Kafka, and cloud platforms (AWS, Azure), driving efficiency through advanced ETL processes, real-time data streaming, and big data analytics. Successfully reduced database response times by 20% and processed terabytes of data, contributing to significant business insights and scalable solutions for stakeholders.

Work

Emirates National Bank Of Dubai
|

Data Engineer

Sharjah, Sharjah, UAE

Summary

Currently leading data engineering development for critical banking applications, focusing on robust data pipelines and real-time streaming solutions to enhance operational efficiency and reporting.

Highlights

Engineered and optimized Hive queries and Unix shell scripts for comprehensive data analysis and automated report generation, ensuring timely delivery to stakeholders.

Developed high-performance Spark solutions using Scala, Dataframes, and Spark SQL API to accelerate data processing for interactive queries and ingesting terabytes of data into MongoDB.

Designed and implemented advanced ETL processes and data models, improving reporting efficiency and data availability through Kafka-based real-time streaming solutions.

Analyzed structured, semi-structured, and unstructured data to ensure quality and completeness, extracting critical business value for downstream applications.

Mastercard
|

Spark Developer

Sharjah, Sharjah, UAE

Summary

Developed and optimized Apache Spark-based data pipelines and REST APIs for Mastercard's card processing and settlement domain, enhancing data access and integration.

Highlights

Built and maintained robust data pipelines using Apache Spark for structured and unstructured data, ensuring high availability and integrity for critical transaction data.

Developed REST APIs for seamless data access and integration with third-party platforms, improving system interoperability and data exchange capabilities.

Optimized database indexing and query performance, reducing response times by 20% for critical card transaction data analysis.

Processed and transformed large datasets (AVRO, PARQUET, ORC, CSV, JSON, XML) using Spark, providing real-time serviceability insights during crisis periods.

Managed CI/CD processes by monitoring Spark jobs via Spark UI, pushing code to GIT, conducting rigorous code reviews, and automating deployments with Jenkins.

Mastercard
|

Spark Developer

Sharjah, Sharjah, UAE

Summary

Contributed to the ESA Settlement project, developing Spark-based solutions for end-to-end data integration and analysis, transforming raw data into actionable insights for banking operations.

Highlights

Led end-to-end data integration for the Autowrite project, pulling data from Windows-based applications via API and establishing connectivity with Synapse, JTux, and JNI.

Automated data verification using Postgres, Toad, and Unix, streamlining the data processing pipeline for settlement information files.

Implemented and managed CI/CD processes, including Jenkins pipelines, Git code management, peer reviews, and Chef changes for efficient deployment.

Designed and deployed NIFI pipelines for automated, scheduled data ingestion, processing large volumes of structured and unstructured datasets.

Developed Spark code to cleanse, merge, and process diverse datasets, ensuring data quality and readiness for storage in Hive, HBase, or S3 for upstream consumption.

Cognizant Technology Solutions
|

Spark Developer

Sharjah, Sharjah, UAE

Summary

Designed and implemented APIs for E-Wallets and Core Billing applications, enabling mobile payments and credit card billing while ensuring high-quality deliverables.

Highlights

Automated end-to-end API testing for E-Wallet platforms (Samsung Pay, Apple Pay, Amazon Pay, Android Pay) using Cucumber and Java, ensuring robust functionality for digital payment services.

Developed and automated data validation processes for Oracle and PostgreSQL databases, enhancing data integrity for financial transactions.

Managed and monitored Jenkins jobs for continuous integration and deployment, improving delivery efficiency for E-Wallets and Core Billing applications.

Utilized the CRAFT framework to design and execute batch jobs for over 600 test cases on credit card operations via TS2, ensuring comprehensive backend application testing.

Supported core billing operations by handling complex calculations for minimum payments, finance charges, late fees, and membership fees, contributing to accurate billing cycles.

Education

BITS PILANI
Pilani, Rajasthan, India

Post Graduate Program

Big Data Engineering

Northern India Institute of Technology
Ghaziabad, Uttar Pradesh, India

Bachelors in Technology

Information Technology

Awards

Gems Award

Awarded By

Emirates National Bank of Dubai

Recognized for outstanding contributions and performance.

Sense of Urgency Award (Tomcat Migration)

Awarded By

Mastercard

Awarded for dedicated focus and swift resolution in critical tomcat migration efforts.

A Great Job Award (Priceless Planet Tree)

Awarded By

Mastercard

Recognized for significant contributions to the 'Priceless Planet Tree' initiative.

Sense of Urgency Award (Autowrite Project)

Awarded By

Mastercard

Awarded for demonstrating exceptional focus and urgency on the Autowrite Project.

Certificates

AWS Certified (LinkedIn Certification)

Issued By

LinkedIn / AWS

Big Data Technologies Training (Spark, Scala, Hive, Kafka, Hadoop, MapReduce, Sqoop)

Issued By

Online Learning Center (OLC)

Skills

Cloud Platforms

AWS (EMR, S3, EC2, Elastic Search), Azure Databricks.

Databases

PostgreSQL, MongoDB, MySQL.

Programming Languages

Java, Scala, SQL.

Operating Systems

Linux, Windows.

Development Tools

Eclipse, IntelliJ, VSCode, Git, Jenkins, Spark UI, Chef, NIFI.

Methodologies

Agile, Scrum.

Big Data Technologies

Hadoop, Spark, Hive, HBase, Kafka, Sqoop.