Praveen Kumar

Data Engineer
Dallas, US.

About

Highly accomplished Data Engineer with over 4 years of expertise in designing, developing, and optimizing high-performance ELT/ETL pipelines using Snowflake. Proven ability to integrate and transform large-scale datasets, implement robust data quality and governance practices, and enhance system performance for significant cost optimization. Adept at leveraging AWS for cloud data solutions and collaborating cross-functional teams to deliver scalable, secure, and analytics-ready data critical for regulatory reporting and real-time processing.

Work

eAlliance Corporation
|

Data Engineer

Dallas, TX, US

Summary

As a Data Engineer, delivered critical Snowflake-based data solutions for diverse clients including Mr. Cooper (mortgage servicing), Trustmark Insurance, and Citigroup (financial services), driving data ingestion, transformation, and regulatory compliance.

Highlights

Developed and managed high-volume ELT pipelines in Snowflake, ingesting and transforming critical mortgage servicing data from core systems including DB2 for Mr. Cooper.

Architected complex data models using star and snowflake schemas for mortgage entities and financial data, supporting robust analytical frameworks and regulatory reporting.

Engineered near real-time incremental data loads using Snowflake Streams, Tasks, and Snowpipe, significantly reducing data latency for critical business updates across multiple clients.

Optimized Snowflake query performance and storage efficiency through clustering keys, materialized views, and strategic partitioning, reducing compute costs and improving data retrieval speeds.

Ensured stringent data quality by implementing comprehensive validation rules, reconciliation logic, and audit checkpoints across all data layers, maintaining data integrity for financial and insurance data.

Enabled compliant regulatory reporting by developing traceable, analytics-ready datasets with complete data lineage and comprehensive compliance documentation for financial and insurance clients.

Led the migration of critical financial and regulatory data from on-premise databases to Snowflake, ensuring seamless transition and data integrity for Citigroup.

Collaborated cross-functionally with stakeholders across Mortgage Servicing, Risk, Compliance, and Finance to translate complex business requirements into scalable Snowflake data solutions.

Ameri Cloud Solutions
|

Data Engineer

Dallas, TX, US

Summary

As a Data Engineer, led the design and optimization of ELT/ETL pipelines for ALG Worldwide Logistics, driving efficient data integration and cost-effective warehouse utilization.

Highlights

Engineered and optimized ELT/ETL pipelines leveraging Snowflake's Snowpipe, Streams, and Tasks to process high-volume logistics data, significantly enhancing real-time data availability.

Integrated diverse large-scale datasets from transportation, warehouse management, and ERP systems into Snowflake, establishing a centralized, analytics-ready data repository.

Optimized Snowflake query performance and warehouse resource utilization, resulting in significant cost savings and accelerated data processing speeds.

Implemented robust data quality checks, validation rules, and exception handling mechanisms within pipelines, ensuring data accuracy and reliability for critical logistics operations.

Automated daily logistics reporting and real-time alert generation using Snowflake Tasks and advanced orchestration tools, improving operational visibility and response times.

Authored comprehensive technical documentation for data models, pipeline architectures, and business data flows, facilitating knowledge transfer and compliance.

Education

Saveetha School of Engineering
Chennai, Tamil Nadu, India

Bachelor of Engineering

Computer Science

Skills

Data Warehousing & ETL

Snowflake, Snowpipe, Streams, Tasks, Stored Procedures, Materialized Views, SQL-based Transformations, Incremental Data Loads.

Databases

DB2, SQL Server, MySQL, MongoDB.

Programming Languages

SQL, Snowflake Scripting, Python, Shell.

Cloud Platforms

AWS.

DevOps & CI/CD

Git, Github.

Business Intelligence

Power BI.

Data Governance & Security

Role-Based Access Control (RBAC), Data Access Governance, Sensitive Data Protection.

Data Modeling

Star Schema, Snowflake Schema, Dimensional Modeling.