About
Data Engineer with 1.6 years of experience designing and optimizing ETL pipelines, Python automation, and web scraping solutions. Works across a stack that includes AWS, Snowflake, Airflow, DBT, and Power BI to turn complex data into actionable insights and improve data quality. Collaborative team member who delivers data-driven solutions that support strategic decision-making and operational efficiency.
Work
Clarity Travel Technology Solutions
Data Engineer
Chennai, Tamil Nadu, India
Summary
Led data engineering initiatives at Clarity Travel Technology Solutions, designing and optimizing ETL pipelines, data warehouses, and automation scripts to improve data quality and support strategic decision-making.
Highlights
Optimized MySQL query performance through targeted indexing, reducing execution time and speeding up data retrieval.
Engineered a scalable data warehouse on Snowflake, using Apache Airflow for orchestration and DBT for data transformations, improving data accessibility and analytical capabilities.
Developed an AWS-native data pipeline with Lambda and S3, automating the extraction, cleaning, and transformation of flight invoice text files into structured JSON for downstream analysis.
Improved data quality and search relevance by automating the correction of hotel type misclassifications using Python, reducing errors and enhancing user experience.
Streamlined business intelligence by designing and scheduling Apache Airflow ETL workflows, automating periodic report distribution to key stakeholders for informed decision-making.
Education
Anna University
BE
Geoinformatics
Certificates
Power BI Data Analytics for All Levels 3.0
Issued By
Codebasics
Math and Statistics for AI, Data Science
Issued By
Codebasics
Advanced Programming and Master Data Engineering
Issued By
IITM & GUVI
Skills
Languages, Libraries & Databases
Python, SQL, MongoDB, Pandas, BeautifulSoup, Selenium, PySpark.
Data Engineering
Apache Airflow, DBT, Data Warehousing, Data Modeling, Hadoop, Hive, Spark, PySpark, Snowflake, Kafka.
AWS Services
S3, RDS, Redshift, Glue, Lambda, IAM, EC2, VPC, CloudWatch, Athena.
Tools & Technologies
Git, Power BI, MS Excel.
Professional Skills
Teamwork, Communication, Adaptability.