Mukul Yadav

B.Tech (CSE) Student | Aspiring Data Engineer
Gandhinagar, IN.

About

Highly motivated B.Tech (CSE) student with a 7.7 CGPA, specializing in Data Engineering and Big Data Analytics. Proven ability to design and implement scalable data pipelines, optimize data warehousing solutions, and develop data-driven applications using Python, SQL, AWS, Azure, and Databricks. Eager to leverage technical expertise and project leadership experience to drive impactful data initiatives in a dynamic professional environment.

Work

Infosys
|

Ai Engineer Intern

Remote

Summary

Developing an AI-driven expense forecasting application leveraging LLMs and ML models to provide personalized financial insights and budget planning. Implemented time-series forecasting and predictive analytics to identify spending patterns and project future expenses. Integrated natural language interfaces powered by LLMs for intuitive, conversational financial queries.

Highlights

Applied regression models, ARIMA/LSTM, and anomaly detection techniques to enhance forecast accuracy and detect irregular spending.

Delivered an interactive dashboard for real-time visualization of financial health, enabling smarter decision-making

Mactores
|

Data Engineer Intern

United States (Remote)

Summary

Developed expertise in designing and implementing scalable data warehousing and ETL pipelines using Snowflake, Databricks, and AWS services to support real-time data initiatives.

Highlights

Gained hands-on experience in designing and implementing data warehousing and ETL pipelines utilizing Snowflake, Databricks, and AWS services (S3, Lambda, Glue, Redshift) for efficient data transformation.

Collaborated with cross-functional teams to analyze and define requirements for real-time data ingestion and processing, ensuring alignment with business needs.

Contributed to the development of scalable data solutions, focusing on optimizing data flow and enhancing system performance.

Rashtriya Raksha University
|

Data Centre Intern

Gandhinagar, Gujarat, India

Summary

Managed data centre operations and enhanced system security, ensuring optimal performance and proactive issue resolution for critical university infrastructure.

Highlights

Assisted in the maintenance and optimization of data centre systems, contributing to enhanced operational efficiency and system stability.

Monitored systems continuously, identifying and addressing potential security vulnerabilities and operational issues in accordance with established IT protocols.

Volunteer

National Security Council (NSCS)
|

Volunteer

Gandhinagar, Gujarat, India

Summary

Volunteered for the National Cyber Security Exercise, contributing to national cybersecurity initiatives and awareness for NCX-2023 & NCX-2024.

Highlights

Contributed to the National Cyber Security Exercise (NCX-2023 & NCX-2024) organized by the National Security Council (NSCS), supporting critical national security objectives.

Assisted in various capacities to ensure the smooth execution of cybersecurity drills and awareness programs.

Gained exposure to national-level cybersecurity strategies and operational protocols.

GDSC'RRU
|

Projects Lead

Gandhinagar, Gujarat, India

Summary

Led technical initiatives and community engagement as Projects Lead for GDSC'RRU, fostering skill development and collaboration among students.

Highlights

Orchestrated and led technical workshops, enhancing programming and leadership skills for the student community.

Facilitated networking events and collaboration opportunities for over 100 students, fostering a vibrant technical ecosystem.

Managed project timelines and resources, ensuring successful execution of student-led technical projects.

Education

Rashtriya Raksha University
Gandhinagar, Gujarat, India

B.Tech (CSE)

Computer Science and Engineering

Grade: CGPA: 7.7

Courses

Data Structure and Algorithms

Database Management Systems

Big Data Analytics

Human Computer Interaction

Cloud Computing

Data Analytics and Visualization

Software Security

Languages

English

Skills

Programming Languages

Python, Java, C, C++.

Databases & Query Languages

SQL, MySQL, MongoDB.

Big Data & Streaming Technologies

PySpark, Apache Kafka, Apache Flink, Hadoop, Databricks, Snowflake, Delta Lake.

Cloud Data Services

Azure Data Factory, Azure Synapse Analytics, Microsoft Fabric, AWS S3, AWS Lambda, AWS Glue, AWS Redshift.

Version Control & DevOps

Git, GitHub, CI/CD pipelines.

Web Technologies

HTML, CSS, JavaScript, Bootstrap, Flask.

Data Analysis & Visualization

Pandas, Geopy Geocoder, DBT (Data Build Tool).

Machine Learning

Computer Vision, Machine Learning Projects.

Projects

CrimeScope: Crime Records Analytics

Summary

Developed a web-based crime record analysis hub utilizing Python, HTML, and CSS to predict crime patterns and visualize incident locations, enhancing public safety insights.

Airline Booking Data Architecture

Summary

Designed and implemented a scalable data architecture for airline booking data using Databricks and Delta Lake, optimizing data processing and enhancing decision-making capabilities.

Smart Classroom Management Software

Summary

Developed a data-driven Smart Classroom Management Software using HTML, CSS, JavaScript, Flask, and Python, providing real-time attendance tracking, consolidated alerting, and resource organization for over 200 students and teachers.