Deepak Mane

Senior Data Architect & AI/ML Solution Leader
Zurich, CH.

About

Highly accomplished and results-oriented Technology Professional with over 20 years of extensive expertise in Artificial Intelligence, Data Engineering, Data Science, Machine Learning, and Deep Learning. Proven leader and trusted Enterprise Solution Architect, specializing in architecting, designing, developing, and deploying complex data ingestion frameworks, Enterprise Data Lakes, and advanced analytics platforms across diverse industries including Banking/Finance, Telecom, Media, and Pharma. Recognized for driving multi-million dollar revenue growth, optimizing operations, and holding 11 patents in AI, Deep Learning, and IoT, demonstrating a strong track record of innovation and impactful solutions.

Work

SIX Group
|

Data Architect / Engineer

Zurich, Zurich, Switzerland

Summary

As a Data Architect/Engineer, I am leading the migration of on-premises Oracle DWH to Google Cloud Platform, designing comprehensive architecture, governance, and ETL pipelines while leveraging AI/ML for data transformation and security.

Highlights

Prepared data design models and implemented a comprehensive data integration solution leveraging Google Cloud Platform and Dataflow for large-scale DWH migration.

Designed and executed a robust Data Governance strategy for migration projects, aligning diverse stakeholder needs (Business, Technology, Operations).

Implemented a target data storage architecture utilizing Google Storage and Google Databricks (Medallion – Bronze, Silver, Gold layers) to optimize data organization and accessibility.

Managed the complete migration of Oracle SQL queries to Databricks SQL format, achieving 100% data modeling accuracy and efficiency.

Established a secure integration runtime environment, including SIX FTS, to facilitate smooth data ingestion from core source systems.

Developed and deployed security and monitoring solutions using Google Monitor, ensuring data integrity and compliance across the cloud platform.

Converted complex Oracle SQL queries into Databricks SQL using LLM models, streamlining the migration process and enhancing data transformation capabilities.

SIX Group
|

Data Architect/Engineer

Zurich, Zurich, Switzerland

Summary

As a Data Engineer/Solution Architect at SIX Group, I designed and implemented advanced data integration solutions, optimized storage, and ensured robust data governance and security on Google Cloud, developing and deploying scalable data pipelines.

Highlights

Optimized and developed Google Data Fusion pipelines, reducing data ingestion time by 30% and significantly enhancing overall reporting efficiency.

Streamlined data flow by creating robust ETL pipelines in Google Data Fusion, decreasing data loading time by 25%.

Implemented a Google Data Lake Store for thin provisioning, resulting in a 20% reduction in data storage costs.

Designed and implemented a medallion architecture (Bronze, Silver, Gold layers) for processing and transforming data using Databricks and Google Storage.

Developed and deployed a Log Analytics framework using KQL and Google Data Fusion, enhancing operational monitoring and troubleshooting capabilities.

Designed a scalable data ingestion pipeline using Google Data Fusion, processing over 1 million records per hour from diverse sources.

Implemented comprehensive data governance and security measures, ensuring data quality, integrity, and compliance with regulatory requirements.

Collaborated with cross-functional teams to integrate the analytics platform with existing systems and applications, improving data accessibility.

Commonwealth Bank of Australia
|

Technical Manager/Solution Architect

Sydney, New South Wales, Australia

Summary

As Technical Manager/Solution Architect, I led an L3 Support team to design and deploy utilities for memory optimization, cost reduction, and security frameworks, significantly boosting L1/L2 application support efficiency.

Highlights

Designed, built, and deployed Amazon Machine Images (AMI) and Elastic Container Registry (ECR) using automated pipelines and scripts, streamlining infrastructure provisioning.

Developed an automated cost optimizer model leveraging CloudWatch and AWS Trust Advisor, achieving significant cost savings for cloud resources.

Created an advanced error handling and log exception framework using AWS CloudWatch, monitoring, and audit logs to improve system reliability.

Developed and implemented scripts for automated patching at the OS, application, and code levels, ensuring system security and compliance.

Published efficient physical data models for NoSQL, SQL, and message-based target data layers, optimizing data storage and retrieval.

Resolved performance bottlenecks in data pipelines and transformation processes, improving system throughput and efficiency.

Integrated Spark with Abinitio and AWS Kinesis to enable real-time data streaming and processing.

Commonwealth Bank of Australia
|

Senior Data Scientist/Engineer

Sydney, New South Wales, Australia

Summary

As a Senior Data Scientist/Engineer, I led a team to develop a real-time financial fraud detection system, reducing false positive rates through graph databases and Python, while optimizing ML algorithms and preparing cloud deployment guidelines.

Highlights

Engineered and optimized Spark and Airflow for performance tuning, enhancing the efficiency of data processing workflows.

Designed, developed, and documented various machine learning models for world-class enterprise applications, ensuring high accuracy and scalability.

Led and collaborated with the Bank's Financial Security team to resolve critical issues, enhancing fraud detection capabilities.

Developed a machine intelligence-based model for auto-classification of transactions and entity extraction using NLTK, achieving high data accuracy.

Created advanced Graph-based machine learning models (e.g., community detection, similarity, heuristic pattern, path finding, search) for complex fraud patterns.

Achieved real-time processing of transactions in milliseconds, crucial for immediate fraud detection and prevention.

Developed a real-time data ingestion model using Kafka with CDC from various sources, ensuring continuous and up-to-date data for analysis.

Led cloud platform provisioning, build deployment, and release management, ensuring seamless deployment and operation of the fraud detection system.

Foxtel Australia
|

Data Scientist/Architect

Sydney, New South Wales, Australia

Summary

As a Data Scientist/Architect, I contributed to Foxtel's Enterprise Analytics, building a real-time AWS cloud data ingestion framework to enable 360-degree customer views and diverse analytics use cases.

Highlights

Developed a logging and feedback framework to ensure data delivery to storage (AWS Kinesis) with high reliability.

Implemented a real-time data ingestion platform leveraging AWS Kinesis in the cloud for high-throughput data processing.

Managed cloud platforms (AWS and GCP) to ensure optimal performance, cost-efficiency, and operational stability.

Developed diverse machine learning models, including Collaborative Filtering, K-Means Clustering, and Bayesian Networking, to extract deeper customer insights.

Designed and developed a real-time online dashboard using Amazon Kinesis, ELK, and Python Pusher for immediate data visualization.

Developed a Data Quality/Transformation framework using Spark Scala, ensuring data integrity and consistency for analytics.

Designed a robust data ingestion architecture model based on 50 parameters, ensuring no zero data loss, high SAR, auditing, security, and compliance.

Client Project
|

Data Scientist/Engineer

Remote, Various, US

Summary

In this predictive analytics project, I focused on developing advanced sales forecasting models and a customer churn prediction model to optimize marketing campaigns and reduce costs for sales and marketing departments through data science approaches.

Highlights

Applied advanced missing value treatments and outlier reduction techniques to improve data quality and model accuracy.

Developed a feature engineering extraction methodology using Python libraries to enhance predictive power from customer data.

Built a customer classification model using Light Machine Learning algorithms, achieving high accuracy in customer segmentation.

Developed an accurate sales prediction model using regression techniques, improving sales forecasting capabilities.

Created a time-series machine learning model to suggest optimal order quantities, optimizing inventory and supply chain.

Trained existing datasets to achieve high accuracy for machine learning models, ensuring reliable predictive outcomes.

VISA
|

Data Scientist/Engineer

Remote, Various, US

Summary

For VISA's Credit Card Fraud Management project, I enhanced existing fraud detection machine learning models by incorporating new fraud patterns, significantly reducing false positive rates for credit card transactions and improving overall system performance.

Highlights

Developed trend detection and co-relation finder to identify abnormal transaction behaviors, enhancing fraud pattern recognition.

Built a web crawler to gather intelligence on fraudulent cyber activity, expanding fraud detection capabilities.

Featured and modeled a fraud detection model using Spark, optimizing processing for large datasets.

Developed a classification model using four diverse algorithms (Boost, KNN, SVM, Random Forest) for robust fraud detection.

Increased the accuracy of existing machine learning models by 20% through the application of Boost algorithms.

Bank Bukovina
|

Big Data Consultant

Remote, Various, US

Summary

As a Big Data Consultant for Bank Bukovina, I designed a comprehensive enterprise architecture strategy for building a hybrid cloud and on-premises Enterprise Data Lake, delivering a 3-year roadmap for its implementation and evolution.

Highlights

Conducted detailed AS-IS analysis, gap analysis, and recommendations across three phases to inform the Enterprise Data Lake strategy.

Developed an end-to-end solution architecture model for the Enterprise Data Lake, integrating diverse fabrics (Integration, Knowledge, Delivery, Analytics, Information Management) across cloud (Amazon) and on-premises environments.

Designed a business use case onboarding model, incorporating Cost Benefit Analytics and Return on Investment metrics for new initiatives.

Implemented Change Data Capture using Kafka and Attunity for structured databases, ensuring real-time data synchronization.

Designed detailed Data Processing Flows for the data lake, including various layers and fabrics (Integration, Knowledge, Delivery, Analytics).

Developed a 12-month Data Ingestion Roadmap with integrated business use cases.

Provided a blueprint model for Hadoop Disaster Recovery, enhancing data resilience and business continuity.

Education

Visvesvaraya Technological University
Belgaum, Karnataka, India

Master of Computer Application

Computer Science

University of Mumbai
Mumbai, Maharashtra, India

Bachelor of Science

Mathematics

Awards

11 Patents in AI, Deep Learning, and IoT

Awarded By

Government of Australia and India

Recognized for significant contributions and innovations in Artificial Intelligence, Deep Learning, and Internet of Things across various applications.

Machine Learning in Safety Critical Cyber Security System

Awarded By

Government of Australia

Patent for developing a machine learning system to enhance safety in critical cybersecurity.

A System for Controlling Traffic Based on AI and Machine Learning

Awarded By

Government of Australia

Patent for a system designed to control traffic using Artificial Intelligence and Machine Learning.

Effectiveness of Internet Advertising on Consumer Buying Behavior Towards Mobile Phones

Awarded By

Government of Australia

Patent for research on the effectiveness of internet advertising in influencing consumer buying behavior via mobile phones.

IoT based Home Automation using Cloud

Awarded By

Government of Australia

Patent for an IoT-based home automation system leveraging cloud computing.

A novel process for generating a customer-value-driven plan to create high growth business opportunities

Awarded By

Government of Australia

Patent for a novel process to generate customer-value-driven plans for high-growth business opportunities.

A Systematic Framework for Healthcare Information Exchange Through Blockchain based Approaches

Awarded By

Government of India

Patent for a systematic framework for secure healthcare information exchange using blockchain technology.

A System for Geo-Positioning Using Smart Identity Card

Awarded By

Government of India

Patent for a geo-positioning system utilizing smart identity cards.

Monitoring and Detection of Covid-19 Patients using Internet of Things

Awarded By

Government of India

Patent for an IoT-based system to monitor and detect Covid-19 in patients.

Internet of Things based Ebola Virus Detection Among Masses

Awarded By

Government of India

Patent for an IoT-based system to detect Ebola virus spread among populations.

A System and Method for Analyzing Dynamic Out-Of-Home Advertising based on Real-Time Viewers Biometric Information

Awarded By

Government of India

Patent for a system and method to analyze dynamic out-of-home advertising using real-time viewer biometric information.

Skills

Artificial Intelligence & Machine Learning

Artificial Intelligence, Machine Learning, Deep Learning, Data Science, Graph Data Science, Advanced Artificial Intelligence, Azure Copilot, LLM (PHI, Qwen, ChatGPT), AWS Q-Developer, Collaborative Filtering, K-Means Clustering, Bayesian Networking, NLTK, Boost Algorithms, KNN, SVM, Random Forest, TensorFlow, Caffee, Theano, Mahout.

Cloud Platforms & Services

Google Cloud Platform, Azure Cloud Platform, Amazon Cloud Platform, Google Storage, Google Data Fusion, Big Query, Google Analytics, Azure Data Fabric, Azure Data Factory, KQL, Azure Log Analytics Workspace, ADF, Databricks, Storage Blobs, Synapse, Azure Monitoring, Azure DevOPS, EC2, EBS, S3, Elastic Container Services, VPC, Snowball, Redshift, Amazon RDS, Amazon Glacier, Amazon CloudFront, Amazon Kinesis, Amazon Redshift, Virtual Machines, MySQL Databases, Cosmos, Openstack, Savanna.

Big Data & Data Engineering

Big Data, Data Engineering, Enterprise Data Lake, Data Lake Creation, Data Ingestion/Processing Framework, ETL Tools, Change Data Capture, Hadoop, Cloudera, Hortonworks, MapR, Collibra, HDFS, PIG, Hive, HBASE, SPARK, SQOOP, Kafka, Flume, ELK, NIFI, Ab Initio, Talend Data Fabric, Pentaho, CDC, GlobalID, Platfora, Tresata, Attunity, Wandisco.

Programming & Scripting

Python, SPARK, Scala, Java, MongoDB, Couchbase, GraphX, KQL, SQL, Python Excel, Python Anaconda.

Data Management & Governance

Data Modelling Tools (ERWIN), Data Governance, Cyber Security, System Management, Automation, Metadata Strategy, Data Quality, Audit Log.

Analytics & Visualization

Analytics Tools, Power BI, Alteryx, Attavio, Tressata, Google Monitor, Log Analytics Workspace.

Architectural & Leadership

Solution Architect/Designer, Enterprise Solution Architect, Technical Lead, IT Management, Pre-Sales, Training/Mentoring, Architectural Thinking, Fundamentals of Architecting Solutions: Structuring the Solution.

Projects

Autonomous Car Analytics

Summary

Led Solution Architecture and Data Science initiatives for developing advanced analytics solutions for autonomous vehicles, leveraging cloud computing for a major automotive company.

Pharma's Patients Analytics Platform

Summary

Served as Solution Architect and Data Science lead for developing a patient analytics platform in the pharmaceutical industry for Pfizer.

Enterprise Data Lake (Cloud and on-premises)

Summary

Executed a Data Science and Cloud project for Bank Bukopin, focusing on building a hybrid enterprise data lake solution.

Analytics 2.0 Platform

Summary

Developed an advanced Analytics 2.0 platform as a Data Science and Cloud project for TCS (Ultimatix).

BABAR (Data Ingestion, Analytics and Quality)

Summary

Managed a Data Science, Data Ingestion, and Analytics project for HSBC (UK), focusing on complex data processing and quality assurance.

Text Mining

Summary

Conducted a Data Mining and Data Science project for a large Aviation Industry client, focusing on advanced text analysis.

Aircraft Engine Defect Identification (Image Analytics)

Summary

Led a Data Science and Deep Learning project for GE, focusing on identifying defects in aircraft engines and parts using advanced image analytics.

Automated Medical Image Screening & Classification

Summary

Executed a Data Science and Deep Learning project for GE, developing automated screening and classification systems for medical images for a leading medical equipment manufacturer.

Web Intelligence

Summary

Managed a Big Data and Analytics project for a Sweden-based Supermarket company, focusing on web intelligence solutions.

Dynamic Big Data Clusters (Openstack)

Summary

Led a Big Data and Cloud Computing project for Santander Bank, focusing on building dynamic Big Data clusters in a private cloud environment using OpenStack.

Hadoop Cluster Performance Monitoring & Alert

Summary

Managed a Big Data project for Nokia - Finland, focused on implementing performance monitoring and alerting systems for Hadoop clusters.

Music Recommendation System

Summary

Developed a Big Data-driven music recommendation system for Nokia - Finland.

Hadoop Disaster Recovery

Summary

Implemented a Big Data / Hadoop disaster recovery solution for British Petroleum.

Deepak Mane