Deepak Mane
Senior Data Architect & AI/ML Solution Leader
Zurich, CH.About
Highly accomplished and results-oriented Technology Professional with over 20 years of extensive expertise in Artificial Intelligence, Data Engineering, Data Science, Machine Learning, and Deep Learning. Proven leader and trusted Enterprise Solution Architect, specializing in architecting, designing, developing, and deploying complex data ingestion frameworks, Enterprise Data Lakes, and advanced analytics platforms across diverse industries including Banking/Finance, Telecom, Media, and Pharma. Recognized for driving multi-million dollar revenue growth, optimizing operations, and holding 11 patents in AI, Deep Learning, and IoT, demonstrating a strong track record of innovation and impactful solutions.
Work
Zurich, Zurich, Switzerland
→
Summary
As a Data Architect/Engineer, I am leading the migration of on-premises Oracle DWH to Google Cloud Platform, designing comprehensive architecture, governance, and ETL pipelines while leveraging AI/ML for data transformation and security.
Highlights
Prepared data design models and implemented a comprehensive data integration solution leveraging Google Cloud Platform and Dataflow for large-scale DWH migration.
Designed and executed a robust Data Governance strategy for migration projects, aligning diverse stakeholder needs (Business, Technology, Operations).
Implemented a target data storage architecture utilizing Google Storage and Google Databricks (Medallion – Bronze, Silver, Gold layers) to optimize data organization and accessibility.
Managed the complete migration of Oracle SQL queries to Databricks SQL format, achieving 100% data modeling accuracy and efficiency.
Established a secure integration runtime environment, including SIX FTS, to facilitate smooth data ingestion from core source systems.
Developed and deployed security and monitoring solutions using Google Monitor, ensuring data integrity and compliance across the cloud platform.
Converted complex Oracle SQL queries into Databricks SQL using LLM models, streamlining the migration process and enhancing data transformation capabilities.
Zurich, Zurich, Switzerland
→
Summary
As a Data Engineer/Solution Architect at SIX Group, I designed and implemented advanced data integration solutions, optimized storage, and ensured robust data governance and security on Google Cloud, developing and deploying scalable data pipelines.
Highlights
Optimized and developed Google Data Fusion pipelines, reducing data ingestion time by 30% and significantly enhancing overall reporting efficiency.
Streamlined data flow by creating robust ETL pipelines in Google Data Fusion, decreasing data loading time by 25%.
Implemented a Google Data Lake Store for thin provisioning, resulting in a 20% reduction in data storage costs.
Designed and implemented a medallion architecture (Bronze, Silver, Gold layers) for processing and transforming data using Databricks and Google Storage.
Developed and deployed a Log Analytics framework using KQL and Google Data Fusion, enhancing operational monitoring and troubleshooting capabilities.
Designed a scalable data ingestion pipeline using Google Data Fusion, processing over 1 million records per hour from diverse sources.
Implemented comprehensive data governance and security measures, ensuring data quality, integrity, and compliance with regulatory requirements.
Collaborated with cross-functional teams to integrate the analytics platform with existing systems and applications, improving data accessibility.
Sydney, New South Wales, Australia
→
Summary
As Technical Manager/Solution Architect, I led an L3 Support team to design and deploy utilities for memory optimization, cost reduction, and security frameworks, significantly boosting L1/L2 application support efficiency.
Highlights
Designed, built, and deployed Amazon Machine Images (AMI) and Elastic Container Registry (ECR) using automated pipelines and scripts, streamlining infrastructure provisioning.
Developed an automated cost optimizer model leveraging CloudWatch and AWS Trust Advisor, achieving significant cost savings for cloud resources.
Created an advanced error handling and log exception framework using AWS CloudWatch, monitoring, and audit logs to improve system reliability.
Developed and implemented scripts for automated patching at the OS, application, and code levels, ensuring system security and compliance.
Published efficient physical data models for NoSQL, SQL, and message-based target data layers, optimizing data storage and retrieval.
Resolved performance bottlenecks in data pipelines and transformation processes, improving system throughput and efficiency.
Integrated Spark with Abinitio and AWS Kinesis to enable real-time data streaming and processing.
Sydney, New South Wales, Australia
→
Summary
As a Senior Data Scientist/Engineer, I led a team to develop a real-time financial fraud detection system, reducing false positive rates through graph databases and Python, while optimizing ML algorithms and preparing cloud deployment guidelines.
Highlights
Engineered and optimized Spark and Airflow for performance tuning, enhancing the efficiency of data processing workflows.
Designed, developed, and documented various machine learning models for world-class enterprise applications, ensuring high accuracy and scalability.
Led and collaborated with the Bank's Financial Security team to resolve critical issues, enhancing fraud detection capabilities.
Developed a machine intelligence-based model for auto-classification of transactions and entity extraction using NLTK, achieving high data accuracy.
Created advanced Graph-based machine learning models (e.g., community detection, similarity, heuristic pattern, path finding, search) for complex fraud patterns.
Achieved real-time processing of transactions in milliseconds, crucial for immediate fraud detection and prevention.
Developed a real-time data ingestion model using Kafka with CDC from various sources, ensuring continuous and up-to-date data for analysis.
Led cloud platform provisioning, build deployment, and release management, ensuring seamless deployment and operation of the fraud detection system.
Sydney, New South Wales, Australia
→
Summary
As a Data Scientist/Architect, I contributed to Foxtel's Enterprise Analytics, building a real-time AWS cloud data ingestion framework to enable 360-degree customer views and diverse analytics use cases.
Highlights
Developed a logging and feedback framework to ensure data delivery to storage (AWS Kinesis) with high reliability.
Implemented a real-time data ingestion platform leveraging AWS Kinesis in the cloud for high-throughput data processing.
Managed cloud platforms (AWS and GCP) to ensure optimal performance, cost-efficiency, and operational stability.
Developed diverse machine learning models, including Collaborative Filtering, K-Means Clustering, and Bayesian Networking, to extract deeper customer insights.
Designed and developed a real-time online dashboard using Amazon Kinesis, ELK, and Python Pusher for immediate data visualization.
Developed a Data Quality/Transformation framework using Spark Scala, ensuring data integrity and consistency for analytics.
Designed a robust data ingestion architecture model based on 50 parameters, ensuring no zero data loss, high SAR, auditing, security, and compliance.
Remote, Various, US
→
Summary
In this predictive analytics project, I focused on developing advanced sales forecasting models and a customer churn prediction model to optimize marketing campaigns and reduce costs for sales and marketing departments through data science approaches.
Highlights
Applied advanced missing value treatments and outlier reduction techniques to improve data quality and model accuracy.
Developed a feature engineering extraction methodology using Python libraries to enhance predictive power from customer data.
Built a customer classification model using Light Machine Learning algorithms, achieving high accuracy in customer segmentation.
Developed an accurate sales prediction model using regression techniques, improving sales forecasting capabilities.
Created a time-series machine learning model to suggest optimal order quantities, optimizing inventory and supply chain.
Trained existing datasets to achieve high accuracy for machine learning models, ensuring reliable predictive outcomes.
Remote, Various, US
→
Summary
For VISA's Credit Card Fraud Management project, I enhanced existing fraud detection machine learning models by incorporating new fraud patterns, significantly reducing false positive rates for credit card transactions and improving overall system performance.
Highlights
Developed trend detection and co-relation finder to identify abnormal transaction behaviors, enhancing fraud pattern recognition.
Built a web crawler to gather intelligence on fraudulent cyber activity, expanding fraud detection capabilities.
Featured and modeled a fraud detection model using Spark, optimizing processing for large datasets.
Developed a classification model using four diverse algorithms (Boost, KNN, SVM, Random Forest) for robust fraud detection.
Increased the accuracy of existing machine learning models by 20% through the application of Boost algorithms.
Remote, Various, US
→
Summary
As a Big Data Consultant for Bank Bukovina, I designed a comprehensive enterprise architecture strategy for building a hybrid cloud and on-premises Enterprise Data Lake, delivering a 3-year roadmap for its implementation and evolution.
Highlights
Conducted detailed AS-IS analysis, gap analysis, and recommendations across three phases to inform the Enterprise Data Lake strategy.
Developed an end-to-end solution architecture model for the Enterprise Data Lake, integrating diverse fabrics (Integration, Knowledge, Delivery, Analytics, Information Management) across cloud (Amazon) and on-premises environments.
Designed a business use case onboarding model, incorporating Cost Benefit Analytics and Return on Investment metrics for new initiatives.
Implemented Change Data Capture using Kafka and Attunity for structured databases, ensuring real-time data synchronization.
Designed detailed Data Processing Flows for the data lake, including various layers and fabrics (Integration, Knowledge, Delivery, Analytics).
Developed a 12-month Data Ingestion Roadmap with integrated business use cases.
Provided a blueprint model for Hadoop Disaster Recovery, enhancing data resilience and business continuity.
Awards
11 Patents in AI, Deep Learning, and IoT
Awarded By
Government of Australia and India
Recognized for significant contributions and innovations in Artificial Intelligence, Deep Learning, and Internet of Things across various applications.
Machine Learning in Safety Critical Cyber Security System
Awarded By
Government of Australia
Patent for developing a machine learning system to enhance safety in critical cybersecurity.
A System for Controlling Traffic Based on AI and Machine Learning
Awarded By
Government of Australia
Patent for a system designed to control traffic using Artificial Intelligence and Machine Learning.
Effectiveness of Internet Advertising on Consumer Buying Behavior Towards Mobile Phones
Awarded By
Government of Australia
Patent for research on the effectiveness of internet advertising in influencing consumer buying behavior via mobile phones.
IoT based Home Automation using Cloud
Awarded By
Government of Australia
Patent for an IoT-based home automation system leveraging cloud computing.
A novel process for generating a customer-value-driven plan to create high growth business opportunities
Awarded By
Government of Australia
Patent for a novel process to generate customer-value-driven plans for high-growth business opportunities.
A Systematic Framework for Healthcare Information Exchange Through Blockchain based Approaches
Awarded By
Government of India
Patent for a systematic framework for secure healthcare information exchange using blockchain technology.
A System for Geo-Positioning Using Smart Identity Card
Awarded By
Government of India
Patent for a geo-positioning system utilizing smart identity cards.
Monitoring and Detection of Covid-19 Patients using Internet of Things
Awarded By
Government of India
Patent for an IoT-based system to monitor and detect Covid-19 in patients.
Internet of Things based Ebola Virus Detection Among Masses
Awarded By
Government of India
Patent for an IoT-based system to detect Ebola virus spread among populations.
A System and Method for Analyzing Dynamic Out-Of-Home Advertising based on Real-Time Viewers Biometric Information
Awarded By
Government of India
Patent for a system and method to analyze dynamic out-of-home advertising using real-time viewer biometric information.
Skills
Artificial Intelligence & Machine Learning
Artificial Intelligence, Machine Learning, Deep Learning, Data Science, Graph Data Science, Advanced Artificial Intelligence, Azure Copilot, LLM (PHI, Qwen, ChatGPT), AWS Q-Developer, Collaborative Filtering, K-Means Clustering, Bayesian Networking, NLTK, Boost Algorithms, KNN, SVM, Random Forest, TensorFlow, Caffee, Theano, Mahout.
Cloud Platforms & Services
Google Cloud Platform, Azure Cloud Platform, Amazon Cloud Platform, Google Storage, Google Data Fusion, Big Query, Google Analytics, Azure Data Fabric, Azure Data Factory, KQL, Azure Log Analytics Workspace, ADF, Databricks, Storage Blobs, Synapse, Azure Monitoring, Azure DevOPS, EC2, EBS, S3, Elastic Container Services, VPC, Snowball, Redshift, Amazon RDS, Amazon Glacier, Amazon CloudFront, Amazon Kinesis, Amazon Redshift, Virtual Machines, MySQL Databases, Cosmos, Openstack, Savanna.
Big Data & Data Engineering
Big Data, Data Engineering, Enterprise Data Lake, Data Lake Creation, Data Ingestion/Processing Framework, ETL Tools, Change Data Capture, Hadoop, Cloudera, Hortonworks, MapR, Collibra, HDFS, PIG, Hive, HBASE, SPARK, SQOOP, Kafka, Flume, ELK, NIFI, Ab Initio, Talend Data Fabric, Pentaho, CDC, GlobalID, Platfora, Tresata, Attunity, Wandisco.
Programming & Scripting
Python, SPARK, Scala, Java, MongoDB, Couchbase, GraphX, KQL, SQL, Python Excel, Python Anaconda.
Data Management & Governance
Data Modelling Tools (ERWIN), Data Governance, Cyber Security, System Management, Automation, Metadata Strategy, Data Quality, Audit Log.
Analytics & Visualization
Analytics Tools, Power BI, Alteryx, Attavio, Tressata, Google Monitor, Log Analytics Workspace.
Architectural & Leadership
Solution Architect/Designer, Enterprise Solution Architect, Technical Lead, IT Management, Pre-Sales, Training/Mentoring, Architectural Thinking, Fundamentals of Architecting Solutions: Structuring the Solution.