Wisdom O. Ikezogwo

Machine Learning Researcher | Ph.D. Candidate in Computer Science

Seattle, US.

About

Highly accomplished Machine Learning Researcher and Ph.D. Candidate at the University of Washington’s Paul G. Allen School of Computer Science & Engineering, specializing in generative modeling, multimodal representation learning, and large-scale data curation. My research drives state-of-the-art advancements in medical imaging and video generation, backed by prestigious grants from Microsoft and the UW Population Health Initiative. With a First Class B.Sc. in Electronic & Electrical Engineering from Obafemi Awolowo University and impactful research internships at Apple and Mayo Clinic, I bring a proven track record of innovation. My work is published in top-tier venues like CVPR and NeurIPS, spanning critical applications in healthcare, finance, and recommendation systems.

Work

University of Washington

Research Assistant – Graphics and Imaging Laboratory (GRAIL)

Summary

Leading cutting-edge research in multimodal AI for medical imaging and advanced video generation, driving innovation in state-of-the-art models and dataset creation.

Highlights

Spearheaded research on Multimodal Large Language Models (LLMs) for medical imaging, developing novel medical multimodal datasets (Quilt-1M, MedNarratives) and state-of-the-art models (QuiltNet, Quilt-LLaVA).

Engineered a multi-agent AI framework, PathFinder, for clinical diagnosis, achieving performance superior to human experts and establishing improved benchmarks (MedBlink).

Directing efforts to integrate Newtonian physics into image and video generative models, focusing on generating large-scale, temporally consistent video scene graph datasets and pipelines.

University of Washington

Teaching Assistant

Summary

Provided comprehensive instructional and academic support for undergraduate and graduate courses in Computer Science.

Highlights

Taught Data Programming (CSE 160) for Fall 2021, Winter 2022, and Fall 2022 semesters, guiding students through fundamental concepts and practical applications.

Provided instruction and support for Introduction to Artificial Intelligence (CSE 473) during Spring 2023, Fall 2023, and Winter 2024 academic terms.

Apple

Ph.D. Machine Learning Research Internship

Summary

Led research on efficient multimodal representations for egocentric data, focusing on reducing data capture costs.

Highlights

Pioneered research in efficient multimodal representations for egocentric data, including video, text, audio, IMU, and hands.

Developed 'Perceive-Predict,' a novel framework leveraging predictive coding between co-occurring modalities to reconstruct missing data, significantly reducing capture costs for expensive modalities like video.

Mayo Clinic

Ph.D. Quantitative Health Sciences Internship

Summary

Directed research efforts to develop a foundational model for histopathology using large-scale image datasets.

Highlights

Led research towards developing a foundational model for histopathology, training on millions of gigapixel-sized histology images.

Scaled-up compute operations on the Argonne National Lab computing cluster, resulting in clinically evaluated models for improved diagnostic capabilities.

Okra, Inc.

ML Engineer

Summary

Developed machine learning models for financial information extraction and integration into lending systems.

Highlights

Engineered models to extract crucial customer financial information from unstructured banking data.

Processed key customer earning and spending data to feed into downstream lending pipelines, including predicting income, analyzing spending patterns, and performing reconciliations.

Demz Analytics Limited

Data Scientist / ML Engineer

Summary

Designed and implemented production-grade recommendation systems for enhanced user experience.

Highlights

Developed and deployed production recommendation systems utilizing advanced techniques such as attention mechanisms and epsilon-greedy bandit strategies.

Obafemi Awolowo University

UG. Research Assistant – Biosignal Processing, Inst. & Control Lab

Summary

Conducted research in biosignal processing, focusing on EEG signal analysis and neural network applications.

Highlights

Integrated disparate multivariate time series data, employing spectral component characterization for dynamical dimensionality reduction.

Developed and trained neural networks for accurate classification and characterization of brain EEG signals.

Education

University of Washington

Ph.D.

Computer Science and Engineering

Grade: 3.97/4.00

Courses

Advanced Machine Learning

Computer Vision

Natural Language Processing

Deep Learning

Generative Models

Obafemi Awolowo University

B.Sc.

Electronic & Electrical Engineering

Grade: 4.73/5.00

Courses

Digital Signal Processing

Control Systems Engineering

Applied Electronics

Microprocessor Systems

Electromagnetic Fields and Waves

Awards

Population Health Initiative AI Pilot Research Grant Award

Awarded By

University of Washington

Awarded $100,000 for pioneering AI research in population health.

Microsoft's Accelerate Foundation Models Research Grant

Awarded By

Microsoft

Received $20,000 grant to advance research in foundational AI models.

IBRO-Simons Computational Neuroscience Summer School Travel Grant

Awarded By

IBRO-Simons

Awarded travel grant to attend a prestigious computational neuroscience summer school in Cape Town.

Prof. Kehinde Prize for the Best Graduating Student in the Control Option

Awarded By

Obafemi Awolowo University

Recognized as the top-performing student in the Control Engineering specialization.

Oyebolu Prize for Best Male Graduating Student

Awarded By

Obafemi Awolowo University

Awarded for outstanding academic achievement as the best male graduating student.

Federal Government Scholarship Award, Nigeria

Awarded By

Federal Government of Nigeria

Received a cumulative scholarship valued at $1500 for academic excellence.

Total/NNPC National Merit Scholarship

Awarded By

Total/NNPC

Awarded a cumulative scholarship valued at $1500 based on national merit.

Etisalat Nigeria Merit Scholarship

Awarded By

Etisalat Nigeria

Received a scholarship valued at $250 for academic merit.

Publications

Quilt-LLaVA: Visual Instruction Tuning by Extracting Localized Narratives from Open-Source Histopathology Videos

Published by

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Summary

Introduces a novel approach for visual instruction tuning using localized narratives from histopathology videos to enhance multimodal LLMs.

Quilt-1M: One Million Image-Text Pairs for Histopathology

Published by

NeurIPS

Summary

Presents a large-scale dataset of one million image-text pairs specifically curated for histopathology, enabling advanced medical imaging research.

Multi-modal Masked Autoencoders Learn Compositional Histopathological Representations

Published by

Machine Learning for Health (ML4H)

Summary

Explores the use of multimodal masked autoencoders to learn compositional representations from histopathological data, improving diagnostic capabilities.

Risk Stratification of Solitary Fibrous Tumor Using Whole Slide Image Analysis

Published by

LABORATORY INVESTIGATION, ELSEVIER SCIENCE INC

Summary

Applies whole slide image analysis for risk stratification of solitary fibrous tumors, contributing to more precise medical diagnostics.

Recent Advances, Applications and Open Challenges in Machine Learning for Health: Reflections from Research Roundtables

Published by

ML4H Symposium

Summary

A collaborative reflection on the latest advancements, applications, and challenges in machine learning for health, derived from research roundtables.

PathFinder: A Multi-Modal Multi-Agent Framework for Diagnostic Decision-Making in Histopathology

Published by

In Submission to ICCV

Summary

Proposes a multi-modal, multi-agent framework to enhance diagnostic decision-making processes in histopathology.

MedicalNarratives: Connecting Medical Vision and Language with Procedural and Localized Narratives across all medical imaging domains

Published by

In Submission to ICCV

Summary

Explores linking medical vision and language through procedural and localized narratives to create a comprehensive framework for medical imaging analysis.

MedBlink: Probing the Fundamental Medical Imaging Knowledge of Multimodal Language Models

Published by

In Submission to ICCV

Summary

Investigates the foundational medical imaging knowledge embedded within multimodal language models to assess their understanding and capabilities.

Percieve-Predict: Modality and Time-Aware Egocentric Efficient Multi-Modal Representations

Published by

In Preparation for NeurIPS

Summary

Developing efficient multi-modal representations for egocentric data, incorporating modality and time awareness for improved predictive capabilities.

VPhysics: Temporally consistent Physics in Video (multiframe) Generation via Alignment

Published by

In Preparation for NeurIPS

Summary

Focuses on generating temporally consistent video frames by integrating physics principles, ensuring realistic motion and interactions.

Synthetic Video Scene Graph Generation

Published by

NeurIPS D&B

Summary

Research on generating synthetic video scene graphs to improve understanding and manipulation of complex video content.

Multi-Scale Cross-Attention Multiple Instance Learning (MsCAMIL) Network for Automated Triage of Colorectal Polyps

Published by

United States and Canadian Academy of Pathology's (USCAP) 114th Annual Meeting

Summary

Introduces MsCAMIL network for automated triage of colorectal polyps, enhancing efficiency in pathological diagnosis.

Comparative Performance of Multi-Scale Cross-Attention Multiple Instance Learning (MsCAMIL) and Pathology Trainees in Colorectal Polyp Diagnosis

Published by

United States and Canadian Academy of Pathology's (USCAP) 114th Annual Meeting

Summary

Compares the diagnostic performance of MsCAMIL against human pathology trainees in colorectal polyp diagnosis.

Supervised domain generalization for integration of disparate scalp EEG datasets for automatic epileptic seizure detection

Published by

Computers in Biology and Medicine

Summary

Investigates domain generalization techniques to integrate diverse EEG datasets for improved automatic epileptic seizure detection.

Empirical Characterization of the Temporal Dynamics of EEG Spectral Components

Published by

International Journal of Online and Biomedical Engineering (IJOE)

Summary

Provides an empirical characterization of the temporal dynamics of EEG spectral components, contributing to a deeper understanding of brain activity.

Languages

English

Native

Skills

Machine Learning

Generative Modeling, Multimodal Representation Learning, Deep Learning, Neural Networks, Foundation Models, Predictive Coding, Recommendation Systems, Attention Mechanisms, Epsilon-Greedy Bandit Strategy.

Computer Vision

Medical Imaging, Histopathology, Video Generation, Scene Graph Generation, Image Analysis, Whole Slide Imaging, Egocentric Data.

Natural Language Processing

Multimodal LLMs, Text Analysis, Localized Narratives.

Data Science

Data Curation, Large-Scale Datasets, Time Series Analysis, EEG Signal Processing, Dimensionality Reduction, Statistical Analysis, Data Programming.

Programming Languages

Python.

Machine Learning Frameworks

PyTorch, TensorFlow.

Research & Development

Experimental Design, Model Development, Performance Evaluation, Scientific Writing, Peer Review, Grant Writing.

Leadership & Mentorship

Project Leadership, Team Collaboration, Teaching, Mentoring.

Interests

Research

Generative Modeling, Multimodal Representation Learning, Data Curation, Artificial Intelligence in Healthcare.