Quilt-LLaVA: Visual Instruction Tuning by Extracting Localized Narratives from Open-Source Histopathology Videos
Published by
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Summary
Introduces an approach to visual instruction tuning that extracts localized narratives from open-source histopathology videos and uses them to improve multimodal LLMs.
