Blog
4 days ago
Multimodal Fusion: MIVPG's Hierarchical MIL Approach for Multi-Image Samples
Details MIVPG's hierarchical approach to multiple instance learning (MIL) for multi-image samples. Both the patches within each image and the images within each sample are treated as 'instances', and their features are aggregated via cross-attention.
Source: HackerNoon
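
To make the two-level idea in the summary concrete, here is a minimal sketch of hierarchical attention pooling in plain Python. It is not MIVPG's actual implementation: the function names (`cross_attention_pool`, `hierarchical_mil`), the single random query vector, and the absence of learned key/value projections are all assumptions made for illustration. Patches are first pooled into one embedding per image, then the image embeddings are pooled into one embedding per sample.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention_pool(query, instances):
    """Pool d-dimensional instance vectors into one vector.

    The query attends over the instances (scaled dot-product scores),
    and the result is the attention-weighted average of the instances.
    Assumption: no learned key/value projections, to keep the sketch short.
    """
    d = len(query)
    scores = [
        sum(q * x for q, x in zip(query, inst)) / math.sqrt(d)
        for inst in instances
    ]
    weights = softmax(scores)
    return [
        sum(w * inst[j] for w, inst in zip(weights, instances))
        for j in range(d)
    ]


def hierarchical_mil(sample, query):
    """sample: list of images, each a list of patch feature vectors."""
    # Level 1: patches are the instances, each image is a bag.
    image_embeddings = [cross_attention_pool(query, patches) for patches in sample]
    # Level 2: images are the instances, the whole sample is the bag.
    return cross_attention_pool(query, image_embeddings)


# Hypothetical toy input: 3 images, 16 patches each, 8-dim features.
random.seed(0)
d = 8
sample = [
    [[random.gauss(0, 1) for _ in range(d)] for _ in range(16)]
    for _ in range(3)
]
query = [random.gauss(0, 1) for _ in range(d)]  # stands in for a learned query
bag_embedding = hierarchical_mil(sample, query)
```

In a trained model the query would be learned and there would typically be several of them, so the same query is reused at both levels here purely for brevity.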