Blog
Nov 11, 2025
MIVPG: Multi-Instance Visual Prompt Generator for MLLMs
MIVPG enhances MLLMs by using Multi-Instance Learning to incorporate correlated visual data. It outperforms the simplified Q-former across diverse visual-language tasks, proving superior effectiveness.
Source: HackerNoon →