Blog
1 week ago
MIVPG: Multi-Instance Visual Prompt Generator for MLLMs
MIVPG enhances MLLMs by using Multi-Instance Learning to incorporate correlated visual data. It outperforms the simplified Q-former across diverse visual-language tasks, proving superior effectiveness.
Source: HackerNoon →