Blog
5 days ago
MIL Perspective: Analyzing Q-Former as a Multi-Head Mechanism
Proves Q-Former is a Multi-Head MIL module due to permutation invariance in its cross-attention. Notes its limitation: it assumes i.i.d. instances, overlooking crucial instance correlation.
Source: HackerNoon →