Blog

Nov 17, 2025

IGQ-ViT: Instance-Aware Group Quantization for Low-Bit Vision Transformers

IGQ-ViT introduces a dynamic channel-grouping strategy for quantizing Vision Transformers, improving accuracy and hardware efficiency with minimal latency trade-offs. Unlike previous methods that rely on static groups or fake quantization, it computes per-input statistical properties to assign channels more effectively, supports practical accelerator designs, and outperforms prior PTQ approaches on models like DETR.

Source: HackerNoon →


Share

BTCBTC
$90,314.00
1.95%
ETHETH
$3,197.71
5.42%
USDTUSDT
$1.00
0%
XRPXRP
$2.00
2.78%
BNBBNB
$872.38
2.68%
USDCUSDC
$1.000
0%
SOLSOL
$133.92
2.13%
STETHSTETH
$3,195.24
4.99%
TRXTRX
$0.280
0.96%
DOGEDOGE
$0.138
6.01%
ADAADA
$0.413
10.66%
FIGR_HELOCFIGR_HELOC
$1.03
1.99%
WBTWBT
$60.95
3.26%
WSTETHWSTETH
$3,905.57
5.05%
WBETHWBETH
$3,472.54
4.86%
WBTCWBTC
$90,181.00
2.11%
BCHBCH
$560.89
1.69%
USDSUSDS
$1.000
0.01%
LINKLINK
$13.64
4.48%
WEETHWEETH
$3,463.33
5.08%