3 days ago
PowerInfer-2 Achieves 29x Speedup, Running 47-Billion Parameter LLMs on Smartphones
PowerInfer-2 runs 47-billion-parameter LLMs on smartphones with a reported speedup of up to 29x, achieved by optimizing for heterogeneous hardware and minimizing I/O overhead.
Source: HackerNoon