
I'm guessing 3.5-27b would beat 3.6-35b; MoE is a bad idea here. For the same VRAM, the 27b leaves a lot more room for context, and the quality of the output depends directly on context size, not just the "B" number.
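To make the VRAM tradeoff concrete, here is a back-of-envelope sketch. All the numbers (quantization at ~4.5 bits/param, a 24 GB card, a hypothetical attention config) are illustrative assumptions, not measurements of any specific model:

```python
# Back-of-envelope: how much KV-cache (context) room is left after weights.
# Bits-per-param, GPU size, and the attention config are all assumptions.

GiB = 1024**3

def weight_bytes(params_b, bits_per_param=4.5):
    """Approximate VRAM for quantized weights (roughly Q4_K-class)."""
    return params_b * 1e9 * bits_per_param / 8

def kv_bytes_per_token(layers=48, kv_heads=8, head_dim=128, bytes_per_elem=2):
    """KV-cache cost per token: K and V tensors for every layer (fp16)."""
    return 2 * layers * kv_heads * head_dim * bytes_per_elem

vram = 24 * GiB  # e.g. a single 24 GB GPU
for params in (27, 35):
    leftover = vram - weight_bytes(params)
    ctx_tokens = leftover / kv_bytes_per_token()
    print(f"{params}B: {leftover / GiB:.1f} GiB left -> ~{ctx_tokens:,.0f} tokens of KV cache")
```

Under these assumptions the 27B leaves roughly 10.6 GiB for context versus about 6.1 GiB for the 35B, which is the "more room" the parent comment is pointing at.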



MoE is not a bad idea for local inference if you have fast storage to offload experts to, and that is quickly becoming feasible with PCIe 5.0 interconnects.

MoE is excellent for unified-memory inference hardware like the DGX Spark, Apple's Mac Studio, etc. The large memory pool means you can fit quite a few B's, and the smaller active experts keep those tokens flowing fast.
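The "tokens flowing fast" point follows from decode being memory-bandwidth-bound: each token only has to read the *active* parameters, not the full model. A rough sketch, with purely illustrative bandwidth and parameter counts (the ~273 GB/s figure is an assumption for a DGX Spark-class machine):

```python
# Rough decode-speed estimate for memory-bandwidth-bound inference.
# Bandwidth, parameter counts, and bits-per-param are assumptions.

def tokens_per_sec(active_params_b, bw_gb_s, bits_per_param=4.5):
    """Each decoded token streams roughly all active weights from memory once."""
    active_bytes = active_params_b * 1e9 * bits_per_param / 8
    return bw_gb_s * 1e9 / active_bytes

bw = 273  # GB/s, assumed unified-memory bandwidth
print(f"dense, 70B active:      ~{tokens_per_sec(70, bw):.1f} tok/s")
print(f"MoE, 5B active per tok: ~{tokens_per_sec(5, bw):.1f} tok/s")
```

So a large-total, small-active MoE can decode an order of magnitude faster than a dense model of similar total size on the same memory bandwidth, which is why big unified memory plus MoE is such a good fit.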


