Hey thanks - yes agreed - for now we do: 1. Split metadata into shard 0 for huge... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		danielhanchen 18 hours ago \| parent \| context \| favorite \| on: Qwen3.6-35B-A3B: Agentic coding power, now open to... Hey thanks - yes agreed - for now we do: 1. Split metadata into shard 0 for huge models so 10B is for chat template fixes - however sometimes fixes cause a recalculation of the imatrix, which means all quants have to be re-made 2. Add HF discussion posts on each model talking about what changed, and on our Reddit and Twitter 3. Hugging Face XET now has de-duplication downloading of shards, so generally redownloading 100GB models again should be much faster - it chunks 100GB into small chunks and hashes them, and only downloads the shards which have changed

		help

ssrshh 9 hours ago | [–]

If you would know - is this also why LM Studio and Ollama model downloads often fail with a signature mismatch error?

danielhanchen 3 hours ago | | [–]

Probably yes

evilduck 15 hours ago | [–]

Ah thanks, I wasn't aware of #3, that should be a huge boon.

danielhanchen 3 hours ago | [–]

Oh yes! This only applies if one uses hf download / snapshot_download - other normal download methods sadly won't have XET

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact