pizzao's comments

pizzao · 2026-06-13T14:04:33 1781359473

I wonder if there is a way local small LLMs can complement each other in a way that the sum total yields a much more performant LLM.

killerstorm · 2026-06-13T14:17:22 1781360242

Perhaps some radical MoE where you download _exactly_ the components you need, as you need them. Currently, MoE routing usually happens on a per-token, per-layer basis, so you need all the weights locally. But Apple, for example, made one that pre-selects all experts based on the prompt embedding. That might be scaled up further, e.g. by predicting exactly what's needed.
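To make the difference concrete, here is a rough toy sketch of the two routing styles for a single layer. Everything in it is an assumption for illustration: the router is a random matrix, the "prompt embedding" is just the mean of the token embeddings, and the names (`per_token_experts`, `prompt_level_experts`) are made up. It is not Apple's method or any real model's router.

```python
import random

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def top_k(scores, k):
    # Indices of the k highest-scoring experts.
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]

def per_token_experts(token_embs, router, k):
    """Conventional routing (one layer): every token picks its own top-k experts."""
    needed = set()
    for tok in token_embs:
        scores = [dot(tok, w) for w in router]
        needed.update(top_k(scores, k))
    return needed

def prompt_level_experts(token_embs, router, k):
    """Prompt-level pre-selection: pool the prompt once, pick top-k experts once."""
    dim = len(token_embs[0])
    # Assumption: mean pooling stands in for a real prompt embedding.
    pooled = [sum(t[d] for t in token_embs) / len(token_embs) for d in range(dim)]
    scores = [dot(pooled, w) for w in router]
    return set(top_k(scores, k))

rng = random.Random(0)
NUM_EXPERTS, DIM, K = 16, 8, 2
# Hypothetical router weights and prompt token embeddings (random toy data).
router = [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(NUM_EXPERTS)]
prompt = [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(32)]

token_set = per_token_experts(prompt, router, K)
prompt_set = prompt_level_experts(prompt, router, K)
print("experts touched with per-token routing:", len(token_set))
print("experts touched with prompt-level selection:", len(prompt_set))
```

With per-token routing the set of experts touched grows with the prompt, so in practice you need all of them on disk. With prompt-level selection the set is fixed at K before generation starts, which is what would make "download only what you need" plausible.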

eblanshey · 2026-06-14T02:14:02 1781403242

I don't understand why no labs create dedicated models per industry or area of expertise, e.g. physics, electronics, chemistry, etc. Each model would be much smaller and better suited to running locally. Everyone is trying to cram everything into a single model.

salter2 · 2026-06-13T18:54:40 1781376880

Perhaps something similar to speculative decoding.

Speculating Experts Accelerates Inference for Mixture-of-Experts: https://arxiv.org/abs/2603.19289
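For anyone unfamiliar, plain greedy speculative decoding (not the expert-speculation method from the linked paper) looks roughly like the sketch below: a small draft model proposes a few tokens and the large target model checks them. The two "models" are toy deterministic functions invented for this example, and `target_passes` only counts verification rounds, each of which would be a single batched forward pass in a real system.

```python
def target_next(seq):
    # Hypothetical "large" model: a deterministic toy rule over digits 0-9.
    return (sum(seq) * 31 + len(seq)) % 10

def draft_next(seq):
    # Hypothetical "small" model: agrees with the target except at some positions.
    guess = target_next(seq)
    return (guess + 1) % 10 if len(seq) % 4 == 0 else guess

def greedy_decode(seq, n):
    """Baseline: the target model generates n tokens one at a time."""
    seq = list(seq)
    for _ in range(n):
        seq.append(target_next(seq))
    return seq

def speculative_decode(seq, n, k=3):
    """Draft proposes k tokens per round; target keeps the matching prefix."""
    seq = list(seq)
    goal = len(seq) + n
    target_passes = 0
    while len(seq) < goal:
        # 1) The draft model cheaply proposes k tokens.
        proposal = []
        for _ in range(k):
            proposal.append(draft_next(seq + proposal))
        # 2) The target model verifies them (one round = one pass here).
        target_passes += 1
        for tok in proposal:
            expected = target_next(seq)
            seq.append(expected)  # always keep the target's own choice
            if expected != tok or len(seq) == goal:
                break  # first mismatch ends the round
    return seq, target_passes

start = [1, 2, 3]
baseline = greedy_decode(start, 12)
spec_out, passes = speculative_decode(start, 12)
print(spec_out == baseline, passes)
```

The output is identical to what the large model would have produced alone; the small model only reduces how many times the large one has to run.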

Flere-Imsaho · 2026-06-13T20:02:16 1781380936

Sort of like how ants in a colony produce a working "society" that no individual ant could muster.

pizzao · 2026-05-20T19:53:10 1779306790

Can someone explain to me what their "prompting-scaffolding" is that makes it work?

yusufozkan · 2026-05-20T19:56:48 1779307008

"This is a general-purpose LLM. It wasn’t targeted at this problem or even at mathematics. Also, it’s not a scaffold. We have not pushed this model to the limit on open problems. Our focus is to get it out quickly so that everyone can use it for themselves." - Noam Brown (OpenAI reasoning researcher) on X