Hacker News
ptrj_ on May 28, 2025 | on: Look Ma, No Bubbles: Designing a Low-Latency Megak...
This could also give a nice speedup for MoE models with 7B-70B total parameters but O(10x) fewer active params, e.g. https://huggingface.co/Qwen/Qwen3-30B-A3B, assuming the expert router can be scheduled effectively within the monolithic megakernel.
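To make the active-vs-total point concrete, here is a rough Python sketch, not taken from the linked post or from any real model code. It assumes a generic top-k router that renormalizes a softmax over the selected experts (one common convention, not necessarily what Qwen3 does), 2 bytes per parameter (bf16), and the 30B total / 3B active figures read off the model name. The function names `top_k_route` and `weight_bytes_per_token` are made up for illustration.

```python
import math

def top_k_route(gate_logits, k):
    """Pick the k highest-scoring experts for one token.

    Returns (expert_index, weight) pairs, with weights from a softmax
    taken over the selected experts only (an assumed convention).
    """
    top = sorted(range(len(gate_logits)),
                 key=lambda i: gate_logits[i], reverse=True)[:k]
    peak = max(gate_logits[i] for i in top)  # subtract max for numerical stability
    exps = [math.exp(gate_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]

def weight_bytes_per_token(params, bytes_per_param=2):
    """Weight bytes read to decode one token, assuming bf16 and no reuse."""
    return params * bytes_per_param

# Figures implied by the model name "Qwen3-30B-A3B" (approximate).
TOTAL_PARAMS = 30e9
ACTIVE_PARAMS = 3e9

# Toy gate output for a 4-expert layer; only 2 experts run for this token.
routes = top_k_route([0.1, 2.0, -1.0, 1.5], k=2)

dense_bytes = weight_bytes_per_token(TOTAL_PARAMS)
moe_bytes = weight_bytes_per_token(ACTIVE_PARAMS)
ratio = dense_bytes / moe_bytes  # how much less weight traffic the MoE needs

print("selected experts:", [i for i, _ in routes])
print("weight traffic ratio (dense / MoE):", ratio)
```

The catch the comment points at is that `routes` is only known at runtime, so a megakernel would have to schedule expert work from a data-dependent choice rather than from a fixed, precomputed instruction sequence.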