systems-engineering
an archive of posts with this tag
| Mar 29, 2026 | Scaling LLMs: MoE Routing & JAX Parallelism on TPU |
|---|---|
| Feb 07, 2026 | Sharding Strategies: The Art of Distributed Matrix Multiplication |
| Feb 04, 2026 | TPU Architecture: Understanding the Bandwidth Hierarchy |
| Feb 03, 2026 | Roofline Analysis: When Does Your Model Hit the Wall? |
| Feb 02, 2026 | Scaling LLMs: From Alchemy to Science (Part 0) |