GPU-NYU
an archive of posts in this category
| May 15, 2026 | FlashAttention 3 - A Worklog[WIP] |
|---|---|
| Apr 18, 2026 | CuTe DSL - Notes |
| Apr 18, 2026 | CUTLASS WGMMA on Hopper - Notes |
| Apr 14, 2026 | Investigating Flaky `test_eagle_dp` — Batch Invariance Failure on L4 GPUs |
| Mar 29, 2026 | GEMM Kernel Optimization Notes |
| Mar 25, 2026 | SiLU+Mul+FP8 Block Quant Pattern Matching Pipeline - vLLM Notes |
| Mar 25, 2026 | Fused SiLU+Mul+FP8 Block Quantization CUDA Kernel - vLLM Notes |
| Dec 15, 2025 | Reading Notes from Aleksa Gordic's GPU BlogPost |
| Dec 11, 2025 | GPU Notes |
| Sep 23, 2025 | GPU Essentials - A Concise Technical Guide |