| 1. | | Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon (dao-lab.ai) |
| 2 points by matt_d 3 hours ago | past | discuss |
|
| 2. | | Compiler as a Service: C++ Goes Live – Interactive C++, interop, and beyond [video] (youtube.com) |
| 2 points by matt_d 6 hours ago | past | 1 comment |
|
| 3. | | Measuring AI Ability to Complete Long Software Tasks (muratbuffalo.blogspot.com) |
| 4 points by matt_d 7 hours ago | past | 1 comment |
|
| 4. | | 6o6 v1.1: Faster 6502-on-6502 virtualization for a C64/Apple II Apple-1 emulator (oldvcr.blogspot.com) |
| 3 points by matt_d 20 hours ago | past | discuss |
|
| 5. | | uops.info Update: Emerald Rapids, Meteor Lake, Arrow Lake, and Zen 5 (uops.info) |
| 3 points by matt_d 21 hours ago | past | discuss |
|
| 6. | | MXFP8 GEMM: Up to 99% of cuBLAS Performance Using CUDA and PTX (danielvegamyhre.github.io) |
| 2 points by matt_d 21 hours ago | past | discuss |
|
| 7. | | PyTorch Autograd and Mutation (ezyang.com) |
| 4 points by matt_d 22 hours ago | past | discuss |
|
| 8. | | The Future of Python: Evolution or Succession – Brett Slatkin – PyCascades 2026 [video] (youtube.com) |
| 3 points by matt_d 2 days ago | past | discuss |
|
| 9. | | SlopCodeBench: Benchmarking How Coding Agents Degrade over Long-Horizon Tasks (scbench.ai) |
| 2 points by matt_d 2 days ago | past | discuss |
|
| 10. | | AutoRocq: Agentic Theorem Prover for Verification (github.com/nus-program-verification) |
| 2 points by matt_d 3 days ago | past | discuss |
|
| 11. | | Wax: Optimizing Data Center Applications with Stale Profile (github.com/ice-rlab) |
| 1 point by matt_d 3 days ago | past | discuss |
|
| 12. | | Dijkstra's Shortest-Path Algorithm: A visual exploration, following Sedgewick (joshmpollock.com) |
| 2 points by matt_d 3 days ago | past | 1 comment |
|
| 13. | | Speculative Decoding: Performance or Illusion? (specdecode-bench.github.io) |
| 3 points by matt_d 3 days ago | past | discuss |
|
| 14. | | Goedel-Code-Prover: Hierarchical Proof Search for Open SotA Code Verification (goedelcodeprover.github.io) |
| 3 points by matt_d 3 days ago | past | discuss |
|
| 15. | | MLSys 2026 Papers (mlsys.org) |
| 1 point by matt_d 4 days ago | past | discuss |
|
| 16. | | An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU (arxiv.org) |
| 3 points by matt_d 4 days ago | past | discuss |
|
| 17. | | Specula: A framework for finding deep bugs in system code using TLA+ (github.com/specula-org) |
| 3 points by matt_d 5 days ago | past | discuss |
|
| 18. | | Equality Saturation for Optimizing High-Level Julia IR (acm.org) |
| 2 points by matt_d 5 days ago | past | 2 comments |
|
| 19. | | UniTe: A Universal Tensor Abstraction for Capturing Spatial Relationships (acm.org) |
| 2 points by matt_d 5 days ago | past | discuss |
|
| 20. | | Co-Design of B+-Tree Index with Emerging Zone Interfaces for Small KV Pairs (acm.org) |
| 4 points by matt_d 5 days ago | past | discuss |
|
| 21. | | CounterPoint: Using Hardware Counters to Refute and Refine µarch Assumptions (arxiv.org) |
| 2 points by matt_d 5 days ago | past | discuss |
|
| 22. | | PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost (arxiv.org) |
| 2 points by matt_d 6 days ago | past | discuss |
|
| 23. | | SysMoBench: Evaluating AI on Formally Modeling Complex Real-World Systems (muratbuffalo.blogspot.com) |
| 5 points by matt_d 6 days ago | past | discuss |
|
| 24. | | What Is Coordination, Really? (jhellerstein.github.io) |
| 2 points by matt_d 6 days ago | past | discuss |
|
| 25. | | Idempotent Slices with Applications to Code-Size Reduction (arxiv.org) |
| 1 point by matt_d 6 days ago | past | discuss |
|
| 26. | | Microsoft Rust Training Books: Beginner, advanced, expert level material (github.com/microsoft) |
| 3 points by matt_d 7 days ago | past | discuss |
|
| 27. | | LUMINA: LLM-Guided GPU Architecture Exploration via Bottleneck Analysis (arxiv.org) |
| 3 points by matt_d 7 days ago | past | discuss |
|
| 28. | | Challenges and Design Issues in Finding CUDA Bugs via GPU-Native Fuzzing (arxiv.org) |
| 2 points by matt_d 7 days ago | past | discuss |
|
| 29. | | SEVI: Silent Data Corruption of Vector Instructions in Hyper-Scale Datacenters (acm.org) |
| 1 point by matt_d 7 days ago | past | discuss |
|
| 30. | | CrypTorch: PyTorch-based Auto-tuning Compiler for ML w/ Multi-party Computation (github.com/psu-paws) |
| 2 points by matt_d 8 days ago | past | discuss |
|