Hacker Newsnew | past | comments | ask | show | jobs | submit | matt_d's submissionslogin
1.Challenges and Design Issues in Finding CUDA Bugs via GPU-Native Fuzzing (arxiv.org)
1 point by matt_d 56 minutes ago | past | discuss
2.SEVI: Silent Data Corruption of Vector Instructions in Hyper-Scale Datacenters (acm.org)
1 point by matt_d 5 hours ago | past | discuss
3.CrypTorch: PyTorch-based Auto-tuning Compiler for ML w/ Multi-party Computation (github.com/psu-paws)
2 points by matt_d 1 day ago | past | discuss
4.SOL-ExecBench: Speed-of-Light Benchmarking for Real-World GPU Kernels (arxiv.org)
3 points by matt_d 2 days ago | past | discuss
5.Tony Hoare and His Imprint on Computer Science (acm.org)
7 points by matt_d 2 days ago | past | 1 comment
6.The End of Dijkstra's Algorithm? Breaking the Sorting Barrier for Shortest Paths [video] (youtube.com)
2 points by matt_d 3 days ago | past | discuss
7.AlgoVeri: An Aligned Benchmark for Verified Code Gen. On Classical Algorithms (arxiv.org)
2 points by matt_d 3 days ago | past | discuss
8.Specy: Learning Specifications for Distributed Systems from Event Traces [pdf] (princeton.edu)
2 points by matt_d 3 days ago | past | discuss
9.Generalized Dot-Product Attention: Tackling Real-World Challenges in GPU Kernels (pytorch.org)
1 point by matt_d 3 days ago | past | discuss
10.M^2RNN: Non-Linear RNNs with Matrix-Valued States for Scalable Language Modeling (arxiv.org)
2 points by matt_d 3 days ago | past | discuss
11.Tools of the Trade: C2C Activation Offloading on Grace Blackwell (poolside.ai)
1 point by matt_d 3 days ago | past | discuss
12.EsoLang-Bench: Evaluating Genuine Reasoning in LLMs via Esoteric Languages (esolang-bench.vercel.app)
97 points by matt_d 3 days ago | past | 58 comments
13.Speed-Of-Light ExecBench: A benchmark of real-world DL kernel problems (github.com/nvidia)
1 point by matt_d 3 days ago | past | discuss
14.Equality Saturation and Symbolic Regression (egraphs.org)
2 points by matt_d 4 days ago | past | discuss
15.NCCL EP: Towards a Unified Expert Parallel Communication API for NCCL (arxiv.org)
3 points by matt_d 4 days ago | past | discuss
16.Vectorization of Verilog Designs and its Effects on Verification and Synthesis (arxiv.org)
33 points by matt_d 4 days ago | past | 6 comments
17.LATTE ’26: Workshop on Languages, Tools, and Techniques for Accelerator Design (cornell.edu)
2 points by matt_d 4 days ago | past | discuss
18.Read Less, Steer More (ezyang.com)
4 points by matt_d 4 days ago | past | discuss
19.The Data Structures of Roads (sandboxspirit.com)
2 points by matt_d 4 days ago | past | discuss
20.Verifying Move Borrow Checker in Lean:An Experiment in AI-Assisted PL Metatheory (proofsandintuitions.net)
4 points by matt_d 5 days ago | past | 1 comment
21.Real or Slop? – Programming Languages Papers Edition (zackg.me)
6 points by matt_d 5 days ago | past | 2 comments
22.Mamba-3 (together.ai)
298 points by matt_d 5 days ago | past | 55 comments
23.EvoX: Letting AI Evolve Its Own Evolution Process (skydiscover-ai.github.io)
1 point by matt_d 6 days ago | past | discuss
24.Native DSLs Ops in PyTorch (ianbarber.blog)
1 point by matt_d 6 days ago | past | discuss
25.Flash-KMeans: Fast and Memory-Efficient Exact K-Means (arxiv.org)
184 points by matt_d 6 days ago | past | 14 comments
26.Gluon: Explicit Performance (lei.chat)
22 points by matt_d 7 days ago | past | discuss
27.Block Number Formats are (Still!) Direction Preservers (constantinides.net)
2 points by matt_d 8 days ago | past | discuss
28.cuTile Rust: a safe, tile-based kernel programming DSL for Rust (github.com/nvlabs)
4 points by matt_d 9 days ago | past | discuss
29.KernelBlaster: A framework for in context learning for code optimization (github.com/nvlabs)
1 point by matt_d 9 days ago | past | discuss
30.Demystifying and Improving Lazy Promotion in Cache Eviction [pdf] (vldb.org)
1 point by matt_d 9 days ago | past | discuss

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: