Hacker News | matt_d's submissions
1. Tuning LLVM's SLP Vectorizer Cost Model (kaving.me)
2 points by matt_d 1 hour ago
2. A Friendly Tour of Substructural, Uniqueness, Ownership, Capabilities and more! (federicobruzzone.github.io)
2 points by matt_d 15 hours ago
3. FlashLib: Bringing Flash Magic to Classical Machine Learning Operators (flashml-org.github.io)
1 point by matt_d 1 day ago
4. FML-Bench: A Controlled Study of AI Research Agent Strategies (arxiv.org)
1 point by matt_d 1 day ago
5. Finding deadlocks in CuTe kernels with SPIN (metaworld.me)
2 points by matt_d 1 day ago
6. A Case for Tracing Based DSL Kernel Languages (metaworld.me)
2 points by matt_d 1 day ago
7. You don't need all the LLM benchmarks (smola.org)
4 points by matt_d 2 days ago
8. Elusive order of async GPU kernels: scheduling, abstractions, DSL implications (ianbarber.blog)
1 point by matt_d 2 days ago
9. MileStone: A Multi-Objective Compiler Phase Ordering Framework (arxiv.org)
1 point by matt_d 2 days ago
10. SSV: Sparse Speculative Verification for Efficient LLM Inference (arxiv.org)
4 points by matt_d 4 days ago
11. Characterizing Real-World Bugs in Tile Programs for Automated Bug Detection (arxiv.org)
2 points by matt_d 4 days ago
12. Characterization of machine learning compilers for LLM inference on NVIDIA GPUs (springer.com)
3 points by matt_d 4 days ago
13. Chip design from the bottom up – Reiner Pope [video] (youtube.com)
2 points by matt_d 5 days ago
14. LT2: Linear-Time Looped Transformers (charlesdddd.github.io)
2 points by matt_d 5 days ago
15. Event Tensor: A Unified Abstraction for Compiling Dynamic Megakernel (arxiv.org)
6 points by matt_d 5 days ago
16. PopPy: Opportunistically Exploiting Parallelism in Python Compound AI Apps (arxiv.org)
1 point by matt_d 6 days ago
17. CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs (arxiv.org)
105 points by matt_d 6 days ago | 12 comments
18. [RFC] Open Access to Standards Documents – LLVM Project (llvm.org)
6 points by matt_d 7 days ago
19. Curly braces: An evolution of UNIX and C (thalia.dev)
6 points by matt_d 7 days ago | 2 comments
20. NanoTag: Systems Support for Efficient Byte-Granular Overflow Detection on Arm (github.com/ice-rlab)
2 points by matt_d 7 days ago
21. InferenceBench: A Benchmark for Open-Ended Inference Optimization by AI Agents (inferencebench.ai)
2 points by matt_d 7 days ago
22. Tracking Capabilities for Safer Agents (arxiv.org)
2 points by matt_d 7 days ago
23. Scalable Packed Layouts for Vector-Length-Agnostic ML Code Generation (arxiv.org)
2 points by matt_d 8 days ago
24. Verifying EDA and compiler optimizations once and for all (samuelcoward.co.uk)
2 points by matt_d 8 days ago
25. StepStone: LLM-Based GPU Kernel Driver Fuzzing via User-Space Libraries [pdf] (ucr.edu)
2 points by matt_d 8 days ago
26. Graded Modal Types for Memory and Communication Safety (kent.ac.uk)
1 point by matt_d 8 days ago
27. Systems Are Changing: The Architect's Role in the Era of Agentic Co-Design (sigarch.org)
1 point by matt_d 8 days ago
28. Code-Specify-Test-Debug-Prove: Flexibly Integrating Separation Logic [pdf] (cam.ac.uk)
1 point by matt_d 8 days ago
29. Detecting Relaxed Memory Concurrency Bugs in C and C++ Compilers (lukegeeson.com)
5 points by matt_d 8 days ago
30. The downgrading semantics of memory safety (Extended version) (arxiv.org)
2 points by matt_d 8 days ago