Hacker News
matt_d's submissions
1. Tuning LLVM's SLP Vectorizer Cost Model (kaving.me) | 2 points by matt_d 1 hour ago
2. A Friendly Tour of Substructural, Uniqueness, Ownership, Capabilities and more! (federicobruzzone.github.io) | 2 points by matt_d 15 hours ago
3. FlashLib: Bringing Flash Magic to Classical Machine Learning Operators (flashml-org.github.io) | 1 point by matt_d 1 day ago
4. FML-Bench: A Controlled Study of AI Research Agent Strategies (arxiv.org) | 1 point by matt_d 1 day ago
5. Finding deadlocks in CuTe kernels with SPIN (metaworld.me) | 2 points by matt_d 1 day ago
6. A Case for Tracing Based DSL Kernel Languages (metaworld.me) | 2 points by matt_d 1 day ago
7. You don't need all the LLM benchmarks (smola.org) | 4 points by matt_d 2 days ago
8. Elusive order of async GPU kernels: scheduling, abstractions, DSL implications (ianbarber.blog) | 1 point by matt_d 2 days ago
9. MileStone: A Multi-Objective Compiler Phase Ordering Framework (arxiv.org) | 1 point by matt_d 2 days ago
10. SSV: Sparse Speculative Verification for Efficient LLM Inference (arxiv.org) | 4 points by matt_d 4 days ago
11. Characterizing Real-World Bugs in Tile Programs for Automated Bug Detection (arxiv.org) | 2 points by matt_d 4 days ago
12. Characterization of machine learning compilers for LLM inference on NVIDIA GPUs (springer.com) | 3 points by matt_d 4 days ago
13. Chip design from the bottom up – Reiner Pope [video] (youtube.com) | 2 points by matt_d 5 days ago
14. LT2: Linear-Time Looped Transformers (charlesdddd.github.io) | 2 points by matt_d 5 days ago
15. Event Tensor: A Unified Abstraction for Compiling Dynamic Megakernel (arxiv.org) | 6 points by matt_d 5 days ago
16. PopPy: Opportunistically Exploiting Parallelism in Python Compound AI Apps (arxiv.org) | 1 point by matt_d 6 days ago
17. CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs (arxiv.org) | 105 points by matt_d 6 days ago | 12 comments
18. [RFC] Open Access to Standards Documents – LLVM Project (llvm.org) | 6 points by matt_d 7 days ago
19. Curly braces: An evolution of UNIX and C (thalia.dev) | 6 points by matt_d 7 days ago | 2 comments
20. NanoTag: Systems Support for Efficient Byte-Granular Overflow Detection on Arm (github.com/ice-rlab) | 2 points by matt_d 7 days ago
21. InferenceBench: A Benchmark for Open-Ended Inference Optimization by AI Agents (inferencebench.ai) | 2 points by matt_d 7 days ago
22. Tracking Capabilities for Safer Agents (arxiv.org) | 2 points by matt_d 7 days ago
23. Scalable Packed Layouts for Vector-Length-Agnostic ML Code Generation (arxiv.org) | 2 points by matt_d 8 days ago
24. Verifying EDA and compiler optimizations once and for all (samuelcoward.co.uk) | 2 points by matt_d 8 days ago
25. StepStone: LLM-Based GPU Kernel Driver Fuzzing via User-Space Libraries [pdf] (ucr.edu) | 2 points by matt_d 8 days ago
26. Graded Modal Types for Memory and Communication Safety (kent.ac.uk) | 1 point by matt_d 8 days ago
27. Systems Are Changing: The Architect's Role in the Era of Agentic Co-Design (sigarch.org) | 1 point by matt_d 8 days ago
28. Code-Specify-Test-Debug-Prove: Flexibly Integrating Separation Logic [pdf] (cam.ac.uk) | 1 point by matt_d 8 days ago
29. Detecting Relaxed Memory Concurrency Bugs in C and C++ Compilers (lukegeeson.com) | 5 points by matt_d 8 days ago
30. The downgrading semantics of memory safety (Extended version) (arxiv.org) | 2 points by matt_d 8 days ago