Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
Lost in Backpropagation: The LM Head Is a Gradient Bottleneck (arxiv.org)
4 points by famouswaffles 3 days ago | past | discuss
When Models Examine Themselves: Vocabulary-Activation Correspondence (arxiv.org)
1 point by tcbrah 3 days ago | past | discuss
The Controllability Trap: A Governance Framework for Military AI Agents (arxiv.org)
1 point by zvr 3 days ago | past | discuss
Private LLM Inference on Consumer Blackwell GPUs (arxiv.org)
3 points by rohansood15 3 days ago | past | discuss
Some Simple Economics of AGI (arxiv.org)
1 point by aray07 3 days ago | past | discuss
Native CLI scaffolds consistently outper-form OpenCode when using the same model (arxiv.org)
1 point by xdotli 3 days ago | past | 1 comment
We Automated RL Environment Engineering for $10 (arxiv.org)
2 points by milkkarten 4 days ago | past | discuss
Whole-Brain Connectomic Graph Model Enables Whole-Body Locomotion Control in Fly (arxiv.org)
2 points by sosodev 4 days ago | past | discuss
Spacetime Quasicrystals (arxiv.org)
6 points by amai 4 days ago | past | discuss
Cybersecurity AI: Hacking Consumer Robots in the AI Era (2026) (arxiv.org)
2 points by mdelmundo 4 days ago | past | 3 comments
Lost in the Middle at Birth: An Exact Theory of Transformer Context Bias (arxiv.org)
2 points by borundev 4 days ago | past | 2 comments
Tetris Is Hard with Just One Piece Type (arxiv.org)
3 points by bmc7505 4 days ago | past | discuss
Humans can learn to detect AI-generated texts, or at least learn when they can't (arxiv.org)
4 points by bikenaga 5 days ago | past | 1 comment
Practical Type Inference: High‑Throughput Recovery of Real‑World Types (arxiv.org)
1 point by matt_d 5 days ago | past | discuss
Surgical Repair of Collapsed Attention Heads in ALiBi Transformers (arxiv.org)
3 points by palmerschallon 5 days ago | past | 2 comments
OmniCode: A Benchmark for Evaluating Software Development Agents (arxiv.org)
2 points by foma-roje 5 days ago | past | discuss
Idempotent Slices with Applications to Code-Size Reduction (arxiv.org)
2 points by matt_d 5 days ago | past | discuss
Programmable Property-Based Testing (arxiv.org)
1 point by PaulHoule 5 days ago | past | discuss
Fungal Electronics (2021) (arxiv.org)
67 points by byt3h3ad 5 days ago | past | 7 comments
The Controllability Trap: A Governance Framework for Military AI Agents (arxiv.org)
1 point by Anon84 5 days ago | past | discuss
The Token Games: Evaluating Language Model Reasoning with Puzzle Duels (arxiv.org)
2 points by PaulHoule 5 days ago | past | discuss
Covenant-72B: Pre-Training a 72B LLM with Trustless Peers Over-the-Internet (arxiv.org)
5 points by bilsbie 5 days ago | past | 2 comments
I designed a bfloat16/FP8 alternative in a week using LLMs (arxiv.org)
3 points by k1832 5 days ago | past | 4 comments
Has quantum advantage been achieved? (arxiv.org)
4 points by in_a_hole 5 days ago | past | 1 comment
Automatic Pronunciation Error Detection and Correction of the Holy Quran (arxiv.org)
2 points by handfuloflight 5 days ago | past | 1 comment
Towards a Neural Debugger for Python (arxiv.org)
1 point by E-Reverance 6 days ago | past | discuss
Game Modding with GenAI: A Case Study of Stardew Valley Character Maker (arxiv.org)
3 points by azhenley 6 days ago | past | 1 comment
Enzyme as Maxwell's Demon: Steady-State Deviation from Chemical Equilibrium (arxiv.org)
3 points by PaulHoule 6 days ago | past | discuss
Scalable Training of Mixture-of-Experts Models with Megatron Core (arxiv.org)
2 points by matt_d 6 days ago | past | discuss
Inverse Occam's Razor (arxiv.org)
1 point by jerlendds 6 days ago | past | discuss

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: