Dangerously Skip Reading Code: When LLMs Write Code Faster Than You Can Read It

The premise is simple: what if we stop reading LLM-generated code altogether? Treat it like assembly, bytecode, or transpiled JavaScript — the high-level language source becomes another form of machine code. This idea comes from Thoughtworks' retreat report and Facundo Olano's blog post.
Why this makes sense
LLMs produce non-deterministic output and generate code far faster than humans can read. Reviewing every diff is no longer viable. Instead of giving up on rigor, move it elsewhere: specifications and tests.
Organizational prerequisite
This is not an individual or team decision — it must be organizational. Amdahl's law applies: maximizing code generation speed without restructuring processes yields no real gains. You can't have some devs pumping 20k lines of slop a day while others still read and approve it.
Requirements include:
- Remove humans-in-the-loop, reduce coordination and gate-keeping
- Virtually infinite supply of requirements, engineers owning streams of work autonomously
- Rework is almost free, so don't prevent incorrect work — detect via specs/tests
Proposed workflow
Use a standardized Markdown specification as the new unit of knowledge. Product owners and engineers collaborate on the spec and test cases for business rules. Check these into the repository alongside the implementing code.
Automated pull-request checks verify:
- Tests pass
- Code conforms to the spec
The spec — not the code — is what the team understands, reviews, and is accountable for.
Key distinction
Specifications are not prompts. Tests are not TDD. This is about rigor shifted to the contract layer, not the implementation layer.
📖 Read the full source: HN AI Agents
👀 See Also

Why Anthropic's Activation Steering Struggles with Generating Valid JSON
Activation steering, a technique used for AI safety, fails to generate valid JSON, achieving only 24.4% validity compared to 86.8% from the untrained base model.

Qwen3.5-122B on Blackwell SM120: fp8 KV Cache Corruption Issue and Performance Findings
Testing Qwen3.5-122B on 8x RTX PRO 6000 Blackwell hardware revealed that fp8_e4m3 KV cache silently produces corrupt output without errors, requiring bf16 KV cache instead. MTP optimization provided a 2.75x single-request speedup while DeltaNet constraints blocked other optimizations.

Claude App Ranks Second in US App Store After Pentagon Dispute
Anthropic's Claude chatbot app rose to number two among free apps in Apple's US App Store, climbing from outside the top 100 in late January to second place by late February 2026. This surge followed the company's public negotiations with the Pentagon over AI usage restrictions.

Claude Code Source Leak Reveals Anti-Distillation, Undercover Mode, and Frustration Detection
A leaked source code map file from Claude Code's npm package reveals anti-distillation techniques using fake tools, an undercover mode that hides AI authorship, and frustration detection via regex patterns.