Feynman: Open Source Research Agent with Paper-Codebase Audit Tool

What Feynman Does
Feynman is an open source research agent CLI that handles research questions through a multi-agent architecture. When you ask a research question, it dispatches four subagents in parallel:
- Researcher: Searches papers and web
- Reviewer: Runs simulated peer review with severity grading
- Writer: Produces structured output
- Verifier: Checks every citation and kills dead links
Key Features from Source
The standout feature mentioned in the source is the audit tool: Feynman audit [arxiv-id] pulls a paper's claims and compares them against the actual public codebase. This addresses the common problem of wondering whether published code actually implements what's claimed in the paper.
Other capabilities include:
- Experiment replication on local or cloud GPUs via modal/runpod
- Literature reviews showing consensus vs disagreements vs open questions
- Deep research mode with multi-agent parallel investigation
- Option to install just the research skills into Claude Code or Codex without the full terminal app
Technical Details
- One command installation
- MIT license
- Built on pi for the agent runtime
- Uses alphaxiv for paper search
- 2.3k stars on GitHub at time of source publication
- Launch tweet received 2,768 bookmarks from an account with 1,400 followers
The architecture specifically addresses hallucination issues common in AI research tools by dedicating an entire agent to catching incorrect citations before they reach the user.
📖 Read the full source: r/LocalLLaMA
👀 See Also

harshal-mcp-proxy Now on npm: Single Daemon Replaces 12 MCP Server Configs
harshal-mcp-proxy is now available as a 54 kB npm package. Install globally, run as a daemon, and replace 12 separate MCP server configs with 6 tools, saving ~2.7 GB RAM and ~50K tokens per session.

OpenAlly: Local AI Assistant for Android with Phone Control
OpenAlly is an Android app that runs an AI assistant locally on your phone via an embedded Node.js process, with 51 built-in skills and phone control capabilities through Aster companion. It connects to 19+ messaging platforms and supports 18 model providers with your own API keys.

Free macOS Menu Bar Monitor for Claude AI Usage Limits
A free, open-source macOS tool that displays Claude AI's session usage percentage, weekly limits, and reset countdowns directly in your menu bar. Installs with a single Homebrew command and piggybacks on existing Chrome sessions without storing data.

GitVelocity: AI Scoring of 50k PRs Reveals Insights on Code Complexity
GitVelocity uses Claude to score merged pull requests 0-100 across six dimensions: scope, architecture, implementation, risk, quality, and performance/security. After analyzing 50,000+ PRs across TypeScript, Python, Rust, Go, Java, and Elixir, the team found surprising patterns about PR size, test coverage, and AI adoption.