Deep Research Reports with Hermes Agent and Qwen3.6-35b-a3b: A Practical Walkthrough

A Reddit user with 15+ years in social research for public bodies details their process for generating deep research reports using Hermes Agent with the qwen3.6-35b-a3b model at Q6_K quantization. The goal was to produce McKinsey-style reports comparable to Perplexity's output. After five hours of continuous processing at ~28 tokens/second on a 12th Gen Intel Core with 32GB RAM and an RTX 4060 running Linux Mint, the agent produced a 21-page report on the current state of AI in Europe, with six loops of iterative refinement including diagnosing problems, fixing them, creating charts, and inserting them — all nearly autonomously.
Key Details
- Model: qwen3.6-35b-a3b Q6_K (quantized), running via Hermes Agent.
- Hardware: 12th Gen Intel Core CPU, 32GB RAM, RTX 4060 GPU, Linux Mint. Achieved ~28 tokens/second.
- Workflow: The user ran six loops over the same document. Each loop: generate draft, diagnose problems, fix issues, add charts, re-insert. The agent used custom skills (provided in the repo) to compensate for the built-in Hermes Agent skill being "lacking."
- Output: Final report in Markdown, DOCX, and PDF formats. All intermediate artifacts (prompts, meta-prompts, Python scripts, charts) are included in the repo.
- Repo contents: Skills, prompts, meta-prompts, Python scripts, intermediate artifacts, and the final report. The README and folder structure were also AI-generated.
- User caveats: Non-native English speaker (not AI-edited). Results described as "quite acceptable" — not excellent but a good starting point for public research use.
Who It's For
Developers and researchers working on AI-powered report generation, especially those in public administration or policy research who want to automate long-form document creation using local LLMs.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Building Vertical Data Layers for OpenClaw Agents
The real opportunity with OpenClaw isn't just using it—it's building industry-specific data layers that connect messy data sources, normalize them into usable schemas, and expose them as clean tool endpoints that return structured JSON.

Building a Persistent Personal OS for Claude: Psychology Profile, Goals, and Live Context Injection via Notion + Shortcuts
A developer built a persistent Personal OS in Notion that injects a compressed 800-word psychological profile, goals, relationships, and live context (location, time, calendar, weather) into every Claude API call via iOS Shortcuts, with a nightly debrief loop to keep context fresh.

Non-developer builds word chain game in one day using Claude AI
A user with zero coding experience created a complete browser game in one session using Claude AI. The word chain game includes a 74k word dictionary, sound effects, design elements, and a mascot.

OpenClaw Personal Assistant Use Cases: Morning Briefings and Behavior Tracking
A Reddit user demonstrates using OpenClaw as a personal assistant for morning briefings with weather, calendar, and task integration, and built a custom smoke tracker skill that logs triggers to SQLite and combines data with calendar and sleep patterns.