How to Build an Autonomous Research Agent with C# and Local LLMs

Here's a look at a new autonomous research agent built in C# utilizing local LLMs, specifically Ollama along with the llama3.1:8b model. The agent automates the process of URL handling by generating search queries, conducting web searches through the Brave Search API, and extracting relevant data, all culminating in a structured markdown report.

Key Details

The agent accepts a topic input, for instance, "persistent memory for AI agents".
It autonomously formulates 5-8 search queries.
Searches are executed via the Brave Search API, and the top sources are fetched and analyzed.
The agent reads through 8-12 sources and extracts 5-8 key findings.
All data processing is executed locally using the Ollama (llama3.1:8b) model without reliance on OpenAI/Anthropic APIs.
The output is a markdown report complete with citations.

Performance and Architecture

The current setup runs on a Ryzen 5 5500, CPU-only, with 16GB RAM, taking approximately 15 minutes per research cycle. The developer notes that 3B models, such as llama3.2, are inadequate for tool calling, thereby making 8B a necessary minimum for reliable performance.

Key challenges include the need for findings truncation before synthesis to prevent the model from stalling on lengthy contexts, and occasional malformed tool calls even with 8B models, resolved by retrying with altered prompts. The agent utilizes SQLite paired with embeddings for managing memory at a personal scale, obviating the need for a vector database.

Technology Stack

C# / .NET 8
Ollama
SQLite
Brave Search API (free tier)

For developers interested in building their own agent, there's a starter kit and an 8-chapter guide available on the project's GitHub repository, provided under the MIT license, along with the full source code: hex-dynamics.

📖 Read the full source: r/LocalLLaMA

Building an Autonomous Research Agent with C# and Local LLMs

Key Details

Performance and Architecture

Technology Stack

👀 See Also

Orloj: Declarative Orchestration Runtime for Multi-Agent AI Systems

Pleng: Self-Hosted Cloud Platform with AI-Driven Infrastructure Management

Codebook Lossless LLM Compression: 10-25% RAM Reduction with Bitwise Packing

Terminal-Based 3D Renderer Built with Multi-Agent Claude Code System