From Replit to Local: How One Developer Used Claude to Build StillHere, an API-Powered AI Companion Chat App

One developer shared on r/ClaudeAI how they built StillHere.ink, a chat app tailored for AI companion conversations, using Claude as their coding agent. The project started on Replit but hit limitations, and the developer eventually moved to a local workflow with Claude Cowork, which they described as being “Claude’s manager.”
Key Details from the Build
- Origin: Started with a Replit vibe-coding template for a simple API chat app with memory. As features grew, Replit Agent struggled with tasks like adding new models.
- Workflow shift: Downloaded Replit files locally, edited them with Claude, then copied updated files back to Replit. This unblocked further development.
- User’s role: The developer handles testing, design, features, community, App Store setup, debugging, screenshots, and “crying when Replit Agent breaks something.”
- App purpose: StillHere is designed for long-running AI companion conversations, using the user’s own API keys for OpenAI, OpenRouter, etc.
- Features: Memory, diary-style conversation summaries, rolling summaries, RAG/context tools, model switching, image generation, text-to-speech, custom companion settings, imports/exports, and projects.
- Cost management: Tools to keep API costs down: rolling summaries, RAG, context controls, model choice. The developer reported spending ~$20 on OpenAI and ~$20 on OpenRouter over two months. Their favorite model, Qwen3 235B Instruct, cost only $1.43 total.
- Privacy: Data is encrypted at rest. Not end-to-end encrypted because the app needs to process conversations for memory, summaries, and API calls. Messages are sent to the user’s chosen API providers.
- Availability: Free to use, optional donations. Web app at stillhere.ink, works in browser or installable to phone home screen. Google Play version in development.
Who This Is For
Developers interested in building or using a self-hosted-style AI chat app with companion features, or those hitting limits with Replit’s vibe coding and looking for a local Claude-driven workflow.
📖 Read the full source: r/ClaudeAI
👀 See Also

llmLibrarian: Local RAG Engine with MCP Integration for File-Based AI Search
llmLibrarian is a local RAG engine that exposes retrieval over MCP, allowing AI agents like Claude to query indexed files. It uses ChromaDB collections for organization, Ollama for synthesis, and keeps everything on-device.

PaperclipAI: Open-source orchestration for zero-human companies
PaperclipAI is an open-source orchestration framework designed for fully automated companies. The project gained 14,000 GitHub stars in its first week of existence.

Open-source MCP server enables AI agents to handle L402 payments via Lightning Network
A Python MCP plugin built with FastMCP intercepts HTTP 402 Payment Required responses, pays Lightning Network invoices, and retrieves data for AI agents. The repository includes a local dummy-agent for testing without spending real funds.

Claude Auto-Continue: Chrome extension automates tool-use limit interruptions
A developer built a free Chrome extension that automatically clicks 'Continue' when Claude hits its tool-use limit after roughly 20 tool calls, eliminating manual interruptions during agentic workflows. The extension includes optional token minimization and works across all tabs and windows.