LM Studio plugins add web image analysis for vision-capable LLMs

A developer has created plugins for LM Studio that allow vision-capable LLMs to fetch images from the web and analyze them directly within the application. The plugins work without requiring MCP/APIs or registration, using simple scripts that can be installed with one click from the LM Studio website.
Key Features
The main plugin, "analyze-images," enables LLMs to:
- Fetch images from the web for analysis
- Chain tools automatically based on the task
- Convert fetched images into smaller thumbnail files for chat embedding to avoid clutter
- Use full-resolution images for analysis when possible
- Embed images in responses or use markdown table galleries when users request multiple images
The developer also updated existing plugins:
- Duck-Duck-Go plugin now works with images
- Visit Website plugin now works with images
Requirements and Setup
To use these plugins, you need:
- A vision-capable model (Qwen 3.5 9b or 27b are recommended)
- LM Studio with plugin support
The developer shared specific Qwen 3.5 settings that worked well:
Temperature: 1 Top K sampling: 20 Repeat Penalty: 1 Presence Penalty: 1.9 Top P sampling: 0.95 Min P sampling: 0
They noted that the Presence Penalty setting at 1.9 helped fix repetition problems and prevent loops.
The system prompt used was: "You are a capable, thoughtful, and precise assistant. Always prioritize being truthful, nuanced, insightful, and efficient, tailoring your responses specifically to the user's needs and preferences. Research before answering the questions: use both reasoning and tool calls to synthesize a proper conclusion."
Plugin Links
- Analyze Images plugin: https://lmstudio.ai/vadimfedenko/analyze-images
- Duck-Duck-Go reworked: https://lmstudio.ai/vadimfedenko/duck-duck-go-reworked
- Visit Website reworked: https://lmstudio.ai/vadimfedenko/visit-website-reworked
The developer also shared a Jinja Prompt Template on Pastebin that helped fix tool call errors.
📖 Read the full source: r/LocalLLaMA
👀 See Also

oMLX introduces SSD KV caching for Apple Silicon, reducing OpenClaw response times from 30-90 seconds to 5 seconds
oMLX is a new backend that persists KV cache blocks to SSD in safetensors format, preventing cache invalidation when context shifts. This reduces OpenClaw response times from 30-90 seconds down to 5 seconds on subsequent turns.

Real-World Insights on Using OpenClaw with LLMs: Challenges and Limitations
An OpenClaw user describes integration issues with LLMs, citing nonsensical responses from a Discord bot.

the-knowledge-guy: Turn Your Bookshelf Into a Tutor With Claude Code Skills
A Claude Code skill set that ingests your PDF/EPUB books locally and lets you ask questions, get taught topic-by-topic, or pull cheatsheets — all with citations across your library.

Orchino: Local Multi-Agent Orchestration System for Windows with Parallel Browser and UI Automation
Orchino is a local multi-agent orchestration system for Windows that runs parallel browser and Windows tasks without hijacking the UI. A demo shows 4 agents completing 'Search Sony earbuds on Flipkart and Amazon, email the results, save to Notepad' in 29.5 seconds using true parallel execution.