All Local LLM Tools — In a Single .exe File
LLM Runner AIO is a comprehensive, self-contained desktop application that bundles all the tools you need to run local AI models on your own hardware. No complex setup, no dependency hell — just download, run, and start chatting with AI locally.
Beautiful web interface for chatting with local LLMs
High-performance C++ inference engine
Privacy-respecting metasearch engine
AI-powered browser automation tool
2.5 GB .exe with all dependencies included
Open WebUI, llama.cpp, Vane, SearXNG
No installs needed — Python and Node.js are bundled
Runs in system tray on startup
Turkish, English, Spanish, German, French, Portuguese, Chinese, Japanese
Hardware detection, port config, auto-start

┌─────────────────────────────────────────────┐ │ LLM Runner AIO Launcher │ ├─────────────────────────────────────────────┤ │ ┌──────────┐ ┌──────────┐ │ │ │ SearXNG │ │ llama.cpp│ │ │ │ :8080 │ │ :8000 │ │ │ └──────────┘ └──────────┘ │ │ ┌──────────┐ ┌──────────┐ │ │ │OpenWebUI │ │ Vane │ │ │ │ :3000 │ │ :3001 │ │ │ └──────────┘ └──────────┘ │ └─────────────────────────────────────────────┘
http://localhost:3000http://localhost:8080http://localhost:3001Download LLM-Runner-AIO.exe (2.5 GB) from Hugging Face
Double-click to execute — no installation needed
Access Open WebUI at http://localhost:3000
| Requirement | Minimum | Recommended |
|---|---|---|
| OS | Windows 10/11 | Windows 11 |
| RAM | 8 GB | 16 GB+ |
| GPU | CPU Only | NVIDIA RTX 3060+ |
| VRAM | N/A | 4 GB+ |
| Python | 3.11 (bundled) | 3.11 (bundled) |
Note: Python 3.11 and Node.js are bundled inside the executable. No separate installation needed!
This project would not be possible without the incredible work of:
Download LLM Runner AIO and start running local LLMs instantly.