All-in-One Package

LLM Runner AIO

All Local LLM Tools — In a Single .exe File

Download LLM-Runner-AIO.exe (2.5 GB)Download .RAR (2.3 GB)

Overview

LLM Runner AIO is a comprehensive, self-contained desktop application that bundles all the tools you need to run local AI models on your own hardware. No complex setup, no dependency hell — just download, run, and start chatting with AI locally.

What is Included

Open WebUI

MIT

Beautiful web interface for chatting with local LLMs

llama.cpp

MIT

High-performance C++ inference engine

SearXNG

GPLv3

Privacy-respecting metasearch engine

Vane

Open Source

AI-powered browser automation tool

Features

Single File Deployment

2.5 GB .exe with all dependencies included

4 Core Services

Open WebUI, llama.cpp, Vane, SearXNG

Automatic Setup

No installs needed — Python and Node.js are bundled

Windows Integration

Runs in system tray on startup

8 Language Support

Turkish, English, Spanish, German, French, Portuguese, Chinese, Japanese

Advanced Settings

Hardware detection, port config, auto-start

How It Works

┌─────────────────────────────────────────────┐
│           LLM Runner AIO Launcher           │
├─────────────────────────────────────────────┤
│  ┌──────────┐  ┌──────────┐                │
│  │ SearXNG  │  │ llama.cpp│                │
│  │ :8080    │  │ :8000    │                │
│  └──────────┘  └──────────┘                │
│  ┌──────────┐  ┌──────────┐                │
│  │OpenWebUI │  │  Vane    │                │
│  │ :3000    │  │ :3001    │                │
│  └──────────┘  └──────────┘                │
└─────────────────────────────────────────────┘

llama.cpp runs the AI model inference engine
Open WebUI provides the chat interface at http://localhost:3000
SearXNG enables local web search at http://localhost:8080
Vane handles browser automation at http://localhost:3001

Installation

Download

Download LLM-Runner-AIO.exe (2.5 GB) from Hugging Face

Run

Double-click to execute — no installation needed

Ready

Access Open WebUI at http://localhost:3000

System Requirements

Requirement	Minimum	Recommended
OS	Windows 10/11	Windows 11
RAM	8 GB	16 GB+
GPU	CPU Only	NVIDIA RTX 3060+
VRAM	N/A	4 GB+
Python	3.11 (bundled)	3.11 (bundled)

Note: Python 3.11 and Node.js are bundled inside the executable. No separate installation needed!

Data Privacy

No Cloud Dependencies — Everything runs locally
No Telemetry — No data is sent anywhere
Local Database — Chat history stored only on your machine
No Account Required — No registration or login needed

Supported Languages

🇹🇷 Turkish🇬🇧 English🇪🇸 Spanish🇩🇪 German🇫🇷 French🇵🇹 Portuguese🇨🇳 Chinese🇯🇵 Japanese

Credits

This project would not be possible without the incredible work of:

• Georgi Gerganov — llama.cpp
• Open WebUI Team — Open WebUI
• SearXNG Contributors — SearXNG
• All open-source contributors who make local AI accessible

Ready to Get Started?

Download LLM Runner AIO and start running local LLMs instantly.

Download .exe (2.5 GB)Download .RAR (2.3 GB)

Features

Single File Deployment

2.5 GB .exe with all dependencies included

4 Core Services

Open WebUI, llama.cpp, Vane, SearXNG

Automatic Setup

No installs needed — Python and Node.js are bundled

Windows Integration

Runs in system tray on startup

8 Language Support

Turkish, English, Spanish, German, French, Portuguese, Chinese, Japanese

Advanced Settings

Hardware detection, port config, auto-start

How It Works

┌─────────────────────────────────────────────┐
│           LLM Runner AIO Launcher           │
├─────────────────────────────────────────────┤
│  ┌──────────┐  ┌──────────┐                │
│  │ SearXNG  │  │ llama.cpp│                │
│  │ :8080    │  │ :8000    │                │
│  └──────────┘  └──────────┘                │
│  ┌──────────┐  ┌──────────┐                │
│  │OpenWebUI │  │  Vane    │                │
│  │ :3000    │  │ :3001    │                │
│  └──────────┘  └──────────┘                │
└─────────────────────────────────────────────┘

llama.cpp runs the AI model inference engine
Open WebUI provides the chat interface at http://localhost:3000
SearXNG enables local web search at http://localhost:8080
Vane handles browser automation at http://localhost:3001

Requirement

Minimum

Recommended

Windows 10/11

Windows 11

RAM

8 GB

16 GB+

GPU

CPU Only

NVIDIA RTX 3060+

VRAM

N/A

4 GB+

Python

3.11 (bundled)

Data Privacy

No Cloud Dependencies — Everything runs locally
No Telemetry — No data is sent anywhere
Local Database — Chat history stored only on your machine
No Account Required — No registration or login needed

Supported Languages

🇹🇷 Turkish🇬🇧 English🇪🇸 Spanish🇩🇪 German🇫🇷 French🇵🇹 Portuguese🇨🇳 Chinese🇯🇵 Japanese