Local AI Hub
LLM ConfigsLLM Runner AIORankingsCoding AgentsLLM RunnersLLM Web UIMultimodal☕Support
Buy Me a Coffee

© 2026 Local AI Hub. All rights reserved.

All-in-One Package

LLM Runner AIO

All Local LLM Tools — In a Single .exe File

Download LLM-Runner-AIO.exe (2.5 GB)Download .RAR (2.3 GB)
Download Node.jsDownload Python 3.11
View on GitHub|Hugging Face Page

Overview

LLM Runner AIO is a comprehensive, self-contained desktop application that bundles all the tools you need to run local AI models on your own hardware. No complex setup, no dependency hell — just download, run, and start chatting with AI locally.

What is Included

Open WebUI

MIT

Beautiful web interface for chatting with local LLMs

llama.cpp

MIT

High-performance C++ inference engine

SearXNG

GPLv3

Privacy-respecting metasearch engine

Vane

Open Source

AI-powered browser automation tool

Features

Single File Deployment

2.5 GB .exe with all dependencies included

4 Core Services

Open WebUI, llama.cpp, Vane, SearXNG

Automatic Setup

No installs needed — Python and Node.js are bundled

Windows Integration

Runs in system tray on startup

8 Language Support

Turkish, English, Spanish, German, French, Portuguese, Chinese, Japanese

Advanced Settings

Hardware detection, port config, auto-start

How It Works

LLM Runner AIO Screenshot
┌─────────────────────────────────────────────┐
│           LLM Runner AIO Launcher           │
├─────────────────────────────────────────────┤
│  ┌──────────┐  ┌──────────┐                │
│  │ SearXNG  │  │ llama.cpp│                │
│  │ :8080    │  │ :8000    │                │
│  └──────────┘  └──────────┘                │
│  ┌──────────┐  ┌──────────┐                │
│  │OpenWebUI │  │  Vane    │                │
│  │ :3000    │  │ :3001    │                │
│  └──────────┘  └──────────┘                │
└─────────────────────────────────────────────┘
  • llama.cpp runs the AI model inference engine
  • Open WebUI provides the chat interface at http://localhost:3000
  • SearXNG enables local web search at http://localhost:8080
  • Vane handles browser automation at http://localhost:3001

Installation

1

Download

Download LLM-Runner-AIO.exe (2.5 GB) from Hugging Face

2

Run

Double-click to execute — no installation needed

3

Ready

Access Open WebUI at http://localhost:3000

System Requirements

RequirementMinimumRecommended
OSWindows 10/11Windows 11
RAM8 GB16 GB+
GPUCPU OnlyNVIDIA RTX 3060+
VRAMN/A4 GB+
Python3.11 (bundled)3.11 (bundled)

Note: Python 3.11 and Node.js are bundled inside the executable. No separate installation needed!

Data Privacy

  • No Cloud Dependencies — Everything runs locally
  • No Telemetry — No data is sent anywhere
  • Local Database — Chat history stored only on your machine
  • No Account Required — No registration or login needed

Supported Languages

🇹🇷 Turkish🇬🇧 English🇪🇸 Spanish🇩🇪 German🇫🇷 French🇵🇹 Portuguese🇨🇳 Chinese🇯🇵 Japanese

Credits

This project would not be possible without the incredible work of:

  • • Georgi Gerganov — llama.cpp
  • • Open WebUI Team — Open WebUI
  • • SearXNG Contributors — SearXNG
  • • All open-source contributors who make local AI accessible

Ready to Get Started?

Download LLM Runner AIO and start running local LLMs instantly.

Download .exe (2.5 GB)Download .RAR (2.3 GB)