
Local vs Cloud AI: A Practical Comparison

6 min read

The debate between local and cloud AI matters more than most people realize. Where your AI runs determines what it can access, how fast it responds, and who sees your data. Vox takes a fully local approach — here's how it works and why.

The spectrum

AI systems fall on a spectrum from fully local to fully cloud:

Fully Local
  • All processing on your device
  • Complete privacy
  • No internet required for inference
  • Limited by device hardware
  • Smaller models (7B-70B parameters)
  • Examples: Ollama, LM Studio
Fully Cloud
  • All processing on remote servers
  • Data sent to provider
  • Requires internet connection
  • Unlimited compute power
  • Largest models (400B+ parameters)
  • Examples: ChatGPT, Claude web

How Vox approaches this

Vox is a fully local system. Everything runs on your Mac — the AI model, your data, and all tool execution:

Vox — Processing Architecture
Wake word detection → LOCAL (on-device ONNX model)
Speech-to-text → LOCAL (on-device processing)
AI reasoning → LOCAL (Qwen3 via llama.cpp)
Tool execution → LOCAL (your machine)
Knowledge base → LOCAL (SQLite on your machine)
File operations → LOCAL (direct filesystem)

What stays local

  • Wake word detection — ONNX model runs on-device, audio never leaves your computer
  • Knowledge base — Indexed in local SQLite with FTS5, searchable locally
  • File operations — All file management executes on your machine
  • Code execution — Scripts run directly on your machine
  • Document creation — Files generated and saved locally
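
The knowledge-base point above hinges on SQLite's built-in FTS5 full-text index, which runs entirely inside a local database file. Here is a minimal sketch of that idea; the table and column names are illustrative, not Vox's actual schema:

```python
import sqlite3

# An in-memory database stands in for a local knowledge-base file.
conn = sqlite3.connect(":memory:")

# FTS5 virtual tables index their columns for full-text search.
conn.execute("CREATE VIRTUAL TABLE notes USING fts5(title, body)")
conn.executemany(
    "INSERT INTO notes (title, body) VALUES (?, ?)",
    [
        ("Meeting notes", "Discussed the quarterly roadmap and hiring plan"),
        ("Recipe", "Slow-cooked tomato sauce with garlic and basil"),
    ],
)

# MATCH evaluates the full-text query locally; nothing touches the network.
rows = conn.execute(
    "SELECT title FROM notes WHERE notes MATCH ? ORDER BY rank",
    ("roadmap",),
).fetchall()
print(rows)  # [('Meeting notes',)]
conn.close()
```

Because the index lives in the same file as the data, searching a knowledge base this way needs no server and no internet connection.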

What about the cloud?

Vox does not use cloud AI. The Qwen3 model runs locally via llama.cpp on your Mac. Conversations are stored in a local SQLite database. The only network requests are for features that inherently require the internet:

  • Web search — search queries require internet access by nature
  • Email — OAuth-authenticated email access through provider APIs

Why fully local works

Local AI has advanced rapidly. Vox runs Qwen3 locally via llama.cpp, with your choice of 4B, 8B, 14B, or 32B parameter variants depending on your hardware. These models deliver strong reasoning quality while keeping everything on your device.
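
Which variant fits your Mac comes down to memory. A common rule of thumb (an assumption here, not a Vox specification) is roughly 0.5 bytes per parameter for a 4-bit quantized model, plus some overhead for the KV cache and runtime buffers:

```python
def approx_ram_gb(params_billion: float, bytes_per_param: float = 0.5) -> float:
    """Rough RAM estimate for a quantized model.

    Assumes ~0.5 bytes/parameter (4-bit quantization) plus ~20% overhead
    for KV cache and runtime buffers. A ballpark figure, not a guarantee.
    """
    return params_billion * bytes_per_param * 1.2

for size in (4, 8, 14, 32):
    print(f"{size}B model ≈ {approx_ram_gb(size):.1f} GB RAM")
# 4B ≈ 2.4 GB, 8B ≈ 4.8 GB, 14B ≈ 8.4 GB, 32B ≈ 19.2 GB
```

By this estimate, the 4B and 8B variants fit comfortably on an 8-16 GB Mac, while the 32B variant wants 32 GB or more. Actual usage varies with quantization level and context length.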

You get capable AI without sacrificing privacy: your conversations, files, and model inputs stay on your Mac.

Note

The fully local approach means your files, conversations, knowledge base, and AI inference all stay on your machine. You get strong AI reasoning without any cloud dependency.


Vox proves that local AI can deliver real intelligence without any cloud dependency. Every operation — from AI reasoning to file management — runs entirely on your Mac.

Put Vox to work on your computer.

Download Vox for Mac and start with the local setup flow.

Download for Mac

macOS · Apple Silicon & Intel