Blog - QVAC by Tether

23 June 2026

8 minutes read

Run OpenCode locally with QVAC: a coding agent with no cloud

23 June 2026 — What OpenCode is OpenCode is an open-source coding agent. You give it a goal in plain language, and it plans, reads and writes files, runs commands, and reports back. It lives in your terminal or in a browser interface, and it works the way a teammate would: open the repo, make the change, run it, […]

18 June 2026

11 minutes read

How to (easily) run a fully local, private AI assistant with OpenClaw and QVAC

18 June 2026 — What an agent harness is, and what OpenClaw does A language model on its own can only produce text. It cannot open a file, run a command, call an API, or remember what it did five minutes ago. An agent harness is the layer that closes that gap. It takes the model’s text output, turns […]

16 June 2026

6 minutes read

QVAC Football Predictor 2026: a local AI that calls matches, and shows its work

16 June 2026 — We built an app that predicts football matches. The twist: the AI runs entirely on your own machine, it simulates each match 10,000 times, and it reasons out loud before committing to a scoreline. No cloud model, no API key, no per-token bill. You can watch it think. This is a write-up of how it […]

2 June 2026

6 minutes read

TurboQuant in QVAC SDK 0.12.0: KV-cache quantization for production local AI

2 June 2026 — TurboQuant is a KV-cache quantization algorithm published by Google Research at ICLR 2026 (Zandieh et al.). It compresses the running context memory of a transformer LLM by up to 5x with nearly no accuracy loss across long-context benchmarks. QVAC SDK 0.12.0 integrates TurboQuant inside qvac-fabric-llm.cpp with a Vulkan backend.

QVAC Blog