QVAC Blog

Filter By:
Run OpenCode locally with QVAC: a coding agent with no cloud

What OpenCode is OpenCode is an open-source coding agent. You give it a goal in plain language, and it plans, reads and writes files, runs commands, and reports back. It lives in your terminal or in a browser interface, and it works the way a teammate would: open the repo, make the change, run it, […]

Read more
How to (easily) run a fully local, private AI assistant with OpenClaw and QVAC

What an agent harness is, and what OpenClaw does A language model on its own can only produce text. It cannot open a file, run a command, call an API, or remember what it did five minutes ago. An agent harness is the layer that closes that gap. It takes the model’s text output, turns […]

Read more
QVAC Football Predictor 2026: a local AI that calls matches, and shows its work

We built an app that predicts football matches. The twist: the AI runs entirely on your own machine, it simulates each match 10,000 times, and it reasons out loud before committing to a scoreline. No cloud model, no API key, no per-token bill. You can watch it think. This is a write-up of how it […]

Read more
TurboQuant in QVAC SDK 0.12.0: KV-cache quantization for production local AI

TurboQuant is a KV-cache quantization algorithm published by Google Research at ICLR 2026 (Zandieh et al.). It compresses the running context memory of a transformer LLM by up to 5x with nearly no accuracy loss across long-context benchmarks. QVAC SDK 0.12.0 integrates TurboQuant inside qvac-fabric-llm.cpp with a Vulkan backend.

Read more
Loading...