Intelligence at the Edge,
In a Single API
The centralized cloud is too slow, too fragile, and too controlled. QVAC is the foundational SDK for local-first, decentralized AI. Embed intelligence that runs everywhere. Privately, instantly, and without permission.
npm install @qvac/sdk

import { completion, LLAMA_3_2_1B_INST_Q4_0, loadModel, unloadModel } from "@qvac/sdk";

// Supports any Pear or HTTP URL
const modelId = await loadModel({
  modelSrc: LLAMA_3_2_1B_INST_Q4_0,
  modelType: "llm",
});

const history = [
  {
    role: "user",
    content: "QVAC, how may entropy be reversed?",
  },
];

const result = completion({
  modelId,
  history,
  stream: true,
});

for await (const token of result.tokenStream) {
  console.log(token);
}

await unloadModel({ modelId });
Cross-platform AI for every device
Run AI models natively across any operating system, any platform, and any device. Build on popular JavaScript runtimes such as Node.js, Expo, Bare, or Bun. The SDK abstracts away platform complexity while providing consistent AI capabilities whether you're building for desktop, mobile, or the server.
Decentralization that doesn’t get in the way
We baked the entire Pears.com stack into the SDK to enable decentralized model sharing, delegated inference, and decentralized vector databases. P2P is native but optional: you can run RAG with your favorite vector database (Chroma, LanceDB, SQLite-vector, and more), and models can be fetched from any HTTP provider, such as HuggingFace.
import {
  loadModel,
  unloadModel,
  GTE_LARGE_FP16,
  ragSaveEmbeddings,
  ragSearch,
} from "@qvac/sdk";

const query = "machine learning algorithms";
const samples = ["sample 1", "sample 2"];

const modelId = await loadModel({
  modelSrc: GTE_LARGE_FP16,
  modelType: "embeddings",
});

const docs = await ragSaveEmbeddings({
  modelId,
  documents: samples,
  chunk: false,
});

const results = await ragSearch({
  modelId,
  query,
  topK: 3,
});

results.forEach((result) => {
  console.log(result.content);
});

await unloadModel({ modelId });
Local AI that scales
Create distributed AI inference networks where devices can provide or consume AI services. Enable resource sharing across the network, allowing lightweight devices to access powerful AI models running on other peers.
One SDK, All of AI
Seamlessly integrate multiple AI capabilities, including completion, transcription, tool calling, embeddings and retrieval, translation, vision, and text-to-speech, through a single entry point. Streaming and multimodal inputs are supported throughout.
import {
  loadModel,
  textToSpeech,
  unloadModel,
  TTS_PIPER_NORMAN_EN_US_ONNX_MEDIUM,
  TTS_PIPER_NORMAN_EN_US_ONNX_MEDIUM_CONFIG,
} from "@qvac/sdk";

const eSpeakDataPath = "some path";

const modelId = await loadModel({
  modelSrc: TTS_PIPER_NORMAN_EN_US_ONNX_MEDIUM,
  modelType: "tts",
  configSrc: TTS_PIPER_NORMAN_EN_US_ONNX_MEDIUM_CONFIG,
  eSpeakDataPath,
  modelConfig: {
    language: "en",
  },
});

const result = textToSpeech({
  modelId,
  text: "QVAC SDK is the canonical entry point to QVAC",
  inputType: "text",
  stream: false,
});

const audioBuffer = await result.buffer;
// now you can convert to wav and play

await unloadModel({ modelId });
FAQ
How is the QVAC SDK different from cloud AI APIs?
With cloud AI APIs, your data is sent to third-party servers for processing, you pay per request, and you need a constant internet connection. The QVAC SDK runs AI models directly on your own device. That means your data never leaves your hardware, there are no per-request costs, no rate limits, and no dependency on an internet connection once you have a model downloaded. You own the entire pipeline.
Who is the QVAC SDK for?
The QVAC SDK is a good fit if you're a developer building an application that needs AI capabilities such as chat, speech-to-text, text-to-speech, or translation, and you care about user privacy, offline functionality, or avoiding cloud costs. It's designed for JavaScript/TypeScript developers. If you're building a desktop app, a mobile app, or a backend service and want AI that runs locally, this SDK is built for that.
Is the QVAC SDK free to use?
Yes. The QVAC SDK is completely free and open-source. There are no subscription fees, usage charges, or per-request costs.
What license is the SDK released under?
The SDK is released under the Apache License 2.0, a permissive open-source license that allows free use, modification, and distribution, including in commercial products.
Can I use the SDK in a commercial product?
Yes. The Apache 2.0 license explicitly permits commercial use. You can integrate the QVAC SDK into proprietary, closed-source, or commercial applications without restriction.
Is my data sent to any external server?
No. All AI processing happens locally on your device. Your prompts, documents, audio, and images are never sent to any external server. The only network activity is the initial model download (which can also be done over peer-to-peer) and optional peer-to-peer inference if you choose to enable it.
Does the SDK work offline?
Yes. Once a model has been downloaded and cached on disk, the SDK works fully offline with no internet connection required. This makes it suitable for air-gapped, field, or restricted-network deployments.
Can I build a chatbot with the SDK?
Yes. The SDK supports LLM-based text completion with conversation history, streaming responses, and tool/function calling. You can build interactive chatbots, assistants, and conversational agents. It also supports multimodal conversations where users can send both text and images.
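A multi-turn conversation is just an array of `{ role, content }` messages, the same shape the `completion` call consumes in the example at the top of the page. As a minimal sketch (the `appendTurn` helper is hypothetical, not part of the SDK), keeping the history up to date between turns looks like this:

```typescript
// Message shape consumed by the SDK's `completion` call.
type Message = { role: "user" | "assistant"; content: string };

// Hypothetical helper: record a user turn and the model's reply so the
// next `completion` call sees the full conversation.
function appendTurn(
  history: Message[],
  userText: string,
  assistantText: string
): Message[] {
  return [
    ...history,
    { role: "user", content: userText },
    { role: "assistant", content: assistantText },
  ];
}

const history: Message[] = [];
const next = appendTurn(history, "Hello!", "Hi! How can I help?");
console.log(next.length); // 2
```

Passing the accumulated array back as `history` on each call is what gives the model its conversational memory.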
Can I integrate the SDK into an existing application?
Yes. The SDK is a standard npm package that you install and import into your project. You can add it to an existing backend, desktop app, or mobile app. Also, any tool or app that works with the OpenAI REST API standard can point to a local QVAC server and work without changes.
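Because the server speaks the standard OpenAI chat-completions format, an existing client only needs its base URL changed. A sketch of the raw request (the local address and port below are assumptions, not documented defaults; check your QVAC server configuration):

```typescript
// Assumed local QVAC server address; adjust to your configuration.
const baseUrl = "http://localhost:8080";

// Standard OpenAI chat-completions request body; the model name is
// illustrative.
const body = {
  model: "local-model",
  messages: [{ role: "user", content: "Hello from a local client" }],
  stream: false,
};

// Any HTTP client works; `fetch` is built into Node.js 18+.
async function chat() {
  const res = await fetch(`${baseUrl}/v1/chat/completions`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(body),
  });
  return res.json();
}
```

Existing OpenAI-compatible SDKs and tools can do the same thing by pointing their base URL at the local server.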
Does the SDK support voice?
Yes. The SDK supports both speech-to-text and text-to-speech.
Can the SDK read and search documents?
Yes. The SDK includes an OCR capability. Combined with the RAG (Retrieval-Augmented Generation) system, you can ingest documents, index their content, and query them using natural language.
Does the SDK support translation?
Yes. The SDK includes a neural machine translation engine supporting multiple language pairs, as well as support for LLM-based translation.
Can the SDK understand images?
Yes. The SDK supports multimodal models that can process both text and images in a single conversation. You can send an image alongside a text prompt and the model will reason about the visual content.
Which models can I use?
You can use any model you want and load it from a local file path, a URL (such as a HuggingFace link), or through peer-to-peer. For LLMs and embeddings, any GGUF-format model is supported. For TTS and OCR, ONNX-format models are used.
Does the SDK run on mobile devices?
Yes. The SDK supports iOS and Android.
Does the SDK run on desktop?
Yes. The SDK runs on macOS, Linux, and Windows. It works with the most common JavaScript backends used in Electron and similar desktop frameworks.
Do I need a GPU?
No, a GPU is not strictly required, as the SDK can run inference on the CPU. However, GPU acceleration significantly improves performance. The SDK supports Metal on macOS and iOS (as well as CPU on Intel), Vulkan on Linux, Windows, and Android, and OpenCL on select Android devices.
What does peer-to-peer mean here?
Peer-to-peer (P2P) means devices can communicate directly without a central server. In the QVAC SDK, this enables two things: you can download models from other users' devices instead of a central server, and you can delegate AI tasks to a more powerful device on your network. For example, a mobile phone could offload a heavy AI task to a desktop PC. All P2P connections use end-to-end encrypted, direct links with no data passing through third-party infrastructure.
How do I get started?
Follow the steps in our installation guide.
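Installation itself is a single command (the package name is shown in the example at the top of the page); the installation guide covers the rest:

```shell
npm install @qvac/sdk
```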
How fast is local inference?
It depends on your hardware and the model size. On modern devices with GPU acceleration (Apple Silicon Macs, recent Android/iOS devices), local inference can be very responsive. Larger models require more capable hardware. The key trade-off is that you gain complete privacy, zero latency from network round-trips, and no rate limits.
What hardware do you recommend?
For the best experience, use a device with a supported GPU and enough RAM to hold your chosen model in memory. Smaller quantized models (e.g., 1B–3B parameters) run well even on modest hardware, including phones. Larger models (7B+) benefit from more RAM and GPU memory (VRAM).
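As a back-of-the-envelope sizing rule (a sketch, not an official sizing guide): a quantized model's weights take roughly parameters × bits-per-weight ÷ 8 bytes, plus some overhead for the KV cache and runtime buffers. The overhead factor below is an assumption and varies with context length and backend:

```typescript
// Rough memory estimate for a quantized model, in GB.
// overheadFactor is an assumed allowance for KV cache and runtime buffers.
function estimateModelMemoryGB(
  paramsBillions: number,
  bitsPerWeight: number,
  overheadFactor = 1.2
): number {
  const weightBytes = paramsBillions * 1e9 * (bitsPerWeight / 8);
  return (weightBytes * overheadFactor) / 1e9;
}

// A 3B-parameter model at 4-bit quantization: ~1.8 GB
console.log(estimateModelMemoryGB(3, 4).toFixed(1));
// A 7B-parameter model at 4-bit quantization: ~4.2 GB
console.log(estimateModelMemoryGB(7, 4).toFixed(1));
```

This is why 1B–3B models at 4-bit quantization fit comfortably on phones, while 7B+ models want several GB of RAM or VRAM.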