Decision tool
Local AI stack recommender
Answer a few questions and get a deterministic stack suggestion with hardware, privacy, document workflow, and configuration caveats.
The recommender is deterministic. It reuses the RAM/VRAM calculator for hardware fit and keeps incomplete or configuration-dependent paths visible.
Goal fit
general_chat
Mapped to the simplest source-backed stack that fits the selected workflow.
Hardware fit
fits_comfortably
This setup should likely run the selected model locally with the selected context, assuming the model format and runtime are configured correctly.
Privacy posture
casual
Configuration-dependent.
Setup complexity
simplest
Docker-heavy paths are penalized unless browser UI was requested.
LM Studio
Use LM Studio as the first stack to evaluate for this input set.
This setup should likely run the selected model locally with the selected context, assuming the model format and runtime are configured correctly.
Alternatives
- Ollama: Local runtime/API baseline for developer and backend workflows.
- Ollama + Open WebUI: Browser workspace option; Ollama remains the runtime.
What not to do
- Do not choose Open WebUI alone as the model runtime; pair it with Ollama or another provider.
- Do not assume document chat is local-only without checking the LLM, embedder, vector database, and tool settings.
- Do not increase stack complexity before lowering model size, context, or quantization when hardware fit is poor.
Caveats
- This is deterministic planning guidance, not a benchmark or hands-on setup result.
- Model fit still depends on model size, quantization, context length, runtime overhead, drivers, and other running apps.