aistack
aistack
Section titled “aistack”A self-hosted, OpenAI-API-compatible gateway that wraps open-source ASR (faster-whisper, NVIDIA Parakeet, Alibaba SenseVoice), TTS (Qwen3-TTS via vLLM), and a local LLM proxy (to Ollama) behind a single HTTP endpoint.
Positioning
Section titled “Positioning”aistack is research-shaped, not productized. It is built for developers and researchers who want to compare backends, measure quality/latency trade-offs, and integrate local ASR/TTS into their own tools — not for end users who expect a single-installer “download and run” experience. The emphasis is on observability, multi-backend comparison, and unsentimental benchmarks.
If you want a turnkey local-AI experience, use Ollama directly for LLMs and wait for a downstream productized tool for ASR/TTS. If you want to study local ASR/TTS, aistack is built for you.
What’s published here
Section titled “What’s published here”| Section | What it is |
|---|---|
| Integration Guide | Hello-world to production: capability discovery, requests, errors, streaming. The first page to read if you’re new to aistack. |
| HTTP API | Per-endpoint design rationale (the why) plus the auto-generated Reference (the what). |
More sections (configuration, deployment notes, research findings) will be progressively published. The full repo lives at github.com/dosmoon/aistack.
Quick start
Section titled “Quick start”git clone https://github.com/dosmoon/aistackcd aistackpip install -e .[asr-fasterwhisper]python -m uvicorn aistack.main:app --port 11500curl http://127.0.0.1:11500/healthThe full install + extras layout (per-ASR-backend opt-in, TTS Docker container, Ollama side-by-side) is documented in the repo’s README.md.
API at a glance
Section titled “API at a glance”GET /health livenessGET /v1/models capability inventoryPOST /v1/audio/transcriptions speech-to-text (Whisper / Parakeet / SenseVoice)POST /v1/audio/speech text-to-speech (Qwen3-TTS)POST /v1/chat/completions chat completion (proxy to Ollama)Default base URL: http://127.0.0.1:11500. For the per-endpoint reference, head to the HTTP API section.