hack-house/cmd_chat/agent
leetcrypt 85fde59292 perf(ai): keep the Ollama model warm and honor a real num_ctx
OllamaProvider now sends keep_alive (default 30m) so the model stays resident
in VRAM between /ai calls instead of cold-reloading, and sets explicit options
(num_ctx 8192, num_predict 512) — Ollama otherwise caps context at 2048, which
would silently truncate the larger backfilled window.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-06-02 17:43:02 -07:00
..
__init__.py feat(agent): model-agnostic AI agent bridge (PoC) + pin lets-hack demo to main 2026-06-01 02:05:48 -07:00
__main__.py feat(ai): backfill context on join + token-budget window 2026-06-02 17:43:02 -07:00
bridge.py feat(ai): backfill context on join + token-budget window 2026-06-02 17:43:02 -07:00
profiles.py feat(ai): model profiles, capability discovery, and agentless /ai list|models 2026-06-01 15:25:07 -07:00
providers.py perf(ai): keep the Ollama model warm and honor a real num_ctx 2026-06-02 17:43:02 -07:00