hack-house

Files

T

leetcrypt 85fde59292 perf(ai): keep the Ollama model warm and honor a real num_ctx

OllamaProvider now sends keep_alive (default 30m) so the model stays resident
in VRAM between /ai calls instead of cold-reloading, and sets explicit options
(num_ctx 8192, num_predict 512) — Ollama otherwise caps context at 2048, which
would silently truncate the larger backfilled window.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-06-02 17:43:02 -07:00

__init__.py

feat(agent): model-agnostic AI agent bridge (PoC) + pin lets-hack demo to main

2026-06-01 02:05:48 -07:00

__main__.py

feat(ai): backfill context on join + token-budget window

2026-06-02 17:43:02 -07:00

bridge.py

feat(ai): backfill context on join + token-budget window

2026-06-02 17:43:02 -07:00

profiles.py

feat(ai): model profiles, capability discovery, and agentless /ai list|models

2026-06-01 15:25:07 -07:00

providers.py

perf(ai): keep the Ollama model warm and honor a real num_ctx

2026-06-02 17:43:02 -07:00