Commit Graph

37 Commits

Author SHA1 Message Date
e1e6803eb1 Clean up docs, tests, patch writing bug
Checked out commit from rsarv3006 which is super interesting, grabbed some inspiration from it and mentioned it in the ideas file.
2026-05-22 21:04:54 -07:00
33b9de5441 Story generator fixes again 2026-05-22 19:37:03 -07:00
c4d88fced5 Update story mode for invariants 2026-05-22 17:13:27 -07:00
a0ad9b2ac0 Project runner tool and writer tooling fixes
I wanted the animated status bar...
2026-05-22 04:17:37 -07:00
d928a52fb3 Improve logging 2026-05-22 03:31:58 -07:00
6e03430a33 Recover usable drafts from failed runs 2026-05-22 02:55:26 -07:00
90e4c80116 bugfixes after test run and terminal status 2026-05-21 04:14:05 -07:00
fb575fc5f7 Rename the tutorial to a more generic name
Rather avoid using the name of a popular service
2026-05-21 03:56:30 -07:00
f7fed4535b Add tutorial integration workflow helpers
- Add `integ-test` to create, set up, validate, and run integration template tasks
- Add `integ-report` to summarize latest integration run artifacts
- Switch default pastebin template from model fallback to single `qwen3-coder:30b`
- Support optional Ollama fields: `num_ctx`, `num_predict`, `seed`, and `stop`
- Add `nightshift validate` preflight for task-specific test files
- Update pastebin docs, config reference, and ideas tracking
- Add tests for integration helpers, task-test validation, config parsing, and template expectations
2026-05-21 03:46:27 -07:00
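For reference, the optional Ollama fields named in this commit map onto the `options` object of Ollama's HTTP API. The sketch below only builds the request and never sends it. It assumes the `/api/generate` endpoint and the default local host; `build_generate_request` is a hypothetical helper, not NightShift's actual backend code.

```python
import json
import urllib.request


def build_generate_request(model, prompt, *, num_ctx=None, num_predict=None,
                           seed=None, stop=None, host="http://localhost:11434"):
    """Build (but do not send) a POST request for Ollama's /api/generate.

    Hypothetical helper: the host default and the choice of endpoint are
    assumptions, not taken from the NightShift source.
    """
    candidate_options = {
        "num_ctx": num_ctx,
        "num_predict": num_predict,
        "seed": seed,
        "stop": stop,
    }
    # Only forward the fields the config actually set.
    options = {k: v for k, v in candidate_options.items() if v is not None}

    payload = {"model": model, "prompt": prompt, "stream": False}
    if options:
        payload["options"] = options

    return urllib.request.Request(
        f"{host}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_generate_request("qwen3-coder:30b", "hi", seed=7, stop=["\n\n"])
body = json.loads(request.data)
```

Unset fields are left out of the payload, so Ollama's own defaults apply to them.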
e3679296fd Reliability improvements, integ test fixes
Isolate editor from editing tests for the tutorial, hardcode tests for the integ test, some fixes around isolation. We got the integ working!
2026-05-21 00:31:31 -07:00
3bb5bd4157 Fixes based on tests, do tdd
Changed the pastebin tutorial so it now starts skeletal: no prebuilt Flask behavior, no pre-generated task tests, and .gitkeep placeholders under templates/ and tests/. The new pipeline in nightshift/project_templates/tutorial-pastebin/nightshift.yaml:1 now runs:

  plan -> semantic_context -> context -> write_tests -> review_tests -> implement -> pytest -> review

  Added nightshift/project_templates/tutorial-pastebin/.nightshift/agents/test-writer.md:1, tightened the planner/implementer/reviewer/debugger prompts, mirrored the pipeline docs/example,
  and raised default retries to 6 for the basic starter plus pastebin.

  I also fixed the retry policy issue in nightshift/escalation.py:17 and nightshift/pipeline.py:251: configured repeated-failure thresholds are now respected instead of hard-stopping
  early after three same-stage/same-cause failures. Non-implementation file_writer stages now get stage-specific retry artifacts so test generation does not collide with implementation
  repair artifacts.
2026-05-20 21:51:40 -07:00
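The retry fix in this commit can be pictured as a counter keyed on stage and failure cause that stops at a configured threshold rather than a fixed three. This is a minimal sketch under that reading. `RepeatFailureGuard` and its method names are hypothetical, and the real logic in nightshift/escalation.py may be structured differently.

```python
from collections import Counter


class RepeatFailureGuard:
    """Hypothetical guard: stop once one (stage, cause) pair keeps failing."""

    def __init__(self, threshold=3):
        if threshold < 1:
            raise ValueError("threshold must be at least 1")
        self.threshold = threshold
        self._counts = Counter()

    def record(self, stage, cause):
        """Record one failure; return True when the run should stop."""
        key = (stage, cause)
        self._counts[key] += 1
        return self._counts[key] >= self.threshold


# With a configured threshold of 6, the first five identical failures retry.
guard = RepeatFailureGuard(threshold=6)
decisions = [guard.record("implement", "pytest_failed") for _ in range(6)]
```

Keying on the pair means a different cause in the same stage starts its own count, so a stage that fails for a new reason is not stopped early.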
c12493a248 fix guard and make stop repeat configurable 2026-05-20 05:02:32 -07:00
d08e629bce hotdog versioning and some bugfixes for integration project and model testing 2026-05-20 03:50:51 -07:00
7c54050223 add integ runs, dynamic model choices, semantic search, better file creation, debugging agents 2026-05-20 02:36:23 -07:00
05471e432e Some notes from testing 2026-05-17 19:31:54 -07:00
a3e1be75fe Add more rigorous responses to test failures 2026-05-17 19:05:27 -07:00
068822d766 Dashboard and human readable devlogs 2026-05-17 16:09:49 -07:00
fa97abf0c3 Implemented deterministic diff generation via a new file_writer stage.
What changed:

  - Added file_writer stage support in nightshift/config.py:93, nightshift/agents.py:446, and nightshift/pipeline.py:374.
  - Added file-block parsing and deterministic patch generation in nightshift/patches.py:65.
  - file_writer agents now return complete file blocks:

    ```file:relative/path.py
    <complete file content>
    ```
    NightShift reads current files and generates the unified diff itself.

  - Existing patch_validator, patch_normalizer, and patch_apply still run after that.
  - Switched tiny-lisp-nightshift/nightshift.yaml:51 to type: file_writer.
  - Updated tiny-lisp-nightshift/agents/implementer.md:1 to request file blocks instead of diffs.
  - Updated README, tutorial, config reference, and design doc.

  Also added tests for parsing file blocks, generating multi-file diffs, validating those diffs, and running a full file_writer -> normalize -> validate pipeline.

  Verification: python -m unittest discover -v passes, 101 tests.
2026-05-17 15:24:10 -07:00
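The idea behind the file_writer stage can be sketched with the standard library alone: parse the complete file blocks out of the agent output, then let `difflib` produce the unified diff from the current file text. The regex and helper names below are assumptions for illustration. The actual parser in nightshift/patches.py is not shown in this log and may handle more cases.

```python
import difflib
import re

FENCE = "`" * 3  # built at runtime so this example contains no literal fence

# Hypothetical pattern for the file blocks the commit describes; the real
# parser in nightshift/patches.py may be stricter.
FILE_BLOCK = re.compile(
    re.escape(FENCE) + r"file:(?P<path>[^\n]+)\n(?P<body>.*?)\n" + re.escape(FENCE),
    re.DOTALL,
)


def parse_file_blocks(agent_output):
    """Map each relative path to the complete file content the agent returned."""
    return {
        m.group("path").strip(): m.group("body") + "\n"
        for m in FILE_BLOCK.finditer(agent_output)
    }


def make_unified_diff(path, old_text, new_text):
    """Build a unified diff from the current file text and the returned text."""
    return "".join(
        difflib.unified_diff(
            old_text.splitlines(keepends=True),
            new_text.splitlines(keepends=True),
            fromfile=f"a/{path}",
            tofile=f"b/{path}",
        )
    )


agent_output = "\n".join(
    ["Here is the file:", FENCE + "file:pkg/hello.py", "print('hi')", FENCE]
)
blocks = parse_file_blocks(agent_output)
diff = make_unified_diff("pkg/hello.py", "print('hello')\n", blocks["pkg/hello.py"])
```

Because the diff is computed from the two texts rather than written by the model, the same inputs always yield the same patch, and identical texts yield an empty diff.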
42564c6867 The ollama backend now uses Ollama’s HTTP API instead of ollama run 2026-05-17 14:23:31 -07:00
db9b24379e Outline some bugs and move some docs 2026-05-17 14:17:17 -07:00
e079c9088d The bug was that validate_patch had no on_fail, so NightShift stopped instead of sending that feedback back to implement. 2026-05-17 13:56:36 -07:00
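A rough sketch of the routing this fix restores: a failed stage hands control to its `on_fail` target, and a stage without one ends the run. The stage dictionaries and `next_stage` are hypothetical stand-ins for whatever nightshift.yaml and the pipeline code actually use.

```python
# Hypothetical stage list; the real schema lives in nightshift.yaml.
PIPELINE = [
    {"name": "implement"},
    {"name": "validate_patch", "on_fail": "implement"},
    {"name": "patch_apply"},
]


def next_stage(pipeline, current, passed):
    """Return the name of the stage to run next, or None to stop the run."""
    names = [stage["name"] for stage in pipeline]
    index = names.index(current)
    if passed:
        # Success: continue in order, or finish after the last stage.
        return names[index + 1] if index + 1 < len(names) else None
    # Failure: follow on_fail when configured; without it the run stops.
    return pipeline[index].get("on_fail")


after_failure = next_stage(PIPELINE, "validate_patch", passed=False)
```

Dropping `on_fail` from the `validate_patch` entry makes the same call return `None`, which matches the stop-instead-of-retry behavior the commit message describes.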
809ec92e0e fix apply patch when user has no git 2026-05-17 13:53:10 -07:00
7c1cf29745 fix pathing for tutorial 2026-05-17 10:30:41 -07:00
eb155d37e5 Added tutorial for real models 2026-05-17 10:29:05 -07:00
9e3b56b214 documentation update 2026-05-17 10:16:26 -07:00
a8616a1062 close out phase2 and support quickstart 2026-05-17 10:11:59 -07:00
12e2c99a75 improve communication, add code writing, change logo 2026-05-17 10:04:45 -07:00
646c655314 Repo Lookup, Request Context, Planner, Context Stage, QoL improvements
- Added operational run logging via nightshift/runlog.py.
- CLI now streams progress during run / run --all.
- Runs write .nightshift/runs/<run-id>/run.log and aggregate .nightshift/nightshift.log.
- Web dashboard now shows the last 100 run log lines.
- Added agent temperature config.
- Added minimal openai_compatible backend and temperature passing for it.
- Added Ollama temperature handling.
- Added scoped repo lookup tools in nightshift/repo_tools.py: list_files, read_file, grep.
- Planner agents can request lookup context with lookup_requests; NightShift saves files-inspected.md and reruns the planner with retrieved context.
- Added repo_context stage type that writes context-pack.md.
- Marked phases 23-27 complete in docs/design.md:990.
2026-05-17 09:56:28 -07:00
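One way to read "scoped" for the lookup tools in this commit is that every path is resolved against the repo root and rejected if it escapes it. The sketch below takes that approach. The function names come from the commit message, but the signatures and the `resolve_scoped` helper are assumptions, not the contents of nightshift/repo_tools.py.

```python
from pathlib import Path


def resolve_scoped(root, relative):
    """Resolve `relative` under `root`, refusing paths that escape the root."""
    root_path = Path(root).resolve()
    candidate = (root_path / relative).resolve()
    if candidate != root_path and root_path not in candidate.parents:
        raise ValueError(f"path escapes repo root: {relative}")
    return candidate


def list_files(root):
    """Return sorted POSIX-style paths of all files under the root."""
    root_path = Path(root).resolve()
    return sorted(
        p.relative_to(root_path).as_posix()
        for p in root_path.rglob("*")
        if p.is_file()
    )


def read_file(root, relative):
    """Read one file as UTF-8 text after the scope check."""
    return resolve_scoped(root, relative).read_text(encoding="utf-8")


def grep(root, needle):
    """Return (path, line_number, line) for every line containing `needle`."""
    hits = []
    for relative in list_files(root):
        try:
            text = read_file(root, relative)
        except UnicodeDecodeError:
            continue  # skip binary files
        for number, line in enumerate(text.splitlines(), start=1):
            if needle in line:
                hits.append((relative, number, line))
    return hits
```

Resolving before comparing means `..` segments and symlinks are normalized first, so a request such as `../secrets.txt` raises `ValueError` instead of reading outside the project.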
86aa7dd13c Update design.md 2026-05-17 09:45:49 -07:00
caac39b090 add bug docs 2026-05-17 02:09:32 -07:00
360f449738 next steps 2026-05-17 01:54:22 -07:00
957dc7d25b Ollama backend support, experiment metadata and prompt snapshots, stronger command execution controls, refreshed docs/examples, a read-only Flask dashboard, and a runnable quickstart Lisp project. 2026-05-17 01:39:44 -07:00
57608e9660 Add status, git artifacts, task completion, multi-task runs, and dependency handling 2026-05-17 01:19:43 -07:00
528c0ddeb5 documentation pass and hardening bugfixes 2026-05-17 00:49:17 -07:00
d84d580671 Implement pipeline execution, context, reports, and MVP docs 2026-05-17 00:38:00 -07:00
c1baf9b7d8 Implement NightShift MVP phases 1-6
Includes starter project generation, validation for configs/tasks/commands, artifact snapshot writing, structured stage results, command output capture, devlogs for phases 1-6, and unit coverage for the implemented MVP layers.
2026-05-17 00:17:13 -07:00
5958c82cf9 First docs 2026-05-16 23:56:58 -07:00