Fix parser routing after PR merge

2026-06-14 10:08:37 +00:00 · 2026-05-24 01:03:59 -07:00 · 2026-05-24 01:03:59 -07:00 · 3f8532197f
commit 3f8532197f
parent 03438e1e8a
10 changed files with 241 additions and 61 deletions
--- a/docs/config-reference.md
+++ b/docs/config-reference.md
@ -16,6 +16,7 @@ NightShift config is YAML.
 - `allowed_commands`: exact command-stage allowlist entries after whitespace normalization.
 - `forbidden_commands`: dangerous fragments blocked before allowlist acceptance.
 - `allowed_env`: optional environment variable names to pass to command stages.
 - `skip_repo_parts`: optional directory or path-part names to exclude from repo lookup tools, in addition to built-in skips such as `.git`, `.nightshift`, `__pycache__`, `.venv`, and `venv`.
 ## `experiment`
@ -104,14 +105,23 @@ Writer stages:
 - `code_writer`: agent returns a unified diff directly.
 - `file_writer`: agent returns complete file content blocks; NightShift generates the unified diff deterministically. Prefer this for local models that wrap or miscount long patch hunks.
-`file_writer` blocks use this form:
+Code-oriented `file_writer` stages use fenced blocks:
 ````markdown
-```file:relative/path.py
+```file:relative/path.to
 <complete file content>
 ```
 ````
 Writing-oriented `file_writer` stages for `story/chapters` and story state files use delimiter blocks:
 ```text
 FILE: story/chapters/chapter-001/scene-001.md
 ---CONTENT---
 <complete prose or state content>
 ---END---
 ```
 Semantic context stage:
 ```yaml
@ -124,7 +134,7 @@ This stage builds a lightweight repository index of files, Python symbols, impor
 ### `on_status` Stage Routing
-Instead of a single `on_fail` catch-all, use `on_status` to route each review status to a different stage:
+Use `on_status` to route stage statuses to different follow-up stages:
 ```yaml
 - id: review
@ -135,11 +145,13 @@ Instead of a single `on_fail` catch-all, use `on_status` to route each review st
    pass: summarize
    retry: implement
    fail: plan
-    escalate: human
+    escalate: summarize
 ```
 `on_status` supports `pass`, `fail`, `retry`, and `escalate` keys. For `pass`, it overrides sequential progression and any agent-supplied `next_stage`. For non-pass statuses, the lookup order is: `on_status[status]` → `on_fail` → `next_stage` (agent output).
 The older `on_pass` key remains supported as a deprecated alias for `on_status.pass`.
 ## Failure, Retry, and Resource Artifacts
 Failed command and validation stages write deterministic diagnostics under the task artifact directory:
--- a/docs/writer-and-coder.md
+++ b/docs/writer-and-coder.md
@ -20,7 +20,7 @@ After that correction, the automated test suite passes.
 - State update file-writer stages receive focused current state context.
 - Scene editor file-writer stages receive `current_scene_file`.
 - Agent invocations now write a sibling JSON artifact for reliable stdout/stderr extraction.
- Pipeline config now supports optional `on_pass` routing.
+- Pipeline config now supports optional `on_status` routing. The older `on_pass` key remains as a deprecated alias for `on_status.pass`.
 ## Coding Impact Findings
@ -37,11 +37,11 @@ No non-novel project template files changed in the current diff:
 The new `editor` agent and review repair routing are only configured in `tutorial-novel/nightshift.yaml`.
-### Finding 2: `on_pass` is inert for existing coding configs
+### Finding 2: `on_status` is inert for existing coding configs
-`on_pass` defaults to `None`, so existing coding templates keep their prior linear pass behavior unless they explicitly opt in.
+`on_status` defaults to `None`, so existing coding templates keep their prior linear pass behavior unless they explicitly opt in.
-Passing review stages still ignore model-provided `next_stage` values. This preserves the existing safety behavior where reviewers cannot jump around the pipeline on a pass unless the config has an explicit `on_pass`.
+Passing review stages still ignore model-provided `next_stage` values. This preserves the existing safety behavior where reviewers cannot jump around the pipeline on a pass unless the config has an explicit `on_status.pass` or legacy `on_pass`.
 ### Finding 3: Code writer stages still use the same direct patch path
@ -131,6 +131,6 @@ Result:
 ## Conclusion
-After the first-attempt file-writer context fix, I do not see evidence that the writer workflow changes degrade code generation. The shared changes are either opt-in (`on_pass`), artifact-reading improvements (JSON stdout), or narrowly gated to novel state/editor stages.
+After the first-attempt file-writer context fix, I do not see evidence that the writer workflow changes degrade code generation. The shared changes are either opt-in (`on_status`), artifact-reading improvements (JSON stdout), or narrowly gated to novel state/editor stages.
 Remaining non-writer issue: several coding-oriented templates still reference a missing `debugger.md` prompt. That should be handled separately from this writer/coder compatibility pass.
--- a/nightshift/config.py
+++ b/nightshift/config.py
@ -62,6 +62,7 @@ class StageConfig:
    commands: tuple[str, ...] = ()
    output: str | None = None
    on_fail: str | None = None
    on_pass: str | None = None
    on_status: dict[str, str] | None = None
    shell: bool = True
    timeout_seconds: int | None = None
@ -398,6 +399,7 @@ def parse_config(raw: dict[str, Any], config_path: Path) -> NightShiftConfig:
                commands=commands,
                output=_optional_string(stage_raw.get("output"), f"{stage_context}.output"),
                on_fail=_optional_string(stage_raw.get("on_fail"), f"{stage_context}.on_fail"),
                on_pass=_optional_string(stage_raw.get("on_pass"), f"{stage_context}.on_pass"),
                on_status=_parse_on_status(stage_raw, stage_context),
                shell=_optional_bool(stage_raw.get("shell", True), f"{stage_context}.shell"),
                timeout_seconds=timeout_seconds,
@ -423,6 +425,10 @@ def parse_config(raw: dict[str, Any], config_path: Path) -> NightShiftConfig:
            raise ConfigError(
                f"Config error: stage '{stage.id}' on_fail references unknown stage '{stage.on_fail}'."
            )
        if stage.on_pass and stage.on_pass not in stage_ids:
            raise ConfigError(
                f"Config error: stage '{stage.id}' on_pass references unknown stage '{stage.on_pass}'."
            )
        if stage.on_status:
            for status_key, target in stage.on_status.items():
                if target not in stage_ids:
@ -650,8 +656,11 @@ VALID_STATUS_KEYS = frozenset({"pass", "fail", "retry", "escalate"})
 def _parse_on_status(raw: dict[str, Any], context: str) -> dict[str, str] | None:
    on_status_raw = raw.get("on_status")
-    if on_status_raw is None:
+    on_pass = _optional_string(raw.get("on_pass"), f"{context}.on_pass")
    if on_status_raw is None and on_pass is None:
        return None
    if on_status_raw is None:
        return {"pass": on_pass} if on_pass is not None else None
    if not isinstance(on_status_raw, dict):
        raise ConfigError(f"Config error: {context}.on_status must be a mapping.")
    on_status: dict[str, str] = {}
@ -666,4 +675,11 @@ def _parse_on_status(raw: dict[str, Any], context: str) -> dict[str, str] | None
                f"Config error: {context}.on_status.{key} must be a non-empty string."
            )
        on_status[key] = value
    if on_pass is not None:
        existing_pass = on_status.get("pass")
        if existing_pass is not None and existing_pass != on_pass:
            raise ConfigError(
                f"Config error: {context}.on_pass conflicts with on_status.pass."
            )
        on_status["pass"] = on_pass
    return on_status
--- a/nightshift/patches.py
+++ b/nightshift/patches.py
@ -90,10 +90,9 @@ def repair_hunk_counts(patch: str) -> str:
 def parse_file_updates(text: str) -> tuple[FileUpdate, ...]:
-    """Parse model-supplied complete file content blocks."""
+    """Parse fenced model-supplied complete file content blocks."""
    updates: list[FileUpdate] = []
    updates.extend(_parse_delimited_file_updates(text))
    pattern = re.compile(
        r"```(?:file|path)[:=](?P<path>[^\n`]+)\n(?P<content>.*?)```",
        flags=re.DOTALL | re.IGNORECASE,
@ -106,22 +105,44 @@ def parse_file_updates(text: str) -> tuple[FileUpdate, ...]:
        updates.append(FileUpdate(path=path, content=content))
    if not updates:
        raise PipelineError(
-            "File writer error: no file blocks found. Expected fenced blocks like ```file:path.to."
+            "File writer error: no fenced file blocks found. Expected fenced blocks like ```file:path.to."
        )
    return tuple(updates)
-def _parse_delimited_file_updates(text: str) -> list[FileUpdate]:
+def parse_delimited_file_updates(text: str) -> tuple[FileUpdate, ...]:
    """Parse delimiter file blocks used by prose and story-state writer stages."""
    updates: list[FileUpdate] = []
    header_pattern = re.compile(r"(?m)^FILE:\s*(?P<path>[^\n]+)\n---CONTENT---\n")
    matches = list(header_pattern.finditer(text))
    for index, match in enumerate(matches):
        path = match.group("path").strip().strip("`")
        content_start = match.end()
        next_file_start = matches[index + 1].start() if index + 1 < len(matches) else len(text)
        raw_content = text[content_start:next_file_start]
        end_match = re.search(r"(?m)^---END---\s*$", raw_content)
        if end_match:
            raw_content = raw_content[: end_match.start()]
        content = raw_content.rstrip("\r\n") + "\n"
        if path:
            updates.append(FileUpdate(path=path, content=content))
    if updates:
        return tuple(updates)
    pattern = re.compile(
        r"(?ms)^FILE:\s*(?P<path>[^\n]+)\n---CONTENT---\n(?P<content>.*?)\n---END---\s*$"
    )
    updates: list[FileUpdate] = []
    for match in pattern.finditer(text):
        path = match.group("path").strip().strip("`")
        content = match.group("content")
        if path:
            updates.append(FileUpdate(path=path, content=content + "\n"))
-    return updates
+    if not updates:
        raise PipelineError(
            "File writer error: no delimiter file blocks found. Expected FILE: path with ---CONTENT---/---END---."
        )
    return tuple(updates)
 def generate_patch_from_file_updates(
--- a/nightshift/pipeline.py
+++ b/nightshift/pipeline.py
@ -30,6 +30,7 @@ from .patches import (
    format_validation_result,
    generate_patch_from_file_updates,
    normalize_patch_text,
    parse_delimited_file_updates,
    parse_file_updates,
    validate_patch,
 )
@ -200,41 +201,22 @@ class PipelineRunner:
                retry_notes.append(f"Context update from '{stage.id}': {result.context_update}")
            if result.status == "pass":
-                if stage.on_status and "pass" in stage.on_status:
+                pass_target_stage = (stage.on_status or {}).get("pass") or stage.on_pass or result.next_stage
-                    target = stage.on_status["pass"]
+                if stage.type in {"agent_review", "review"} and result.next_stage:
                    if target not in stage_indexes:
                        final_status = "failed"
                        final_reason = (
                            f"Stage '{stage.id}' on_status.pass references unknown stage '{target}'."
                        )
                        break
                    self.logger.event(
-                        "stage.next",
+                        "stage.next_ignored",
-                        "Jumping via on_status.pass",
+                        "Ignoring next_stage from passing review",
                        run_id=self.artifacts.run_id,
                        task_id=task.id,
                        stage_id=stage.id,
-                        next_stage=target,
+                        requested_next_stage=result.next_stage,
                    )
-                    index = stage_indexes[target]
+                    pass_target_stage = (stage.on_status or {}).get("pass") or stage.on_pass
-                    continue
+                if pass_target_stage:
-                if stage.type in {"agent_review", "review"}:
+                    if pass_target_stage not in stage_indexes:
                    if result.next_stage:
                        self.logger.event(
                            "stage.next_ignored",
                            "Ignoring next_stage from passing review",
                            run_id=self.artifacts.run_id,
                            task_id=task.id,
                            stage_id=stage.id,
                            requested_next_stage=result.next_stage,
                        )
                    index += 1
                    continue
                if result.next_stage:
                    if result.next_stage not in stage_indexes:
                        final_status = "failed"
                        final_reason = (
-                            f"Stage '{stage.id}' requested unknown next stage '{result.next_stage}'."
+                            f"Stage '{stage.id}' requested unknown next stage '{pass_target_stage}'."
                        )
                        break
                    self.logger.event(
@ -243,9 +225,9 @@ class PipelineRunner:
                        run_id=self.artifacts.run_id,
                        task_id=task.id,
                        stage_id=stage.id,
-                        next_stage=result.next_stage,
+                        next_stage=pass_target_stage,
                    )
-                    index = stage_indexes[result.next_stage]
+                    index = stage_indexes[pass_target_stage]
                    continue
                index += 1
                continue
@ -790,7 +772,7 @@ class PipelineRunner:
        while True:
            updates: tuple[FileUpdate, ...] = ()
            try:
-                updates = parse_file_updates(stdout)
+                updates = _parse_file_writer_updates(stage, stdout)
                candidate_index_path = self._write_file_writer_candidates(
                    task.id,
                    stage,
@ -843,7 +825,7 @@ class PipelineRunner:
                    )
                    break
                if (
-                    "no file blocks found" in str(exc)
+                    "file blocks found" in str(exc)
                    and "diff --git " not in stdout
                    and not invalid_rerun_done
                ):
@ -1842,6 +1824,8 @@ def _invalid_file_writer_output_summary(output: str, reason: str, max_chars: int
    lowered = output.lower()
    if "```file:" in lowered or "```path:" in lowered:
        lines.append("The output started a file block, but NightShift could not parse a complete closed block.")
    elif re.search(r"(?m)^FILE:\s*[^\n]+\n---CONTENT---\n", output):
        lines.append("The output started a delimiter file block, but NightShift could not parse a complete block.")
    else:
        lines.append("The output did not contain a parseable file block.")
    excerpt = output.strip()
@ -1865,6 +1849,12 @@ def _resolve_retry_target_stage(stage: StageConfig, result: StageResult) -> str
    return (stage.on_status or {}).get(result.status) or stage.on_fail or result.next_stage
 def _parse_file_writer_updates(stage: StageConfig, stdout: str) -> tuple[FileUpdate, ...]:
    if _is_writing_file_writer_stage(stage):
        return parse_delimited_file_updates(stdout)
    return parse_file_updates(stdout)
 def _previous_continuity_review_passed(previous_outputs: dict[str, str]) -> bool:
    for name, output in previous_outputs.items():
        if "continuity" in name and re.search(r"(?im)^status:\s*pass\s*$", output):
@ -1938,10 +1928,10 @@ def _filter_allowed_file_updates(updates: tuple[FileUpdate, ...], stage: StageCo
 def _file_writer_repair_format_note(stage: StageConfig) -> str:
-    if _is_state_update_stage(stage):
+    if _is_writing_file_writer_stage(stage):
        return (
            "Use delimiter file blocks only: FILE: path, ---CONTENT---, complete file content, "
-            "---END---. Do not use markdown code fences for state update output."
+            "---END---. Do not use markdown code fences for writing output."
        )
    return "Use complete fenced file blocks with both the opening ```file:path and closing ``` fence."
--- a/nightshift/templates.py
+++ b/nightshift/templates.py
@ -73,7 +73,7 @@ pipeline:
      #   pass: summarize
      #   retry: implement
      #   fail: plan
-      #   escalate: human
+      #   escalate: summarize
      on_fail: implement
      output: review.md
--- a/tests/test_config.py
+++ b/tests/test_config.py
@ -77,6 +77,26 @@ class ConfigTests(unittest.TestCase):
            })
            self.assertIsNone(review_stage.on_fail)
    def test_on_pass_loads_as_legacy_alias(self) -> None:
        with tempfile.TemporaryDirectory() as directory:
            root = Path(directory)
            init_project(root)
            config_path = root / "nightshift.yaml"
            config_path.write_text(
                config_path.read_text(encoding="utf-8").replace(
                    "      output: plan.md",
                    "      output: plan.md\n      on_pass: summarize",
                    1,
                ),
                encoding="utf-8",
            )
            config = load_config(config_path)
            plan_stage = next(stage for stage in config.pipeline.stages if stage.id == "plan")
            self.assertEqual(plan_stage.on_pass, "summarize")
            self.assertEqual(plan_stage.on_status, {"pass": "summarize"})
    def test_on_status_rejects_invalid_key(self) -> None:
        with tempfile.TemporaryDirectory() as directory:
            root = Path(directory)
@ -107,6 +127,24 @@ class ConfigTests(unittest.TestCase):
            with self.assertRaisesRegex(ConfigError, "on_status.fail references unknown stage"):
                load_config(config_path)
    def test_skip_repo_parts_loads(self) -> None:
        with tempfile.TemporaryDirectory() as directory:
            root = Path(directory)
            init_project(root)
            config_path = root / "nightshift.yaml"
            config_path.write_text(
                config_path.read_text(encoding="utf-8").replace(
                    "  allowed_commands:",
                    "  skip_repo_parts:\n    - dist\n  allowed_commands:",
                    1,
                ),
                encoding="utf-8",
            )
            config = load_config(config_path)
            self.assertEqual(config.safety.skip_repo_parts, ("dist",))
    def test_validate_requires_prompt_files(self) -> None:
        with tempfile.TemporaryDirectory() as directory:
            root = Path(directory)
--- a/tests/test_patches.py
+++ b/tests/test_patches.py
@ -7,6 +7,7 @@ from nightshift.errors import PipelineError
 from nightshift.patches import (
    generate_patch_from_file_updates,
    normalize_patch_text,
    parse_delimited_file_updates,
    parse_file_updates,
    repair_hunk_counts,
    validate_patch,
@ -248,7 +249,7 @@ test
            self.assertEqual(result.files, ("src/app.py", "src/test_app.py"))
    def test_file_updates_parse_explicit_delimiters(self) -> None:
-        updates = parse_file_updates(
+        updates = parse_delimited_file_updates(
            """FILE: story/chapters/chapter-001/scene-001.md
 ---CONTENT---
 Sunlight did not belong here.
@ -261,7 +262,7 @@ Sunlight did not belong here.
        self.assertEqual(updates[0].content, "Sunlight did not belong here.\n")
    def test_file_updates_parse_delimiters_without_end_before_next_file(self) -> None:
-        updates = parse_file_updates(
+        updates = parse_delimited_file_updates(
            """Intro prose is ignored.
 FILE: story/plot-state.md
@ -285,7 +286,7 @@ FILE: story/timeline.md
        self.assertEqual(updates[1].content, "# Timeline\n\n- SCENE-002 complete.\n")
    def test_file_updates_parse_mixed_delimiter_end_and_next_file(self) -> None:
-        updates = parse_file_updates(
+        updates = parse_delimited_file_updates(
            """FILE: story/plot-state.md
 ---CONTENT---
 first
@ -301,6 +302,16 @@ second
        self.assertEqual(updates[0].content, "first\n")
        self.assertEqual(updates[1].content, "second\n")
    def test_code_file_updates_reject_delimiter_blocks(self) -> None:
        with self.assertRaisesRegex(PipelineError, "no fenced file blocks"):
            parse_file_updates(
                """FILE: src/app.py
 ---CONTENT---
 print("hello")
 ---END---
 """
            )
    def test_file_updates_reject_duplicate_blocks(self) -> None:
        with tempfile.TemporaryDirectory() as directory:
            root = Path(directory)
--- a/tests/test_pipeline.py
+++ b/tests/test_pipeline.py
@ -238,11 +238,77 @@ class PipelineRunnerTests(unittest.TestCase):
            result = runner.run_task(task)
-            self.assertEqual(result.status, "failed")
+            self.assertEqual(result.status, "complete")
-            self.assertEqual(result.retry_count, 2)
+            self.assertEqual(result.retry_count, 1)
            self.assertEqual(
                [r.stage_id for r in result.stage_results],
-                ["plan", "review", "implement", "review", "implement", "review"],
+                ["plan", "review", "implement"],
            )
    def test_on_status_pass_jumps_to_configured_stage(self) -> None:
        with tempfile.TemporaryDirectory() as directory:
            root = Path(directory)
            _write_common_files(root)
            stages = (
                StageConfig(id="first", type="agent", agent="planner", output="first.md", on_status={"pass": "third"}),
                StageConfig(
                    id="second",
                    type="command",
                    commands=('python -c "print(\'should not run\')"',),
                    output="second-output.txt",
                ),
                StageConfig(id="third", type="summarize", output="final-notes.md"),
            )
            config = make_config(root, stages)
            runner = PipelineRunner(config, ArtifactStore(root, ".nightshift", run_id="test-run"))
            result = runner.run_task(parse_tasks(TASK_MD)[0])
            task_dir = root / ".nightshift" / "runs" / "test-run" / "tasks" / "TASK-001"
            self.assertEqual(result.status, "complete")
            self.assertEqual([item.stage_id for item in result.stage_results], ["first", "third"])
            self.assertFalse((task_dir / "second-output.txt").exists())
    def test_on_status_failure_takes_priority_over_agent_next_stage(self) -> None:
        with tempfile.TemporaryDirectory() as directory:
            root = Path(directory)
            _write_common_files(root)
            config = make_config(root, (), max_retries=1)
            config.agents["reviewer"] = AgentConfig(
                id="reviewer",
                backend="command",
                command=(
                    "python -c \"print('status: retry\\nreason: needs plan repair\\n"
                    "next_stage: implement')\""
                ),
                system_prompt=Path("reviewer.md"),
            )
            config = replace(
                config,
                pipeline=PipelineConfig(
                    max_task_retries=1,
                    stages=(
                        StageConfig(id="plan", type="agent", agent="planner", output="plan.md"),
                        StageConfig(id="implement", type="agent", agent="planner", output="implementation-log.md"),
                        StageConfig(
                            id="review",
                            type="agent_review",
                            agent="reviewer",
                            on_status={"retry": "plan"},
                            on_fail="implement",
                            output="review.md",
                        ),
                    ),
                ),
            )
            runner = PipelineRunner(config, ArtifactStore(root, ".nightshift", run_id="test-run"))
            result = runner.run_task(parse_tasks(TASK_MD)[0])
            self.assertEqual(result.retry_count, 1)
            self.assertEqual(
                [item.stage_id for item in result.stage_results],
                ["plan", "implement", "review", "plan", "implement", "review"],
            )
    def test_task_preflight_fails_when_task_specific_test_file_is_missing(self) -> None:
@ -963,12 +1029,14 @@ Acceptance Criteria:
            (root / "fake_writer.py").write_text(
                "\n".join(
                    [
-                        "print('```file:story/chapters/scene.md')",
+                        "print('FILE: story/chapters/scene.md')",
                        "print('---CONTENT---')",
                        "print('scene prose')",
-                        "print('```')",
+                        "print('---END---')",
-                        "print('```file:story/plot-state.md')",
+                        "print('FILE: story/plot-state.md')",
                        "print('---CONTENT---')",
                        "print('state')",
-                        "print('```')",
+                        "print('---END---')",
                    ]
                ),
                encoding="utf-8",
--- a/tests/test_repo_tools.py
+++ b/tests/test_repo_tools.py
@ -82,6 +82,30 @@ lookup_requests:
            self.assertIn("src/app.py:1", matches)
            self.assertNotIn(".nightshift", matches)
    def test_repo_tools_skip_configured_path_parts(self) -> None:
        with tempfile.TemporaryDirectory() as directory:
            root = Path(directory)
            (root / "dist").mkdir()
            (root / "dist" / "bundle.js").write_text("needle\n", encoding="utf-8")
            (root / "src").mkdir()
            (root / "src" / "app.js").write_text("needle\n", encoding="utf-8")
            safety = SafetyConfig(
                require_clean_worktree=False,
                scoped_paths=(".",),
                allowed_commands=(),
                forbidden_commands=(),
                skip_repo_parts=("dist",),
            )
            tools = RepoTools(root, safety, ArtifactStore(root, ".nightshift", run_id="test-run"))
            files = tools.list_files(".")
            matches = tools.grep("needle", ".")
            self.assertIn("src/app.js", files)
            self.assertNotIn("dist/bundle.js", files)
            self.assertIn("src/app.js:1", matches)
            self.assertNotIn("dist/bundle.js", matches)
 if __name__ == "__main__":
    unittest.main()