mirror of https://github.com/khodges42/glassMind.git synced 2026-06-14 18:18:36 +00:00

K. Hodges fc9b2efd0b Initialize Glassmind Rust CLI and markdown vault scanner

2026-05-24 02:18:08 -07:00

14 KiB

Raw Blame History

Glassmind Design Doc

One-line Summary

Glassmind is a local-first RAG and memory API for Obsidian vaults. It indexes markdown notes, performs semantic and metadata-aware search, and exposes useful context to external agents such as Claude, Codex, Hermes, Nightshift, or local models.

Non-goals

Glassmind is not:

an autonomous agent
a replacement for Obsidian
a cloud memory service
a chatbot
a startup-brain-theft machine
a magic AI filesystem

Glassmind is the memory layer.

Agents call Glassmind when they need context.

1. Core Philosophy

Source of Truth

Obsidian markdown files are canonical.

The database is rebuildable.

Obsidian vault = truth
SQLite DB = index/cache
Vector index = semantic search cache
.agent/ = Glassmind-owned workspace

If the database is deleted, Glassmind should be able to rebuild it from the vault.

Ownership Boundary

User notes belong to the user.

Glassmind may freely write to:

.agent/

Editing normal vault files should be optional and policy-controlled.

2. High-Level Architecture

Obsidian Vault
  ↓
Glassmind Indexer
  ↓
SQLite Metadata Store
  ↓
Vector Search Layer
  ↓
CLI / HTTP API / MCP Server
  ↓
Claude / Codex / Hermes / Nightshift / Local Models

Components

Indexer

Walks the Obsidian vault and reads markdown files.

It extracts:

file path
title
headings
sections
tags
wikilinks
frontmatter
modified time
content hash
chunks for retrieval

SQLite Store

Stores structured metadata.

Examples:

notes
chunks
tags
links
headings
frontmatter
index_runs

Vector Store

Stores embeddings for chunks.

For v1:

sqlite-vec

Later options:

Qdrant
LanceDB
Chroma

Embedding Backend

Embeddings convert text into “meaning coordinates.”

Example:

"local agent memory"

becomes a vector like:

[0.12, -0.44, 0.89, ...]

Glassmind compares vectors to find semantically related chunks.

Recommended v1:

Ollama embeddings

Later:

fastembed-rs

API Layer

Glassmind exposes:

CLI commands
HTTP API
MCP tools

MCP means “Model Context Protocol.” It is a standard-ish way for AI tools to call external tools.

3. Vault Layout

Glassmind should work with any Obsidian structure.

It may optionally create:

vault/
  .agent/
    memories/
    summaries/
    tasks/
    decisions/
    logs/
    cache/

`.agent/` Purpose

This is the safe agent-owned area.

It can contain:

generated project summaries
captured memories
task state
context snapshots
audit logs
retrieval reports

Normal user notes are indexed, but not modified by default.

4. Core Concepts

RAG

RAG means “Retrieval-Augmented Generation.”

Instead of shoving the entire vault into an LLM prompt, Glassmind retrieves only relevant notes and chunks.

query → retrieve relevant context → give context to LLM

Embeddings

Embeddings are semantic fingerprints.

They let Glassmind find notes by meaning, not just exact words.

Useful for finding:

"that thing I wrote about local memory"

even if the note says:

"persistent semantic cache for agents"

Chunk

A chunk is a small section of a note.

Usually:

heading section
paragraph group
checklist block
code block
fixed-size fallback slice

Glassmind searches chunks, not entire notes.

Hot / Warm / Cold Memory

Glassmind does not forget by default.

Instead, it ranks memory temperature.

Hot:
  recent notes
  active projects
  pinned context
  recently retrieved chunks

Warm:
  related notes
  linked notes
  older project notes

Cold:
  everything else

Hot memory is more likely to appear in context bundles.

Context Bundle

A context bundle is an LLM-ready packet.

Example:

User query:
  "help me continue Glassmind"

Context bundle:
  - top matching chunks
  - related project notes
  - recent decisions
  - open tasks
  - relevant links
  - source paths

This is probably the most important Glassmind feature.

5. API Design

CLI

glassmind init
glassmind index
glassmind search "local memory tool"
glassmind context "help me continue designing Glassmind"
glassmind read "Projects/Glassmind.md"
glassmind serve

HTTP API

POST /search
POST /context
GET  /notes/{id}
POST /index
GET  /health

MCP Tools

glassmind_search
glassmind_context
glassmind_read
glassmind_hot_context

6. First-Class Commands

`search`

Returns matching chunks.

glassmind search "obsidian rag memory"

Output:

1. Projects/Glassmind.md#Architecture
2. Daily/2026-05-24.md#Local Agents
3. Software Projects/Memory Tool.md#Embeddings

`context`

Returns an LLM-ready context bundle.

glassmind context "what was I thinking about local agent memory?"

This should include:

summarized answer
relevant chunks
source paths
confidence/ranking
suggested follow-up reads

`index`

Builds or updates the index.

glassmind index

Should support incremental indexing.

`serve`

Runs local API server.

glassmind serve

Default:

localhost only

No public network exposure by default.

7. Retrieval Strategy

Do not rely on only one method.

Glassmind should use hybrid scoring.

final_score =
  semantic similarity
+ keyword match
+ tag match
+ path/project boost
+ wikilink proximity
+ recency boost
+ hot memory boost

Why Hybrid Search?

Embeddings are good, but not perfect.

Keyword search is good, but brittle.

Tags are useful, but manually inconsistent.

Graph links are meaningful, but incomplete.

Hybrid search makes the system feel smarter.

8. Data Model Draft

notes

id
path
title
mtime
content_hash
created_at
updated_at

chunks

id
note_id
heading_path
content
chunk_type
start_line
end_line
token_estimate
content_hash

note_tags

note_id
tag_id

links

id
source_note_id
target
link_type

embeddings

chunk_id
model
vector
created_at

memory_events

id
event_type
source
content
created_at

9. Write Policy

Glassmind should support write policies.

Recommended Defaults

[writes]
agent_dir = true
user_notes = "propose"

Modes:

off:
  no writes

agent-only:
  writes only to .agent/

propose:
  produce diffs for user approval

allow:
  direct edits to user notes

For your personal setup, you might allow more.

For a sane default, use:

agent-only + proposed diffs

10. Implementation Stages

Stage 0 — Skeleton

Goal: project exists and runs.

Tasks:

create Rust project
add config file support
add CLI parser
define vault path
add logging
add glassmind init
create .agent/ directory

Suggested crates:

clap
serde
toml
tracing
anyhow

Stage 1 — Vault Indexer

Goal: read markdown vault and store metadata.

Tasks:

recursively walk vault
ignore .obsidian/, .git/, .agent/cache/
read .md files
calculate content hash
extract title
extract tags
extract wikilinks
extract headings
store note records in SQLite

Suggested crates:

walkdir
rusqlite
sha2
pulldown-cmark
gray_matter
regex

Success test:

glassmind index
glassmind stats

Shows:

Notes indexed: 1,247
Chunks indexed: 4,912
Tags indexed: 340
Links indexed: 2,103

Stage 2 — Chunking

Goal: split notes into useful retrieval units.

Tasks:

split by markdown headings
preserve heading path
preserve line numbers
fallback to size-based chunks for long sections
store chunks in SQLite

Chunk types:

heading_section
paragraph
task_block
code_block
frontmatter

Success test:

glassmind read-chunks "Projects/Glassmind.md"

Stage 3 — Keyword Search

Goal: useful search before embeddings.

Tasks:

add SQLite FTS5 table
index chunk text
implement glassmind search
return path, heading, snippet, score

FTS means “full-text search.”

It is SQLite’s built-in text search engine.

Success test:

glassmind search "obsidian memory"

Stage 4 — Embeddings

Goal: semantic search.

Tasks:

add embedding backend trait
implement Ollama embedding backend
store vectors in sqlite-vec
embed chunks
embed queries
return nearest chunks

Backend trait:

trait EmbeddingBackend {
    fn embed(&self, text: &str) -> Result<Vec<f32>>;
}

Config:

[embeddings]
backend = "ollama"
model = "nomic-embed-text"

Success test:

glassmind search "thing about my local second brain"

Finds notes that do not literally say those words.

Stage 5 — Hybrid Ranking

Goal: search feels good.

Tasks:

combine semantic score
combine keyword score
boost recent notes
boost matching tags
boost active project paths
boost wikilink neighbors
show score breakdown in debug mode

Example debug output:

score: 0.87
semantic: 0.52
keyword: 0.18
recency: 0.07
tag: 0.05
link: 0.05

Stage 6 — Context Bundles

Goal: make output useful for agents.

Tasks:

implement glassmind context
deduplicate chunks from same note
group by note
summarize or trim context
include source paths
respect token budget
output markdown and JSON

Example:

glassmind context "continue designing Glassmind" --budget 6000

Output sections:

Relevant Notes
Recent Decisions
Open Questions
Suggested Context
Sources

Stage 7 — HTTP Server

Goal: let tools call Glassmind.

Tasks:

add local HTTP server
implement /search
implement /context
implement /notes/{id}
implement /health
bind to localhost by default

Suggested crate:

axum

Success test:

curl localhost:7331/health

Stage 8 — MCP Server

Goal: Claude/Codex/Hermes can call Glassmind.

Tasks:

expose MCP tools
implement search tool
implement context tool
implement read tool
document setup

Tools:

glassmind_search
glassmind_context
glassmind_read

Stage 9 — Agent-Owned Memory

Goal: allow safe writes to .agent/.

Tasks:

capture-memory
capture-decision
capture-task
append markdown files in .agent/
index .agent/ files
keep audit log

Example:

glassmind capture decision \
  --project Glassmind \
  --text "Obsidian markdown is canonical; SQLite is rebuildable cache."

Stage 10 — Polish / Real Use

Goal: make it worth using daily.

Tasks:

config docs
better errors
incremental indexing
watch mode
pretty CLI output
JSON output
MCP examples
benchmark indexing
backup/rebuild story
test vault fixture

11. MVP Definition

The MVP is successful when this works:

glassmind index
glassmind search "what was I thinking about Obsidian RAG?"
glassmind context "help me continue the Glassmind project"
glassmind serve

And an external agent can call:

glassmind_context

to get useful context from the vault.

12. Risks

Risk: Overengineering

Mitigation:

CLI first
search first
no autonomous agent
no graph DB in v1

Risk: Bad Retrieval

Mitigation:

hybrid search
score debugging
manual eval queries
source visibility

Risk: Vault Corruption

Mitigation:

user notes read-only by default
.agent/ writes only
proposed diffs for user files
audit log

Risk: Slow Indexing

Mitigation:

content hashes
incremental updates
skip unchanged files

Risk: AI Slop in Vault

Mitigation:

agent output goes to .agent/inbox/
user approval before promoting content
clear generated-content markers

13. Opinionated v1 Tech Stack

Language:
  Rust

CLI:
  clap

HTTP:
  axum

Database:
  SQLite

SQL access:
  rusqlite or sqlx

Vector search:
  sqlite-vec

Markdown parsing:
  pulldown-cmark

Frontmatter:
  gray_matter

Embeddings:
  Ollama first

Logging:
  tracing

Config:
  glassmind.toml

14. Example Config

[vault]
path = "C:/Users/kass/Documents/ObsidianVault"

[index]
include_agent_dir = true
ignore_dirs = [".git", ".obsidian", ".trash"]
chunk_target_tokens = 500
chunk_overlap_tokens = 80

[embeddings]
backend = "ollama"
model = "nomic-embed-text"
url = "http://localhost:11434"

[search]
semantic_weight = 0.55
keyword_weight = 0.25
recency_weight = 0.10
link_weight = 0.05
tag_weight = 0.05

[writes]
mode = "agent-only"
agent_dir = ".agent"

[server]
host = "127.0.0.1"
port = 7331

15. Glossary

Agent

An AI-driven system that can use tools.

Examples:

Claude Code
Codex
Hermes
Nightshift
Kiro

Glassmind is not the agent.

Glassmind is the tool the agent calls.

RAG

Retrieval-Augmented Generation.

A system retrieves relevant context before the LLM answers.

Embedding

A vector representation of text meaning.

Used for semantic search.

Vector

A list of numbers representing meaning.

Vector Search

Finding vectors near another vector.

This finds semantically similar text.

Chunk

A smaller piece of a note used for retrieval.

FTS

Full-text search.

Keyword search built into SQLite.

MCP

Model Context Protocol.

A way for AI tools to call external tools.

Canonical Source

The real source of truth.

For Glassmind, this is the Obsidian markdown vault.

Cache

Rebuildable derived data.

The SQLite database is a cache/index, not the truth.

Hot Memory

Recently or frequently useful context.

Cold Memory

Old context that is still searchable but not automatically included.

16. North Star

Glassmind succeeds if an external agent can ask:

“What context from my Obsidian vault matters for this?”

And Glassmind returns something good enough that the agent feels like it actually remembers your projects.

Not fake memory.

Not chatbot vibes.

Actual local context retrieval over your own notes.

14 KiB Raw Blame History Unescape Escape

Glassmind Design Doc

One-line Summary

Non-goals

1. Core Philosophy

Source of Truth

Ownership Boundary

2. High-Level Architecture

Components

Indexer

SQLite Store

Vector Store

Embedding Backend

API Layer

3. Vault Layout

.agent/ Purpose

4. Core Concepts

RAG

Embeddings

Chunk

Hot / Warm / Cold Memory

Context Bundle

5. API Design

CLI

HTTP API

MCP Tools

6. First-Class Commands

search

context

index

serve

7. Retrieval Strategy

Why Hybrid Search?

8. Data Model Draft

notes

chunks

tags

note_tags

links

embeddings

memory_events

9. Write Policy

Recommended Defaults

10. Implementation Stages

Stage 0 — Skeleton

Stage 1 — Vault Indexer

Stage 2 — Chunking

Stage 3 — Keyword Search

Stage 4 — Embeddings

Stage 5 — Hybrid Ranking

Stage 6 — Context Bundles

Stage 7 — HTTP Server

Stage 8 — MCP Server

Stage 9 — Agent-Owned Memory

Stage 10 — Polish / Real Use

11. MVP Definition

12. Risks

Risk: Overengineering

Risk: Bad Retrieval

Risk: Vault Corruption

Risk: Slow Indexing

Risk: AI Slop in Vault

13. Opinionated v1 Tech Stack

14. Example Config

15. Glossary

Agent

RAG

Embedding

Vector

Vector Search

Chunk

FTS

MCP

Canonical Source

Cache

Hot Memory

Cold Memory

16. North Star

14 KiB

Raw Blame History

`.agent/` Purpose

`search`

`context`

`index`

`serve`