gnoma

Author	SHA1	Message	Date
vikingowl	bf05a5866b	feat(openai): lexical repair for malformed tool-call arguments Local-model servers (Ollama, llama.cpp, llamafile) routed through the OpenAI-compatible path frequently emit tool-call arguments that are almost valid JSON — wrapped in markdown fences, padded with prose, or trailing a stray comma. Strict parsing fails, the engine receives empty args, and the agent loop has to retry or escalate. Adds repairArgs(raw) at the EventToolCallDone boundary: strict-parse first, then apply cheap lexical fixes (strip ```json fences, drop trailing commas before }/], extract the first balanced {...} block with proper string/escape awareness). On success, the repaired bytes flow through unchanged; on failure, the original is returned and downstream parsing surfaces the error as before. Frontier providers (OpenAI proper, Anthropic, Mistral, Google) are unaffected — their SDKs return structured args that pass strict parse. The repair only does work when the upstream output is malformed. 11 unit tests cover: valid passthrough, empty, trailing commas, single/double-line fences, prose-wrapped, braces-inside-strings, multiple top-level objects (takes the first), and unrepairable input. A stream-level test verifies the wiring through flushNextToolCall.	2026-05-19 17:59:05 +02:00
vikingowl	10e5015059	chore(lint): clear remaining errcheck and staticcheck findings Brings the project to a clean `make lint` baseline (0 issues). Mechanical: - Wrap deferred resp.Body.Close() in closures (router/discovery.go, router/probe.go) so the unchecked return surfaces as `_ = ...`. - Apply `_ = ...` (single or multi-return blank) to test-file calls that intentionally ignore errors: os.MkdirAll / os.WriteFile / os.Chdir in setup paths, Close / Shutdown in teardown, Submit / Spawn / Send / LoadDir in tests that assert on side effects. Structural: - engine.handleRequestTooLarge drops the unused req parameter and rebuilds the request from compacted history (SA4009 — argument was overwritten before first use). - provider.ClassifyHTTPStatus and google.applyCapabilityOverrides switch to tagged switches over the discriminator (QF1002). - tui.app.go MouseWheel + inputMode and cmd/gnoma main slm-status use tagged switches in place of equality chains (QF1003). - cmd/gnoma main.go merges a var decl with its immediate assignment (S1021). - Three empty-branch sites (dispatcher_test, loader_test, coordinator_test) become real assertions or get the dead `if` removed (SA9003).	2026-05-19 17:53:42 +02:00
vikingowl	b136a30a39	feat(engine): early-stop detection for runaway agent loops Adds three lightweight per-turn detectors that fire corrective user messages back into the conversation when the model goes off the rails: - RepetitionDetector: sliding-window scan over streamed text deltas; trips when a 50/80/120-char pattern repeats >= 3 times in the trailing 200 chars. Breaks the active stream and injects a correction. - PatchFailureTracker: per-path counter for fs.edit/fs.write failures; trips on the 4th consecutive failure and steers the model to fs.write rather than another fs.edit on the same path. Success decrements with a floor of 0; paths are isolated. - DetectGreeting: narrow allowlist for "how can I help" style replies; only consulted after a round that used tools, so first-turn greetings don't false-positive. Detector state is per-turn (declared locally in runLoop), single- goroutine use. Corrective messages are appended as user-role text to both engine history and the context window. Telemetry: each trigger logs at INFO with round + path where applicable. Covered by 12 unit tests for the primitives and 5 loop-level integration tests that drive the full agentic loop via the existing eventStream mock.	2026-05-19 17:39:35 +02:00
vikingowl	19e6a33862	chore(lint): clear dead code and tighten lifecycle errcheck Removes five unused funcs/vars/fields that golangci-lint had been flagging (anthropic.toolCallDoneEvent, mistral.translateMessages, hook.newError, subprocess.vibeParser.lastAssistantMsgID, tui.cBase), two ineffectual assignments (tui/rendering.go visible-window loop, subprocess stream_test setup), and a stale if/HasPrefix that's now a strings.TrimPrefix. Wires errcheck onto every subprocess / stream lifecycle path so a failed close or shutdown is at least logged rather than silently dropped: - engine/loop.go: stream.Close on both the error and success paths - mcp/manager.go: Shutdown when StartAll partial-fails; Transport close after Initialize failure - mcp/transport.go: stdin.Close + syscall.Kill on graceful-timeout fallback - slm/download.go: Close propagated as a named-return error on the success path; explicitly discarded on the rollback path - slm/classifier.go, slm/manager.go, hook/prompt.go, context/summarize.go, config/write.go, cmd/gnoma/main.go, tool/fs/grep.go: explicit ignores or error logging on Close / Shutdown / WalkDir / Scanln Production-code errcheck and ineffassign are now zero. Remaining golangci-lint output is test-only Close-in-defer noise plus stylistic staticcheck QF suggestions, left alone.	2026-05-19 17:05:54 +02:00
vikingowl	79ae77306a	chore(audit): polish remaining audit findings (M2, H1, H3) - M2: stop echoing the matched pattern name in the user-visible [BLOCKED: ...] message returned by the firewall. The pattern (and the matched secret class) still appear in the operator log, but the string sent back into the prompt is now generic. - H1: document Rule.Pattern semantics on the Rule type and pin them with a regression test. Pattern is a case-sensitive, exact substring match against the JSON-serialised tool arguments — not a glob, regex, or whitespace-insensitive match. The new test exercises both matches and the documented gotchas (double-space, case drift, tab). - H3: every code path in CommandExecutor.Execute that converts a hook failure into Allow via FailOpen now emits a WARN naming the hook and the failure mode (timeout / launch_error / parse_error), so chronic hook failure or abuse is visible in operator logs. Also tightens errcheck on permission/rule.go (Printer.Print on a strings.Builder cannot error in practice; make the intent explicit).	2026-05-19 17:05:39 +02:00
vikingowl	48dd111484	feat(plugin): trust-on-first-use manifest pinning Plugins are now verified against ~/.config/gnoma/plugins.pins.toml at load time. Each plugin's plugin.json bytes are hashed (SHA-256) and: - recorded automatically on first load (TOFU) with a prominent warning - compared on subsequent loads - refused with a clear error if the hash drifted, without overwriting the pin so the user can review and re-enrol deliberately Pin-store I/O failures degrade to load-without-pinning rather than locking the user out of previously-trusted plugins. Closes audit finding C2. See ADR-003 for the decision rationale and docs/plugins-trust.md for the end-user trust model.	2026-05-19 16:44:09 +02:00
vikingowl	acc25ca432	fix(hook): execute hook Exec as a binary, not via sh -c Plugin loader resolves HookSpec.Exec as a relative path joined to the plugin directory, and manifest.checkSafePath rejects absolute paths and '..' traversal — Exec was always meant to be an executable path. The hook executor was wrapping it in 'sh -c', adding a redundant shell interpretation step that turned any space, quote, or metacharacter in the path into command-injection surface. Switch to exec.Command(path) with no shell wrapping. Closes audit finding C3. Adds a regression test that fails under the old 'sh -c' code path: a canary file created via shell sequencing remains absent when the executor treats Exec as a literal filename. Hook command tests now write small /bin/sh scripts to t.TempDir and point Exec at those — matching production semantics (resolved binary path) rather than inline shell strings.	2026-05-19 16:30:23 +02:00
vikingowl	b35c690b4a	test(permission): lock in elf safety-pattern inheritance Audit finding H2 hypothesised that spawn_elfs/agent's safetyCheck exemption could be reached as a bypass route if the spawned elf failed to enforce the same patterns. Verified by inspection that: 1. WithDenyPrompt copies safetyDenyPatterns into the elf checker. 2. Check() runs safetyCheck (Step 2) before ModeBypass (Step 3), so bypass cannot skip safety. 3. main.go always passes the parent permChecker to the elf Manager. H2 is not exploitable in current code. This test pins the contract so future refactors of WithDenyPrompt cannot silently drop pattern inheritance.	2026-05-19 16:21:53 +02:00
vikingowl	a533dfff9f	test(elf): make mockProvider.calls atomic Race detector flagged concurrent access to mockProvider.calls during TestManager_SpawnAndList and TestManager_WaitAll, where multiple spawned engines share the same mock. Switch to atomic.Int64. Closes audit finding L1. `go test -race ./...` is now fully green.	2026-05-19 16:19:40 +02:00
vikingowl	b36ef564ab	fix(engine): guard mutable state with a mutex Engine.history, usage, activatedTools, modelCaps, turnOpts, and cfg.Provider/Model are now mutated and read under e.mu. The lock is released across blocking provider.Stream calls so external setters (SetProvider, SetHistory, InjectMessage, etc.) can interleave. History() now returns a copy. Snapshot helpers (latestUserPrompt, historySnapshot, snapshotTurnOpts, etc.) replace the unsynchronised reads scattered through runLoop and buildRequest. Closes audit finding H4. Adds a race regression test that fails under -race before the fix and passes after.	2026-05-19 16:18:17 +02:00
vikingowl	153a7e3cf9	feat(fs): enforce workspace boundary on fs tools Adds a Guard that resolves every path against an allowlist of absolute roots (default: cwd) and rejects anything escaping via relative segments, absolute paths outside the root, or symlinks (including symlinked parents on writes). Closes audit finding C1: fs.read/fs.write/fs.edit/fs.glob/fs.grep/fs.ls previously accepted any absolute path; the only protection was a substring denylist (.env, .ssh/, ...) which missed /etc/shadow, kube configs, IDE secrets, and anything reachable via symlink.	2026-05-19 16:07:29 +02:00
vikingowl	a96d3c490d	feat: various improvements to engine, router, and TUI - engine/loop: enhanced loop handling - router: dynamic model discovery and task improvements - tui: suggestion box, input mode indicator, completions enhancements Generated by Mistral Vibe. Co-Authored-By: Mistral Vibe <vibe@mistral.ai>	2026-05-07 22:51:50 +02:00
vikingowl	8ed06ec574	feat: add dynamic model discovery within providers - OpenAI provider: use Models.ListAutoPaging() to discover available models - Anthropic provider: use Models.ListAutoPaging() to discover available models - Google provider: use Models.All() iterator to discover available models - All providers fall back to hardcoded lists if API calls fail - Add capability inference functions for each provider based on model ID - Add tests for model discovery fallback behavior This enables gnoma to dynamically discover new models as they become available from cloud providers, while maintaining backward compatibility with fallback lists for offline use or API failures. Generated by Mistral Vibe. Co-Authored-By: Mistral Vibe <vibe@mistral.ai>	2026-05-07 22:27:24 +02:00
vikingowl	d3befd72a5	feat(tui): suggestion box above input, input mode indicator, ! execute - Suggestion dropdown now renders between separator and input (not in chat area) — no more box at the top of an empty chat - Ghost text suppressed when dropdown is visible (eliminates the 'fig' / trailing text on the right) - Bottom separator shows purple 'cmd' label when typing '/' and yellow 'exec' label when typing '!' - '! <cmd>' prefix executes a raw shell command inline and shows output in the chat (same as /shell but one-shot)	2026-05-07 17:35:45 +02:00
vikingowl	7159e64270	perf+feat: parallel startup discovery + slash-command suggestion dropdown Startup: HarvestAliases, HarvestInventory, DiscoverCLIAgents, and DiscoverLocalModels now run concurrently. Worst case latency drops from sum(all) to max(all) — eliminates the 15s inventory timeout from blocking the main path. TUI: typing '/co' now shows a bordered dropdown of all matching commands with descriptions. ↑↓ navigate, Tab/Enter accepts the highlighted entry, Esc dismisses. Ghost-text still works for unique unambiguous matches.	2026-05-07 17:30:16 +02:00
vikingowl	056500541f	feat(tui): /config opens interactive settings panel Replaces the text dump with a navigable bordered overlay. ↑↓ to move, Enter to cycle/toggle values, Esc to close. Shows: Model (cycles through discovered arms), Permission mode, Incognito toggle.	2026-05-07 17:23:43 +02:00
vikingowl	d3fdfe30b9	feat(cli): add 'gnoma providers' subcommand Lists configured provider, auto-discovered CLI agents (claude/gemini/vibe), running local models (ollama/llamacpp), and SLM status in one shot.	2026-05-07 17:15:46 +02:00
vikingowl	0220f9e2cc	fix(slm): start llamafile in background; use lazyClassifier Blocking Start() call (up to 15s) no longer delays TUI startup. lazyClassifier falls back to heuristic until llamafile is healthy, then atomically swaps in the SLM classifier.	2026-05-07 17:13:56 +02:00
vikingowl	917cbd07f7	fix(slm): skip re-download when already set up Setup() now returns early if Status() == StatusReady. CLI also prints the existing path/size instead of starting a download.	2026-05-07 17:10:16 +02:00
vikingowl	035b63ea83	fix(slm): invoke llamafile via sh to bypass Wine binfmt_misc APE polyglot binaries start with MZ magic bytes which Wine's binfmt_misc rule intercepts on Linux. llamafile is also a valid POSIX shell script; running it via 'sh' bypasses the kernel's binfmt_misc lookup entirely.	2026-05-07 17:08:52 +02:00
vikingowl	f19f666e0d	fix: provider-agnostic startup + slm setup auto-config Remove the hardcoded mistral default so gnoma starts without any provider configured. TUI mode uses a stubProvider that lets CLI agent arms (claude, gemini, etc.) handle routing; pipe mode prints a clear setup message. Also: gnoma slm setup now auto-writes the default model_url to the global config when none is set, instead of erroring.	2026-05-07 17:05:06 +02:00
vikingowl	e7dddd201b	fix(cli): three UX issues — help output, TUI startup, setup command - Custom flag.Usage: shows subcommands and usage patterns; -h is no longer useless - system flag default is now '' (applies built-in at runtime); flag help no longer spews the entire system prompt - API key check skips hard-exit in TUI mode; TUI starts and surfaces auth errors inline on first request instead of blocking at launch - gnoma slm setup: progress shows speed (bytes/s), no hardcoded model URL in error message, points to llamafile releases page instead	2026-05-07 16:53:57 +02:00
vikingowl	0b52a68703	feat(slm): Wave C — SLM classifier, MaxComplexity routing, CLI subcommands, TUI status - slm.Classifier: openaicompat → llamafile, 2s timeout + heuristic fallback, heuristic baseline blended so Priority/RequiredEffort are never zeroed, extractJSON strips markdown fences from small-model responses - router.ParseTaskType: case-insensitive string → TaskType, unknown → TaskGeneration - router.Arm.MaxComplexity: zero = no ceiling (preserves existing arm behavior); filterFeasible excludes arms when task.ComplexityScore > MaxComplexity - config.SLMSection: [slm] enabled / model_url / data_dir - openaicompat.NewLlamafile: no API key, model = "default", no retries - slm.Manager: DefaultDataDir() (XDG), Manifest() accessor - cmd/gnoma: `gnoma slm setup` / `gnoma slm status` subcommands; SLM arm registered with MaxComplexity=0.3 when enabled + set up - tui: /config shows slm status (ready/missing/not set up + base URL if running) - docs: roadmap updated to reflect llamafile pivot from Ollama	2026-05-07 16:44:32 +02:00
vikingowl	bd43d02713	feat(slm): Wave B — Manager, Manifest, download, subprocess lifecycle - Manifest: JSON read/write with atomic rename; presence = ready invariant - download: HTTP fetch with SHA256 computation, progress callback, cleanup on failure - Manager: Status (NotSetUp/Ready/Missing), Setup (download + manifest write), Start (freePort, exec, PID file, health check), Stop, BaseURL - waitHealthy: polls /health with 15s ceiling and context cancellation - reapStalePID: kills stale process from previous run on next Start - 28 tests; all pass	2026-05-07 16:23:46 +02:00
vikingowl	e24154e543	feat(classifier): Wave A — TaskClassifier interface + HeuristicClassifier - internal/router/classifier.go: TaskClassifier interface with Classify(ctx, prompt, history) signature. HeuristicClassifier wraps the existing ClassifyTask() with zero behavior change. - engine.Config.Classifier: injectable TaskClassifier; nil defaults to HeuristicClassifier. Engine.classify() helper handles nil + error fallback transparently. - loop.go: all four router.ClassifyTask() call sites replaced with e.classify(ctx, prompt). SLMClassifier slots in without further changes to the engine.	2026-05-07 16:11:20 +02:00
vikingowl	9b2ab40115	feat(pty): Phase 2 — interactive shell and bash interactive detection - /shell [cmd]: launch user's $SHELL via tea.ExecProcess (PTY handoff) hands terminal to the shell and restores TUI on exit. /shell <cmd> runs that command in the shell directly. Detects $SHELL > $COMSPEC > /bin/sh|powershell.exe in order. - bash tool: detect interactive commands before execution Prefix-interactive: sudo, ssh, passwd, vim/vi/nano, less/more, htop/top, mysql/psql, ftp/sftp, git push. Exact-interactive (REPL): python3/python/node/irb/iex/ghci/julia. Returns a tool result with interactive=true metadata and a hint to use /shell instead of hanging or erroring. - completions: add /shell to builtin command list - help: document /shell [cmd]	2026-05-07 15:52:56 +02:00
vikingowl	995b08dc0f	feat(engine): M8 cleanup — Wave B skill enforcement - Add tool.PathSensitiveTool interface (ExtractPaths); implement on all 6 fs tools - Add engine.TurnOptions.AllowedPaths: restricts tool filesystem access per skill invocation - Bash is denied outright when AllowedPaths is active (unparseable command args) - fs tools with empty path (cwd default) resolved via os.Getwd() and validated - Add engine.TurnOptions.AllowedTools + AllowedPaths wiring in pipe mode (main.go) and TUI skill dispatch (tui/app.go) - Remove TODO(M8.3) from skill.Frontmatter — enforcement is now complete	2026-05-07 15:29:33 +02:00
vikingowl	fc465e5f29	feat(engine): M8 cleanup — Wave A wiring gaps - Remove stale TODO(P0c) comment from main.go (resolved by P0c tier routing) - Wire config.Provider.Temperature → engine.Config.Temperature → provider.Request - Add WithMaxFileSize option to fs.write; wire cfg.Tools.MaxFileSize in main.go - Wire router.ReportOutcome after each runLoop return (success = err == nil) - Fix nil-callback guard on EventRouting dispatch (pre-existing bug exposed by new test)	2026-05-07 15:22:22 +02:00
vikingowl	0c65eeda08	docs: consolidated roadmap, ADR-013, drop stale plans - New 7-phase roadmap (2026-05-07-gnoma-roadmap.md) covering M8 cleanup, PTY interactive shell, SLM classifier, router revisit, USP security, ELF support, and distribution - ADR-013 (002-slm-routing.md): SLM-first routing supersedes ADR-009; Thompson Sampling deferred pending SLM production data - ADR-009 status updated to "Superseded by ADR-013" - gemma-integration-analysis.md: header note that Node.js specifics (LiteRT-LM, daemon, PID) don't apply to gnoma's Go implementation - TODO.md replaced with thin pointer to roadmap + stable backlog - Deleted stale plan/spec files: m6-m7-closeout, m8-hooks-design	2026-05-07 15:06:54 +02:00
vikingowl	76988453ab	docs: note routing revisit after SLM integration	2026-05-07 14:41:37 +02:00
vikingowl	640860404a	feat(router): tier-based routing — CLI > local > API, disabled arms Adds explicit tier preference to arm selection so the router deterministically prefers lower-cost arms before falling back: tier 0: CLI agents (IsCLIAgent=true, subprocess/claude|gemini|vibe) tier 1: local models (IsLocal=true, ollama/llamacpp) tier 2: API providers (everything else) Within a tier, quality/cost scoring still applies. filterFeasible still gates on quality thresholds, so a low-quality local arm won't beat a high-quality API arm when the task's minimum threshold rules it out. Also adds Arm.Disabled: arms with Disabled=true are excluded from auto-routing but remain selectable via ForceArm. Implementation: armTier helper + selectBest refactored to try tiers in order, bestScored picks within a tier. router.Select skips disabled arms in allArms collection (forced arm bypasses disable check).	2026-05-07 14:36:36 +02:00
vikingowl	f213d8f9ce	feat(provider): subprocess CLI provider for claude, gemini, vibe Adds internal/provider/subprocess — a provider.Provider that spawns CLI agents (claude, gemini, vibe) as subprocesses and streams their output. - FormatParser interface + three parsers for claude-stream-json, gemini-stream-json, and vibe-streaming formats; fixtures captured from real binaries - subprocessStream: pull-based stream.Stream over subprocess stdout with bounded stderr capture (8KB) and guarded reap() to prevent double-Wait - DiscoverCLIAgents: parallel PATH scan with 10s timeout, stable ordering - Provider: only the last user message is passed as --prompt; all other request fields (history, tools, system prompt) are intentionally ignored (see package doc) - main.go: discover and register CLI arms at startup; TODO(P0c) for tier-based routing to enforce preference order explicitly	2026-05-07 14:29:34 +02:00
vikingowl	f9b8c1886b	feat(router): normalize effort/thinking abstraction across providers Add EffortLevel (auto/low/medium/high) as a provider-agnostic reasoning control, replacing the Capabilities.Thinking bool. Each provider maps the level to its native parameter: Anthropic budget tokens (1K/8K/16K), OpenAI reasoning_effort (low/medium/high), Google thinking budget (1K/8K/16K). Task classification auto-infers effort from TaskType and complexity; filterFeasible excludes arms that lack the required level.	2026-05-07 14:08:50 +02:00
vikingowl	86b5169bff	docs: update TODO with Native SLM Runtime integration - Replace Gemma Integration with expanded SLM Preflight Engine section - Add Deep Intent Routing (Skill Decomposer, Context Flattener, HITL toggle) - Add Security & Iron Law Integration (USP Pre-Audit, Hallucination Gate) - Include Recommended Tiny Stack table (Gemma 3 270M, ollama/llm, Q4_K_M GGUF) - Document the Integrated Flow for local vs frontier routing Generated by Mistral Vibe. Co-Authored-By: Mistral Vibe <vibe@mistral.ai>	2026-05-07 11:36:00 +02:00
vikingowl	e6489b31a9	docs: add TODO roadmap for gemma routing, USP integration, local tmp, and ELF support	2026-05-07 00:21:52 +02:00
vikingowl	3873f90f83	feat: local model reliability — SDK retries, capability probing, init skill, context compaction Three compounding bugs prevented tool calling with llama.cpp: - Stream parser set argsComplete on partial JSON (e.g. "{"), dropping subsequent argument deltas — fix: use json.Valid to detect completeness - Missing tool_choice default — llama.cpp needs explicit "auto" to activate its GBNF grammar constraint; now set when tools are present - Tool names in history used internal format (fs.ls) while definitions used API format (fs_ls) — now re-sanitized in translateMessage Additional changes: - Disable SDK retries for local providers (500s are deterministic) - Dynamic capability probing via /props (llama.cpp) and /api/show (Ollama), replacing hardcoded model prefix list - Engine respects forced arm ToolUse capability when router is active - Bundled /init skill with Go template blocks, context-aware for local vs cloud models, deduplication rules against CLAUDE.md - Tool result compaction for local models — previous round results replaced with size markers to stay within small context windows - Text-only fallback when tool-parse errors occur on local models - "text-only" TUI indicator when model lacks tool support - Session ResetError for retry after stream failures - AllowedTools per-turn filtering in engine buildRequest	2026-04-13 02:01:01 +02:00
vikingowl	99529e6156	fix: deterministic 500 retry, OpenAI error wrapping, local /init prompt Stop retrying llama.cpp 500s that are deterministic tool-parse failures by inspecting the error message body (ClassifyHTTPError). Wrap OpenAI SDK errors as ProviderError so the engine's retry logic classifies them. Add localInitPrompt for local models that uses sequential fs_* calls instead of spawn_elfs (which local models can't produce reliably).	2026-04-12 18:35:18 +02:00
vikingowl	0adf118675	fix(router): discovery loop removes forced arm, breaking routing The discovery loop's reconcileArms removed the CLI-forced arm (llamacpp/default) because the llama.cpp server reports the real model name (e.g. gemma-26b), creating a mismatch. After 30s the forced arm disappeared and all subsequent requests failed. Three-layer fix: - Eager: query the specific provider at startup to resolve the real model name before registering the forced arm - Lazy: reconcileArms detects placeholder "default" arm names and atomically renames them when discovery reveals the real identity, with an onReconcile callback to update the session and TUI - Guard: the forced arm is never garbage-collected by the removal loop Also fixes misleading /init error messaging — failed inits now show "loaded from disk (init failed)" instead of "AGENTS.md written to".	2026-04-12 17:51:30 +02:00
vikingowl	88e6bdb2a4	feat(tui): Tier 3-4 UX improvements — split, routing, session naming, context bar - Split app.go (2091→1378 lines) into rendering.go, events.go, init.go - Add EventRouting stream event for router arm transparency - Add session auto-naming from first user message - Add context window progress bar in status bar - Add /keys cheatsheet, /replay for resumed sessions - Add inline cost-per-turn after assistant responses - Add diff previews in fs.write/fs.edit permission prompts - Collapse tool output to 3 lines by default (ctrl+o expands) - Use AddPrefix for system context instead of InjectMessage - Handle ContentThinking and ContentToolResult in session resume - Show session title in resume picker - Add /model numeric selection snapshot safety	2026-04-12 05:13:16 +02:00
vikingowl	0b1f8cb5ec	feat(tui): Tier 1-2 UX improvements — completions, usage, provider status Tier 1 (launch blockers): - Remove /shell from /help (advertised but unimplemented) - Kill dead _ = closeLen assignment - Cache glamour renderer by width — no longer recreated on every WindowSizeMsg when width hasn't changed Tier 2 (ship-quality UX): - Slash command ghost-text completion with Tab accept. Sources: static command list + dynamic skill names. /permission gets arg completion for the 6 modes. - /compact reports before/after token counts (e.g. "32k → 18k tokens") - /provider shows all registered arms grouped by provider, not just "restart required" - /usage command: input/output/total tokens, context %, provider, turns - Widen Ctrl+C quit window from 1s to 2s - "new content below" indicator when scrolled up during streaming - Permission prompt: inline chat notification when approval needed, so the user notices even if focused on input	2026-04-12 04:19:55 +02:00
vikingowl	e5a1d21f53	fix: append mutation, pipe-mode hang, Mistral regex false positives - Fix append footgun: allHooks/allMCPServers allocated fresh to avoid mutating cfg's backing array (lines 391/413 in main.go) - Fix pipe-mode permission prompt: detect no-TTY stdin and auto-deny instead of blocking forever on fmt.Scanln EOF - Tighten Mistral API key regex from bare [a-zA-Z0-9]{32} (matched commit hashes, UUIDs) to context-gated pattern requiring "mistral" keyword nearby. Added scanner test for positives and negatives. - Remove README demo GIF TODO placeholder - Unify version string: pass buildVersion from ldflags into tui.Config instead of hardcoding "v0.1.0-dev" - Populate benchmarks doc with actual Go benchmark results	2026-04-12 03:49:47 +02:00
vikingowl	d7b524664d	fix(m8): replace_default map, error UX, benchmarks, and launch prep - Fix replace_default positional bug: []string → map[string]string for explicit MCP tool → built-in name mapping - Improve error messages for missing API keys (3 actionable options) and unknown providers (early validation with available list) - Remove python3 dependency from MCP tests (pure bash grep/sed parsing) - Add router benchmark scaffold (6 benchmarks in bench_test.go + docs) - Add .goreleaser.yml for cross-platform binary releases with ldflags - Add launch-ready README with quickstart, extensibility docs, GIF placeholder - Add CONTRIBUTING.md and Gitea issue templates (bug report, feature request)	2026-04-12 03:34:58 +02:00
vikingowl	d2d79d65da	feat(m8): MCP client, tool replaceability, and plugin system Complete the remaining M8 extensibility deliverables: - MCP client with JSON-RPC 2.0 over stdio transport, protocol lifecycle (initialize/tools-list/tools-call), and process group management for clean shutdown - MCP tool adapter implementing tool.Tool with mcp__{server}__{tool} naming convention and replace_default for swapping built-in tools - MCP manager for multi-server orchestration with parallel startup, tool discovery, and registry integration - Plugin system with plugin.json manifest (name/version/capabilities), directory-based discovery (global + project scopes with precedence), loader that merges skills/hooks/MCP configs into existing registries, and install/uninstall/list lifecycle manager - Config additions: MCPServerConfig, PluginsSection with opt-in/opt-out enabled/disabled resolution - TUI /plugins command for listing installed plugins - 54 tests across internal/mcp and internal/plugin packages	2026-04-12 03:09:05 +02:00
vikingowl	d51f76aee0	docs: mark M8.2 skill system deliverables complete in milestones.md	2026-04-07 02:25:29 +02:00
vikingowl	4a37e84114	feat(skill): enhanced coordinator prompt with fan-out and concurrency guidance	2026-04-07 02:24:49 +02:00
vikingowl	338d4a12b8	feat(skill): pipe mode support and main.go wiring	2026-04-07 02:19:42 +02:00
vikingowl	71b0cf9490	feat(skill): TUI integration — /skillname invokes skills, /skills lists them	2026-04-07 02:18:12 +02:00
vikingowl	995b26ffe7	feat(skill): registry with multi-directory loading and precedence	2026-04-07 02:17:17 +02:00
vikingowl	42fc2adcd8	feat(skill): bundled /batch skill with go:embed	2026-04-07 02:16:35 +02:00
vikingowl	327e4d74c0	feat(skill): template rendering with Go text/template	2026-04-07 02:15:51 +02:00
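
The lexical repair described in commit bf05a5866b (strict parse first, then strip fences, extract the first balanced {...} block with string/escape awareness, and drop trailing commas) could look roughly like the sketch below. This is not the repository's code: the helper name firstBalancedObject, the regular expression, and the string-based signature are assumptions made for illustration, and the simple trailing-comma regex does not skip string literals the way a production version might.

```go
package main

import (
	"encoding/json"
	"fmt"
	"regexp"
	"strings"
)

// trailingComma matches a comma that is followed only by whitespace and a
// closing } or ]. Assumption: a plain regex is good enough for a sketch; it
// does not check whether the comma sits inside a string literal.
var trailingComma = regexp.MustCompile(`,(\s*[}\]])`)

// firstBalancedObject (hypothetical helper) returns the first {...} block
// whose braces balance, ignoring braces inside JSON strings and honouring
// backslash escapes.
func firstBalancedObject(s string) (string, bool) {
	start := strings.IndexByte(s, '{')
	if start < 0 {
		return "", false
	}
	depth := 0
	inString, escaped := false, false
	for i := start; i < len(s); i++ {
		c := s[i]
		switch {
		case escaped:
			escaped = false // the escaped character is consumed as-is
		case inString:
			if c == '\\' {
				escaped = true
			} else if c == '"' {
				inString = false
			}
		case c == '"':
			inString = true
		case c == '{':
			depth++
		case c == '}':
			depth--
			if depth == 0 {
				return s[start : i+1], true
			}
		}
	}
	return "", false
}

// repairArgs tries a strict parse first and only then applies cheap lexical
// fixes. If the result still is not valid JSON, the original is returned so
// downstream parsing reports the error.
func repairArgs(raw string) string {
	if json.Valid([]byte(raw)) {
		return raw
	}
	fence := strings.Repeat("`", 3)
	s := strings.TrimSpace(raw)
	s = strings.TrimPrefix(s, fence+"json")
	s = strings.TrimPrefix(s, fence)
	s = strings.TrimSuffix(strings.TrimSpace(s), fence)
	if obj, ok := firstBalancedObject(s); ok {
		s = obj
	}
	s = trailingComma.ReplaceAllString(s, "$1")
	if json.Valid([]byte(s)) {
		return s
	}
	return raw
}

func main() {
	fence := strings.Repeat("`", 3)
	fmt.Println(repairArgs(fence + "json\n{\"path\": \"a.txt\",}\n" + fence))
	fmt.Println(repairArgs(`Sure! {"a": "}{"} done`))
	fmt.Println(repairArgs(`not json at all`))
}
```

Returning the original input on failure keeps the repair step invisible to callers: valid arguments pass through untouched, and unrepairable ones fail in the same place they did before.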

148 Commits
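
The PatchFailureTracker behaviour summarised in commit b136a30a39 (a per-path counter that trips on the 4th failure, decrements on success with a floor of 0, and keeps paths isolated) can be illustrated with the minimal sketch below. The type layout, constructor, and method names are assumptions for illustration only; the commit message does not show the actual API.

```go
package main

import "fmt"

// PatchFailureTracker counts edit/write failures per path.
// Hypothetical sketch: field and method names are not taken from the repository.
type PatchFailureTracker struct {
	threshold int
	failures  map[string]int
}

// NewPatchFailureTracker builds a tracker that trips once a path has
// accumulated threshold failures.
func NewPatchFailureTracker(threshold int) *PatchFailureTracker {
	return &PatchFailureTracker{threshold: threshold, failures: make(map[string]int)}
}

// RecordFailure bumps the counter for path and reports whether the
// threshold has been reached for that path.
func (t *PatchFailureTracker) RecordFailure(path string) bool {
	t.failures[path]++
	return t.failures[path] >= t.threshold
}

// RecordSuccess decrements the counter for path, never going below zero.
func (t *PatchFailureTracker) RecordSuccess(path string) {
	if t.failures[path] > 0 {
		t.failures[path]--
	}
}

func main() {
	tracker := NewPatchFailureTracker(4)
	for i := 1; i <= 4; i++ {
		fmt.Printf("failure %d on a.go, tripped=%v\n", i, tracker.RecordFailure("a.go"))
	}
	// A different path has its own counter.
	fmt.Println("first failure on b.go, tripped:", tracker.RecordFailure("b.go"))
}
```

Like the detector state in the commit, this sketch assumes single-goroutine use, so the map is not guarded by a lock.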