T

vikingowl eb0583f606 fix(router): unpin config-default provider + complexity floor by task type

Two routing bugs were keeping the SLM out of every real prompt and,
once it was eligible, pulling complex tasks into it as well.

Bug 1: ForceArm was called unconditionally when a primary provider was
configured (cmd/gnoma/main.go:378). That short-circuited the entire
router — every prompt went straight to whatever was set as
[provider].default, regardless of tier, score, or feasibility. The SLM
arm appeared in `gnoma router stats` registration logs but had zero
observations after dozens of prompts.

Fix: only pin when the user passed --provider on the command line.
Config defaults register the arm but don't force it; the router picks
freely. Verified end-to-end — trivial prompts now reach slm/ollama
via the tier-0 priority.

Bug 2: A short prompt like "refactor the SLM module" classifies as
TaskRefactor with complexity 0.015 — well under the SLM arm's 0.3
ceiling. The arm became eligible despite the task being inherently
non-trivial. Once eligible, tier-0 priority then pulled it in over
the CLI agents.

Fix: add MinComplexityForType, applied in both ClassifyTask
(heuristic path) and slm.Classifier.Classify (SLM-overlay path). The
floor is per-task-type:

  - TaskSecurityReview, TaskOrchestration  → 0.60
  - TaskRefactor, TaskPlanning, TaskDebug  → 0.40
  - TaskUnitTest, TaskReview               → 0.35

Tasks like Explain/Generation/Boilerplate keep their organic
complexity score so trivial knowledge prompts (≤0.15) still fall to
the SLM. Tasks that imply existing code or multi-step reasoning are
clamped above the SLM's MaxComplexity, naturally routing them to a
bigger arm.

After both fixes, observed routing in a clean run:

  What is 2+2?              → slm/ollama (complexity 0.015)
  Define a closure          → slm/ollama (complexity 0.015)
  What is HTTP?             → slm/ollama (complexity 0.015)
  Refactor the SLM module   → subprocess/gemini (complexity 0.40)
  Audit for race conditions → subprocess/gemini (complexity 0.35)
  Plan a migration          → subprocess/gemini (complexity 0.40)

2026-05-19 19:22:16 +02:00

.gitea/issue_template

fix(m8): replace_default map, error UX, benchmarks, and launch prep

2026-04-12 03:34:58 +02:00

cmd/gnoma

fix(router): unpin config-default provider + complexity floor by task type

2026-05-19 19:22:16 +02:00

docs

feat(slm): pluggable backends + trivial-prompt routing

2026-05-19 18:53:32 +02:00

internal

fix(router): unpin config-default provider + complexity floor by task type

2026-05-19 19:22:16 +02:00

.env.example

feat: Ollama/gemma4 compat — /init flow, stream filter, safety fixes

2026-04-05 19:24:51 +02:00

.gitignore

chore: ignore .claude/ tool state directory

2026-05-19 19:06:58 +02:00

.goreleaser.yml

fix(m8): replace_default map, error UX, benchmarks, and launch prep

2026-04-12 03:34:58 +02:00

AGENTS.md

feat: local model reliability — SDK retries, capability probing, init skill, context compaction

2026-04-13 02:01:01 +02:00

CLAUDE.md

refactor: migrate mistral sdk to github.com/VikingOwl91/mistral-go-sdk

2026-04-03 12:06:59 +02:00

CONTRIBUTING.md

fix(m8): replace_default map, error UX, benchmarks, and launch prep

2026-04-12 03:34:58 +02:00

gemma-integration-analysis.md

docs: consolidated roadmap, ADR-013, drop stale plans

2026-05-07 15:06:54 +02:00

go.mod

feat(skill): core Skill type and YAML frontmatter parser

2026-04-07 02:05:49 +02:00

go.sum

feat(skill): core Skill type and YAML frontmatter parser

2026-04-07 02:05:49 +02:00

Makefile

feat: rate limit pools, elf tree view, permission prompts, dep updates

2026-04-03 20:54:48 +02:00

README.md

feat(plugin): trust-on-first-use manifest pinning

2026-05-19 16:44:09 +02:00

TODO.md

docs: consolidated roadmap, ADR-013, drop stale plans

2026-05-07 15:06:54 +02:00

README.md

gnoma

A provider-agnostic agentic coding assistant built in Go. gnoma routes tasks to the best available LLM — cloud or local — through a multi-armed bandit router, while tools, hooks, skills, MCP servers, and plugins keep it extensible. Named after the northern pygmy-owl (Glaucidium gnoma); agents are called elfs (elf owl).

Quickstart

# Install
go install somegit.dev/Owlibou/gnoma/cmd/gnoma@latest

# Or build from source
git clone https://somegit.dev/Owlibou/gnoma && cd gnoma
make build    # binary at ./bin/gnoma

# Set at least one provider key
export ANTHROPIC_API_KEY=sk-ant-...   # or OPENAI_API_KEY, MISTRAL_API_KEY, GEMINI_API_KEY

# Run
gnoma                                 # interactive TUI
echo "list files" | gnoma             # pipe mode
gnoma --provider ollama               # use a local model

Build

make build          # ./bin/gnoma
make install        # $GOPATH/bin/gnoma

Providers

Anthropic

export ANTHROPIC_API_KEY=sk-ant-...
./bin/gnoma --provider anthropic
./bin/gnoma --provider anthropic --model claude-opus-4-5-20251001

Integration tests hit the real API — keep a key in env:

go test -tags integration ./internal/provider/...

OpenAI

export OPENAI_API_KEY=sk-proj-...
./bin/gnoma --provider openai
./bin/gnoma --provider openai --model gpt-4o

Mistral

export MISTRAL_API_KEY=...
./bin/gnoma --provider mistral

Google (Gemini)

export GEMINI_API_KEY=AIza...
./bin/gnoma --provider google
./bin/gnoma --provider google --model gemini-2.0-flash

Ollama (local)

Start Ollama and pull a model, then:

./bin/gnoma --provider ollama --model gemma4:latest
./bin/gnoma --provider ollama --model qwen3:8b     # default if --model omitted

Default endpoint: http://localhost:11434/v1. Override via config or env:

# .gnoma/config.toml
[provider]
default = "ollama"
model   = "gemma4:latest"

[provider.endpoints]
ollama = "http://myhost:11434/v1"

llama.cpp (local)

Start the llama.cpp server:

llama-server --model /path/to/model.gguf --port 8080 --ctx-size 8192

Then:

./bin/gnoma --provider llamacpp
# model name is taken from the server's /v1/models response

Default endpoint: http://localhost:8080/v1. Override:

[provider.endpoints]
llamacpp = "http://localhost:9090/v1"

Extensibility (M8)

gnoma supports hooks, skills, MCP servers, and plugins.

MCP Servers

Connect any MCP-compatible tool server:

[[mcp_servers]]
name    = "git"
command = "mcp-server-git"
args    = ["--repo", "."]
timeout = "30s"

# Replace a built-in tool with an MCP tool
[mcp_servers.replace_default]
exec = "bash"   # MCP tool "exec" replaces gnoma's built-in "bash"

MCP tools appear as mcp__{server}__{tool} (e.g., mcp__git__status), or under the built-in name when using replace_default.

Skills

Drop markdown files into .gnoma/skills/ or ~/.config/gnoma/skills/:

/skillname          # invoke a skill
/skills             # list available skills

Hooks

Run shell commands on tool events:

[[hooks]]
name         = "block-rm-rf"
event        = "pre_tool_use"
type         = "command"
exec         = "bash-safety-check.sh"
tool_pattern = "bash*"

Plugins

Bundle skills, hooks, and MCP configs into installable plugins:

gnoma plugin install ./my-plugin    # install from directory
gnoma plugin list                   # list installed plugins

Plugins are pinned by SHA-256 of their plugin.json on first load (Trust-On-First-Use). A manifest that changes between runs is refused with a clear error and a re-enrollment hint. See docs/plugins-trust.md and ADR-003.

Session Persistence

Conversations are auto-saved to .gnoma/sessions/ after each completed turn. On a crash you lose at most the current in-flight turn; all previously completed turns are safe.

Resume a session

gnoma --resume              # interactive session picker (↑↓ navigate, Enter load, Esc cancel)
gnoma --resume <id>         # restore directly by ID
gnoma -r                    # shorthand

Inside the TUI:

/resume                     # open picker
/resume <id>                # restore by ID

Incognito mode

gnoma --incognito           # no session saved, no quality scores updated

Toggle at runtime with Ctrl+X.

Config

[session]
max_keep = 20   # how many sessions to retain per project (default: 20)

Sessions are stored per-project under .gnoma/sessions/<id>/. Quality scores (EMA routing data) are stored globally at ~/.config/gnoma/quality.json.

Config

Config is read in priority order:

~/.config/gnoma/config.toml — global
.gnoma/config.toml — project-local (next to go.mod / .git)
Environment variables

Example .gnoma/config.toml:

[provider]
default = "anthropic"
model   = "claude-sonnet-4-6"

[provider.api_keys]
anthropic = "${ANTHROPIC_API_KEY}"

[provider.endpoints]
ollama   = "http://localhost:11434/v1"
llamacpp = "http://localhost:8080/v1"

[permission]
mode = "auto"   # auto | accept_edits | bypass | deny | plan

Environment variable overrides: GNOMA_PROVIDER, GNOMA_MODEL.

Testing

make test               # unit tests
make test-integration   # integration tests (require real API keys)
make cover              # coverage report → coverage.html
make lint               # golangci-lint
make check              # fmt + vet + lint + test

Integration tests are gated behind //go:build integration and skipped by default.