34108075d2
At startup, polls ollama (/api/tags) and llama.cpp (/v1/models) for available models. Registers each as an arm in the router alongside the CLI-specified provider. Discovered: 7 ollama models + 1 llama.cpp model = 9 total arms. Router can now select from multiple local models based on task type. Discovery is non-blocking — failures logged and skipped.