vikingowl/polyscribe

Go to file

vikingowl 840383fcf7

CI / build (push) Has been cancelled

Details

[feat] add JSON and quiet output modes for models subcommands, update UI suppression logic, and enhance CLI test coverage

2025-08-27 23:58:57 +02:00

[refactor] remove backend and library modules, consolidating features into main crate

2025-08-13 13:35:53 +02:00

.github/workflows

docs: align CLI docs to models subcommands; host: scan XDG plugin dir; ci: add GitHub Actions; chore: add CHANGELOG

2025-08-14 11:16:50 +02:00

[feat] add JSON and quiet output modes for models subcommands, update UI suppression logic, and enhance CLI test coverage

2025-08-27 23:58:57 +02:00

docs: align CLI docs to models subcommands; host: scan XDG plugin dir; ci: add GitHub Actions; chore: add CHANGELOG

2025-08-14 11:16:50 +02:00

plugins/polyscribe-plugin-tubescribe

[refactor] rename and simplify ProgressManager to FileProgress, enhance caching logic, update Hugging Face API integration, and clean up unused comments

2025-08-15 11:24:50 +02:00

[refactor] remove unused test suites, examples, CI docs, and PR description file

2025-08-13 14:26:18 +02:00

.gitignore

[chore] update .gitignore to exclude /models directory

2025-08-08 07:05:17 +02:00

Cargo.lock

[feat] add ModelManager with caching, manifest management, and Hugging Face API integration

2025-08-27 20:56:05 +02:00

Cargo.toml

[feat] add ModelManager with caching, manifest management, and Hugging Face API integration

2025-08-27 20:56:05 +02:00

CHANGELOG.md

docs: align CLI docs to models subcommands; host: scan XDG plugin dir; ci: add GitHub Actions; chore: add CHANGELOG

2025-08-14 11:16:50 +02:00

CONTRIBUTING.md

[refactor] remove unused test suites, examples, CI docs, and PR description file

2025-08-13 14:26:18 +02:00

LICENSE

[chore] add MIT license and copyright notices across project files

2025-08-08 20:29:45 +02:00

README.md

[refactor] rename and simplify ProgressManager to FileProgress, enhance caching logic, update Hugging Face API integration, and clean up unused comments

2025-08-15 11:24:50 +02:00

rust-toolchain.toml

[refactor] remove backend and library modules, consolidating features into main crate

2025-08-13 13:35:53 +02:00

TODO.md

[docs] add initial TODO.md with actionable backlog and prioritization

2025-08-08 20:57:01 +02:00

README.md

PolyScribe

Local-first transcription and plugins.

Features

Local-first: Works offline with downloaded models
Multiple backends: CPU, CUDA, ROCm/HIP, and Vulkan support
Plugin system: Extensible via JSON-RPC plugins
Model management: Automatic download and verification of Whisper models
Manifest caching: Local cache for Hugging Face model manifests to reduce network requests

Model Management

PolyScribe automatically manages Whisper models from Hugging Face:

# Download models interactively
polyscribe models download

# Update existing models
polyscribe models update

# Clear manifest cache (force fresh fetch)
polyscribe models clear-cache

Manifest Caching

The Hugging Face model manifest is cached locally to avoid repeated network requests:

Default TTL: 24 hours
Cache location: $XDG_CACHE_HOME/polyscribe/manifest/ (or platform equivalent)
Environment variables:
- POLYSCRIBE_NO_CACHE_MANIFEST=1: Disable caching
- POLYSCRIBE_MANIFEST_TTL_SECONDS=3600: Set custom TTL (in seconds)

Installation

cargo install --path .

Usage

# Transcribe audio/video
polyscribe transcribe input.mp4

# Merge multiple transcripts
polyscribe transcribe --merge input1.json input2.json

# Use specific GPU backend
polyscribe transcribe --gpu-backend cuda input.mp4

Development

# Build
cargo build

# Run tests
cargo test

# Run with verbose logging
cargo run -- --verbose transcribe input.mp4