voice-to-notes

Author	SHA1	Message	Date
Gitea Actions	9989f65531	chore: bump sidecar version to 1.0.4 [skip ci]	2026-03-22 18:00:04 +00:00
Claude	2e7a5819bc	Fix CSP for blob URLs + fix pyannote AudioDecoder with torchaudio patch All checks were successful Build Sidecars / Bump sidecar version and tag (push) Successful in 4s Details Release / Bump version and tag (push) Successful in 3s Details Build Sidecars / Build Sidecar (macOS) (push) Successful in 3m25s Details Release / Build App (macOS) (push) Successful in 1m26s Details Build Sidecars / Build Sidecar (Linux) (push) Successful in 14m31s Details Release / Build App (Linux) (push) Successful in 3m50s Details Build Sidecars / Build Sidecar (Windows) (push) Successful in 27m7s Details Release / Build App (Windows) (push) Successful in 3m26s Details CSP: Add blob: to connect-src/img-src/media-src for wavesurfer.js audio playback. Add http://tauri.localhost to default-src for devtools. pyannote: sys.modules block didn't work — pyannote still uses AudioDecoder unconditionally. New approach: monkey-patch Audio.__call__ in diarize.py to use torchaudio.load() directly, bypassing the broken torchcodec path. Patch runs once before pipeline loading. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-22 10:59:54 -07:00
Gitea Actions	1104d956c9	chore: bump sidecar version to 1.0.3 [skip ci]	2026-03-22 16:54:32 +00:00
Claude	db770c341d	Fix CSP blocking IPC/assets + fix pyannote AudioDecoder crash All checks were successful Build Sidecars / Bump sidecar version and tag (push) Successful in 9s Details Release / Bump version and tag (push) Successful in 5s Details Build Sidecars / Build Sidecar (macOS) (push) Successful in 3m37s Details Release / Build App (macOS) (push) Successful in 1m16s Details Build Sidecars / Build Sidecar (Linux) (push) Successful in 14m3s Details Release / Build App (Linux) (push) Successful in 4m45s Details Build Sidecars / Build Sidecar (Windows) (push) Successful in 24m32s Details Release / Build App (Windows) (push) Successful in 3m12s Details CSP: Add connect-src for ipc.localhost and asset.localhost so Tauri IPC commands and local file loading (waveform, audio playback) work. pyannote: Block torchcodec in sys.modules at startup so pyannote.audio falls back to torchaudio for audio decoding. pyannote has a bug where it uses AudioDecoder unconditionally even when torchcodec import fails. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-22 09:54:21 -07:00
Gitea Actions	a5e3d47f02	chore: bump sidecar version to 1.0.2 [skip ci]	2026-03-22 16:13:16 +00:00
Claude	62675c77ae	Exclude torchcodec from PyInstaller to fix diarization Some checks failed Build Sidecars / Bump sidecar version and tag (push) Successful in 5s Details Release / Bump version and tag (push) Successful in 5s Details Release / Build App (Linux) (push) Has been cancelled Details Release / Build App (Windows) (push) Has been cancelled Details Release / Build App (macOS) (push) Has been cancelled Details Build Sidecars / Build Sidecar (macOS) (push) Successful in 4m8s Details Build Sidecars / Build Sidecar (Linux) (push) Successful in 13m59s Details Build Sidecars / Build Sidecar (Windows) (push) Successful in 25m43s Details torchcodec is partially bundled but non-functional (missing FFmpeg DLLs), causing pyannote.audio to try AudioDecoder which fails with NameError. Excluding it forces pyannote to fall back to torchaudio for audio loading. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-22 09:13:09 -07:00
Gitea Actions	d33bb609d7	chore: bump sidecar version to 1.0.1 [skip ci]	2026-03-22 14:58:05 +00:00
Claude	45247ae66e	Decouple sidecar versioning from app versioning Some checks failed Build Sidecars / Bump sidecar version and tag (push) Successful in 3s Details Release / Bump version and tag (push) Failing after 3s Details Release / Build App (Linux) (push) Has been skipped Details Release / Build App (Windows) (push) Has been skipped Details Release / Build App (macOS) (push) Has been skipped Details Build Sidecars / Build Sidecar (macOS) (push) Successful in 5m28s Details Build Sidecars / Build Sidecar (Linux) (push) Successful in 13m54s Details Build Sidecars / Build Sidecar (Windows) (push) Successful in 37m38s Details Sidecar now has its own version (1.0.0) and release lifecycle: - Sidecar tags: sidecar-v1.0.0, sidecar-v1.0.1, etc. - App tags: v0.2.x (unchanged) - Sidecar workflow triggers only on python/** changes or manual dispatch - App release no longer bumps python/pyproject.toml Sidecar version tracked via sidecar-version.txt in app data dir: - resolve_sidecar_path() reads version from file instead of CARGO_PKG_VERSION - download_sidecar() fetches latest sidecar-v* release from Gitea API - check_sidecar_update() compares local vs remote sidecar versions - Version file written after successful download Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-22 07:57:51 -07:00
Gitea Actions	5947ffef66	chore: bump version to 0.2.12 [skip ci]	2026-03-22 14:09:28 +00:00
Gitea Actions	1b706a855b	chore: bump version to 0.2.11 [skip ci]	2026-03-22 13:35:22 +00:00
Gitea Actions	67d3325532	chore: bump version to 0.2.10 [skip ci]	2026-03-22 13:33:25 +00:00
Gitea Actions	7c814805f3	chore: bump version to 0.2.9 [skip ci]	2026-03-22 13:14:43 +00:00
Claude	5f9f1426e6	Fix CUDA build: use PyTorch CUDA index URL explicitly Some checks failed Release / Bump version and tag (push) Successful in 15s Details Release / Build (macOS) (push) Successful in 5m11s Details Release / Build (Windows) (push) Failing after 14m6s Details Release / Build (Linux) (push) Has been cancelled Details Default torch on PyPI is CPU-only on Windows. Must use PyTorch's own package index (cu126) to get CUDA-enabled wheels. This also pins the CUDA version on Linux for deterministic builds. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-22 06:14:26 -07:00
Gitea Actions	5311b19fdc	chore: bump version to 0.2.8 [skip ci]	2026-03-22 12:56:13 +00:00
Gitea Actions	fcbe1afd0c	chore: bump version to 0.2.7 [skip ci]	2026-03-22 12:37:00 +00:00
Claude	7efa3bb116	Fix CUDA fallback: gracefully fall back to CPU when CUDA libs missing Some checks failed Release / Bump version and tag (push) Successful in 18s Details Release / Build (macOS) (push) Successful in 5m27s Details Release / Build (Linux) (push) Successful in 11m38s Details Release / Build (Windows) (push) Has been cancelled Details - transcribe: catch model load failures on CUDA and retry with CPU - hardware detect: test CUDA runtime actually works (torch.zeros on cuda) before recommending GPU, since CPU-only builds detect CUDA via driver but lack cublas/cuDNN libraries Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-22 05:36:40 -07:00
Gitea Actions	2be5024de7	chore: bump version to 0.2.6 [skip ci]	2026-03-22 03:48:06 +00:00
Claude	32e3c6d42e	Remove unittest from PyInstaller excludes (needed by huggingface_hub) All checks were successful Release / Bump version and tag (push) Successful in 3s Details Release / Build (macOS) (push) Successful in 5m46s Details Release / Build (Linux) (push) Successful in 7m46s Details Release / Build (Windows) (push) Successful in 16m29s Details Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-21 20:48:01 -07:00
Gitea Actions	68ae48a771	chore: bump version to 0.2.5 [skip ci]	2026-03-22 03:23:01 +00:00
Claude	4fed9bccb8	Fix sidecar crash: torch circular import under PyInstaller All checks were successful Release / Bump version and tag (push) Successful in 4s Details Release / Build (Linux) (push) Successful in 7m27s Details Release / Build (macOS) (push) Successful in 7m47s Details Release / Build (Windows) (push) Successful in 19m25s Details - Exclude ctranslate2.converters from PyInstaller bundle — these modules import torch at module level causing circular import crashes, and are only needed for model conversion (never used at runtime) - Defer all heavy ML imports to first handler call instead of startup, so the sidecar can send its ready message without loading torch/whisper Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-21 20:22:56 -07:00
Gitea Actions	ef1481b359	chore: bump version to 0.2.4 [skip ci]	2026-03-22 01:37:55 +00:00
Gitea Actions	9a6ea84637	chore: bump version to 0.2.3 [skip ci]	2026-03-21 21:46:10 +00:00
Gitea Actions	d2d111a5c7	chore: bump version to 0.2.2 [skip ci]	2026-03-21 18:56:14 +00:00
Gitea Actions	9033881274	chore: bump version to 0.2.1 [skip ci]	2026-03-21 18:53:23 +00:00
Claude	1ed34e0bbb	Add auto-increment version and release workflow All checks were successful Release / Bump version and tag (push) Successful in 3s Details - New release.yml: bumps patch version, commits with skip-ci marker, tags, creates Gitea release - Build workflows now trigger on v* tags only (not branch push) - Simplified upload steps: use tag directly, retry loop for release lookup - Fix macOS: install jq if missing - Sync python/pyproject.toml version to 0.2.0 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-21 11:53:13 -07:00
Claude	12869e3757	Fix sidecar not found on Windows/macOS/Linux: switch from externalBin to resources Some checks failed Build macOS / Build (macOS) (push) Failing after 2m53s Details Build Linux / Build (Linux) (push) Failing after 6m30s Details Build Windows / Build (Windows) (push) Failing after 8m15s Details Tauri's externalBin only bundled the single sidecar executable, but PyInstaller's onedir output requires companion DLLs and _internal/. The binary was also renamed with a target triple suffix that resolve_sidecar_path() didn't look for, causing it to fall back to dev mode which used a compile-time CI path (CARGO_MANIFEST_DIR). - Switch from externalBin to bundle.resources to include all sidecar files - Pass Tauri resource_dir to sidecar manager for platform-aware path resolution - Remove rename_binary() since externalBin target triple naming is no longer needed - Remove broken production-to-dev fallback that could never work on user machines Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-21 06:55:44 -07:00
Claude	b8d70539ec	Fix uv pip install: use venv dir not python binary path, add .exe on Windows Some checks failed Build & Release / Build sidecar (x86_64-pc-windows-msvc) (push) Failing after 6s Details Build & Release / Build app (x86_64-unknown-linux-gnu) (push) Has been cancelled Details Build & Release / Build app (aarch64-apple-darwin) (push) Has been cancelled Details Build & Release / Build sidecar (x86_64-unknown-linux-gnu) (push) Has been cancelled Details Build & Release / Build app (x86_64-pc-windows-msvc) (push) Has been cancelled Details Build & Release / Create Release (push) Has been cancelled Details Build & Release / Build sidecar (aarch64-apple-darwin) (push) Has been cancelled Details - uv pip --python works better with the venv directory path than the python binary path (avoids "No virtual environment found" on Windows) - Add .exe suffix to Windows python path for non-uv fallback Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 23:05:23 -07:00
Claude	760b5dc90e	Fix build script: remove duplicate 'install' arg, fix Windows shell Some checks failed Build & Release / Build sidecar (x86_64-pc-windows-msvc) (push) Failing after 5s Details Build & Release / Build app (x86_64-unknown-linux-gnu) (push) Has been cancelled Details Build & Release / Build app (aarch64-apple-darwin) (push) Has been cancelled Details Build & Release / Build app (x86_64-pc-windows-msvc) (push) Has been cancelled Details Build & Release / Create Release (push) Has been cancelled Details Build & Release / Build sidecar (x86_64-unknown-linux-gnu) (push) Has been cancelled Details Build & Release / Build sidecar (aarch64-apple-darwin) (push) Has been cancelled Details - build_sidecar.py: pip_install() now includes 'install' in the command, callers pass only package names (was doubling up as 'uv pip install install torch') - CI: set shell: bash on uv steps so Windows doesn't try to use cmd.exe Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 23:03:23 -07:00
Claude	4701b578fc	Use uv for Python management in CI and build script Some checks failed Build & Release / Build sidecar (x86_64-unknown-linux-gnu) (push) Failing after 23s Details Build & Release / Build sidecar (aarch64-apple-darwin) (push) Failing after 25s Details Build & Release / Build sidecar (x86_64-pc-windows-msvc) (push) Failing after 29s Details Build & Release / Build app (x86_64-unknown-linux-gnu) (push) Has been skipped Details Build & Release / Build app (aarch64-apple-darwin) (push) Has been skipped Details Build & Release / Build app (x86_64-pc-windows-msvc) (push) Has been skipped Details Build & Release / Create Release (push) Has been skipped Details - CI: install uv via astral-sh/setup-uv, use uv to install Python and run the build script (replaces setup-python which fails on self-hosted macOS runners) - build_sidecar.py: auto-detects uv and uses it for venv creation and package installation (much faster), falls back to standard venv + pip when uv is not available - Remove .github/workflows duplicate Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 22:58:30 -07:00
Claude	4a4402f71a	Fix CI: macOS Python toolcache permissions, Windows pip invocation - Create /Users/runner directory on macOS before setup-python (permission fix) - Use `python -m pip` everywhere instead of calling pip directly (Windows fix) - Refactor build_sidecar.py to use pip_install() helper via python -m pip Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 21:56:25 -07:00
Claude	58faa83cb3	Cross-platform distribution, UI improvements, and performance optimizations - PyInstaller frozen sidecar: spec file, build script, and ffmpeg path resolver for self-contained distribution without Python prerequisites - Dual-mode sidecar launcher: frozen binary (production) with dev mode fallback - Parallel transcription + diarization pipeline (~30-40% faster) - GPU auto-detection for diarization (CUDA when available) - Async run_pipeline command for real-time progress event delivery - Web Audio API backend for instant playback and seeking - OpenAI-compatible provider replacing LiteLLM client-side routing - Cross-platform RAM detection (Linux/macOS/Windows) - Settings: speaker count hint, token reveal toggles, dark dropdown styling - Loading splash screen, flexbox layout fix for viewport overflow - Gitea Actions CI/CD pipeline (Linux, Windows, macOS ARM) - Updated README and CLAUDE.md documentation Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 21:33:43 -07:00
Claude	42ccd3e21d	Fix cargo fmt formatting and diarize threading test mock	2026-03-20 13:56:32 -07:00
Claude	0771508203	Merge perf/chunked-transcription: chunk-based processing for large files	2026-03-20 13:54:14 -07:00
Claude	c23b9a90dd	Merge perf/diarize-threading: diarization progress via background thread	2026-03-20 13:52:59 -07:00
Claude	35af6e9e0c	Merge perf/progress-every-segment: emit progress for every segment	2026-03-20 13:52:18 -07:00
Claude	c3b6ad38fd	Merge perf/stream-segments: streaming partial transcript segments and speaker updates	2026-03-20 13:51:51 -07:00
Claude	03af5a189c	Run pyannote diarization in background thread with progress reporting Move the blocking pipeline() call to a daemon thread and emit estimated progress messages every 2 seconds from the main thread. The progress estimate uses audio duration to calibrate the expected total time. Also pass audio_duration_sec from PipelineService to DiarizeService. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 13:50:57 -07:00
Claude	16f4b57771	Add chunked transcription for large audio files (>1 hour) Split files >1 hour into 5-minute chunks via ffmpeg, transcribe each chunk independently, then merge results with corrected timestamps. Also add chunk-level progress markers every 10 segments for all files. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 13:49:20 -07:00
Claude	6eb13bce63	Remove progress throttle so every segment emits a progress update Previously, progress messages were only sent every 5th segment due to a `segment_count % 5` guard. This made the UI feel unresponsive for short recordings with few segments. Now every segment emits a progress update with a more descriptive message including the segment number and audio percentage. Adds a test verifying that all 8 mock segments produce progress messages, not just every 5th. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 13:49:14 -07:00
Josh Knapp	67ed69df00	Stream transcript segments to frontend as they are transcribed Send each segment to the frontend immediately after transcription via a new pipeline.segment IPC message, then send speaker assignments as a batch pipeline.speaker_update message after diarization completes. This lets the UI display segments progressively instead of waiting for the entire pipeline to finish. Changes: - Add partial_segment_message and speaker_update_message IPC factories - Add on_segment callback parameter to TranscribeService.transcribe() - Emit partial segments and speaker updates from PipelineService.run() - Add send_and_receive_with_progress to SidecarManager (Rust) - Route pipeline.segment/speaker_update events in run_pipeline command - Listen for streaming events in Svelte frontend (+page.svelte) - Add tests for new message types, callback signature, and update logic Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 13:47:57 -07:00
Josh Knapp	585411f402	Fix speaker diarization: WAV conversion, pyannote 4.0 compat, telemetry bug - Convert non-WAV audio to 16kHz mono WAV before diarization (pyannote v4.0.4 AudioDecoder returns None duration for FLAC, causing crash) - Handle pyannote 4.0 DiarizeOutput return type (unwrap .speaker_diarization) - Disable pyannote telemetry (np.isfinite(None) bug with max_speakers) - Use huggingface_hub.login() to persist token for all sub-downloads - Pre-download sub-models (segmentation-3.0, speaker-diarization-community-1) - Add third required model license link in settings UI - Improve SpeakerManager hints based on settings state - Add word-wrap to transcript text Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 19:46:07 -08:00
Josh Knapp	a3612c986d	Add Test & Download button for diarization model, clickable links - Add diarize.download IPC handler that downloads the pyannote model and returns user-friendly error messages (missing license, bad token) - Add download_diarize_model Tauri command - Add "Test & Download Model" button in Speakers settings tab - Update instructions to list both required model licenses (speaker-diarization-3.1 AND segmentation-3.0) - Make all HuggingFace URLs clickable (opens in system browser) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 18:21:42 -08:00
Josh Knapp	baf820286f	Add HuggingFace token setting for speaker detection - Add "Speakers" tab in Settings with HF token input field - Include step-by-step instructions for obtaining the token - Pass hf_token from settings through Rust → Python pipeline → diarize - Token can also be set via HF_TOKEN environment variable as fallback - Move skip_diarization checkbox to Speakers tab Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 18:08:51 -08:00
Josh Knapp	ed626b8ba0	Fix progress overlay, play-from-position, layout cutoff, speaker info - Replace progress bar with task checklist showing pipeline steps (load model, transcribe, load diarization, identify speakers, merge) - Fix WaveformPlayer: track ready state, disable controls until loaded, play from current position instead of resetting to start - Fix workspace height calc to prevent bottom content cutoff - Show HF_TOKEN setup hint in SpeakerManager when no speakers detected - Add console logging for progress events to aid debugging Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 18:02:48 -08:00
Josh Knapp	4d7b9d524f	Fix IPC stdout corruption, dark window background, overlay timing - Redirect sys.stdout to stderr in Python sidecar so library print() calls don't corrupt the JSON-line IPC stream - Save real stdout fd for exclusive IPC use via init_ipc() - Skip non-JSON lines in Rust reader instead of failing with parse error - Set Tauri window background color to match dark theme (#0a0a23) - Add inline dark background on html/body to prevent white flash - Use Svelte tick() to ensure progress overlay renders before invoke - Improve ProgressOverlay with spinner, better styling, z-index 9999 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 17:50:55 -08:00
Josh Knapp	87b3ad94f9	Improve import UX: progress overlay, pyannote fix, debug logging - Enhanced ProgressOverlay with spinner, better styling, and z-index 9999 - Import button shows "Processing..." with pulse animation while transcribing - Fix pyannote API: use token= instead of deprecated use_auth_token= - Read HF_TOKEN from environment for pyannote model download - Add console logging for click-to-seek debugging - Add color-scheme: dark for native form controls Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 17:43:49 -08:00
Josh Knapp	669d88f143	Fix progress feedback, diarization fallback, and dropdown readability - Stream pipeline progress to frontend via Tauri events so the progress overlay updates in real time during transcription/diarization - Gracefully fall back to transcription-only when diarization fails (e.g. pyannote not installed) instead of erroring the whole pipeline - Add color-scheme: dark to fix native select/option elements rendering with unreadable white backgrounds Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 17:14:25 -08:00
Josh Knapp	d67625cd5a	Phase 5: AI provider system with local and cloud support - Implement AIProvider base interface with chat() and is_available() - Add LocalProvider connecting to bundled llama-server via OpenAI SDK - Add OpenAIProvider for direct OpenAI API access - Add AnthropicProvider for Anthropic Claude API - Add LiteLLMProvider for multi-provider gateway - Build AIProviderService with provider routing, auto-selection, and transcript context injection - Add ai.chat IPC handler supporting chat, list_providers, set_provider, and configure actions - Add ai_chat, ai_list_providers, ai_configure Tauri commands - Build interactive AIChatPanel with message history, quick actions (Summarize, Action Items), and transcript context awareness - Tests: 30 Python, 6 Rust, 0 Svelte errors Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 16:25:10 -08:00
Josh Knapp	415a648a2b	Phase 4: Export to SRT, WebVTT, ASS, plain text, and Markdown - Implement ExportService using pysubs2 for caption formats (SRT, VTT, ASS) and custom formatters for plain text and Markdown - SRT exports with [Speaker]: prefix, WebVTT with <v Speaker> voice tags, ASS with color-coded speaker styles - Plain text groups by speaker with labels, Markdown adds timestamps - Add export.start IPC handler and export_transcript Tauri command - Add export dropdown menu in header (appears after transcription) - Uses native save dialog for output file selection - Add pysubs2 dependency - Tests: 30 Python (6 export tests), 6 Rust, 0 Svelte errors Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 16:18:54 -08:00
Josh Knapp	44480906a4	Phase 3: Speaker diarization and full transcription pipeline - Implement DiarizeService with pyannote.audio speaker detection - Build PipelineService combining transcribe → diarize → merge with overlap-based speaker assignment per segment - Add pipeline.start and diarize.start IPC handlers - Add run_pipeline Tauri command for full pipeline execution - Wire frontend to use pipeline: speakers auto-created with colors, segments assigned to detected speakers - Build SpeakerManager with rename support (double-click or edit button) - Add speaker color coding throughout transcript display - Add pyannote.audio dependency - Tests: 24 Python (including merge logic), 6 Rust, 0 Svelte errors Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 16:09:48 -08:00

1 2

52 Commits