The state callback in main_headless.py wrote events to stdout
synchronously, so an EINVAL from the Tauri sidecar pipe on Windows
bubbled up through _set_state and aborted engine init and
reload_engine. To the user, that surfaced as PUT /api/config failing
with "Failed to fetch". The print
is now pipe-safe and api_server isolates the chained callback so a
future misbehaving listener cannot break the engine state machine.
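The pipe-safe write can be sketched roughly like this (the helper name is illustrative, not the actual code in main_headless.py):

```python
def safe_print(message: str) -> None:
    """Write a line to stdout, swallowing pipe errors.

    On Windows, writing to a closed Tauri sidecar pipe can raise
    OSError with errno EINVAL rather than BrokenPipeError, so both
    are caught; the event is dropped instead of letting the error
    tear down the engine state machine.
    """
    try:
        print(message, flush=True)
    except (BrokenPipeError, OSError):
        pass
```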
Settings now also persists remote.email on login and, when an
auth_token is present, shows a "Logged in as <email>" indicator with a
Log out button instead of leaving the email/password fields blank on
reload.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
- Run apply_settings in thread pool executor to prevent engine reload
from blocking the HTTP response (caused "TypeError: Failed to fetch")
- Flatten nested config objects into dot-notation keys before saving
so partial updates don't wipe out keys omitted from the payload,
such as auth_token
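The flattening step might look like this minimal sketch (function name assumed, not the actual implementation):

```python
def flatten(obj: dict, prefix: str = "") -> dict:
    """Flatten nested config dicts into dot-notation keys, e.g.
    {"remote": {"mode": "byok"}} -> {"remote.mode": "byok"},
    so each leaf is saved individually and untouched keys survive."""
    flat = {}
    for key, value in obj.items():
        path = f"{prefix}.{key}" if prefix else key
        if isinstance(value, dict):
            flat.update(flatten(value, path))
        else:
            flat[path] = value
    return flat
```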
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
- Expose is_cloud_only flag in /api/status response
- Add isCloudOnly to backend store state
- Conditionally hide Local (Whisper) radio button in Settings
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The default remote.mode changed from 'local' to 'byok', causing
the apply_settings test to detect a mode mismatch and trigger an
unexpected engine reload. Pin remote.mode to 'local' in the test
to match the controller's assumed current mode.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
start_transcription() blocks up to 15s waiting for the Deepgram
WebSocket to connect. Running it synchronously in the async endpoint
blocked the entire uvicorn event loop, preventing:
- pollStatus from completing (frozen HTTP request)
- WebSocket broadcasts from being sent
- Any other API requests from being handled
Fix: run start/stop/reload in thread pool via run_in_executor so
the event loop stays responsive during long-running operations.
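A minimal sketch of the executor pattern, with a stand-in for the blocking call (names are illustrative, not the actual api_server code):

```python
import asyncio
from concurrent.futures import ThreadPoolExecutor

def blocking_start() -> bool:
    """Stand-in for start_transcription(), which may block up to 15s
    waiting for the Deepgram WebSocket to connect."""
    import time
    time.sleep(0.1)
    return True

_executor = ThreadPoolExecutor(max_workers=2)

async def start_endpoint() -> dict:
    loop = asyncio.get_running_loop()
    # Off-load to a worker thread so the uvicorn event loop keeps
    # serving pollStatus, WebSocket broadcasts, and other requests.
    ok = await loop.run_in_executor(_executor, blocking_start)
    return {"started": ok}
```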
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Start Transcription button now shows the error message when it fails
instead of silently reverting. Common causes:
- Missing PortAudio library on Linux
- Audio device not accessible
- Deepgram connection failure
Also added error details to backend console output and captured
the last error from the Deepgram engine for better diagnostics.
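Capturing the last engine error so the API can surface it could look like this sketch (class and method names are assumptions, not the actual engine API):

```python
class EngineErrorCapture:
    """Record the most recent engine failure so the frontend can show
    a concrete message instead of silently reverting the button."""

    def __init__(self):
        self.last_error = None

    def start(self, connect) -> bool:
        try:
            connect()
            self.last_error = None
            return True
        except Exception as exc:
            # e.g. missing PortAudio, inaccessible audio device,
            # or a failed Deepgram connection
            self.last_error = f"{type(exc).__name__}: {exc}"
            return False
```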
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
On first launch, the cloud sidecar now:
1. Detects it's the cloud variant (DeviceManager import fails)
2. Auto-switches config from "local" to "byok" mode
3. Shows "Setup needed: Open Settings > Remote Transcription >
enter your Deepgram API key" as a friendly status message
4. Stays in READY state so the UI is fully accessible
The user can then open Settings, enter their Deepgram API key,
save, and start transcribing without needing to know about modes.
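The detection-and-switch flow, roughly (module path and function names are illustrative):

```python
import importlib

def detect_variant() -> str:
    """The cloud build excludes the local engine modules, so a failed
    import is the signal that this is the cloud variant."""
    try:
        importlib.import_module("client.device_manager")
        return "full"
    except ImportError:
        return "cloud"

def apply_first_launch_defaults(config: dict) -> str:
    """On the cloud variant, auto-switch 'local' to 'byok' and return
    a friendly status message; otherwise report ready."""
    if detect_variant() == "cloud" and config.get("remote.mode") == "local":
        config["remote.mode"] = "byok"
        return ("Setup needed: Open Settings > Remote Transcription > "
                "enter your Deepgram API key")
    return "ready"
```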
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The cloud sidecar excludes the local Whisper engine module, but on
first launch the config defaults to remote.mode="local", so the
controller tries to import it. The controller now catches that
ImportError and shows an error message telling the user to switch to
Cloud (Deepgram) mode in Settings. The API server still starts, so
Settings stays accessible.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Lightweight Deepgram-only sidecar that excludes PyTorch, faster-whisper,
RealtimeSTT, and CUDA. Only includes audio capture + WebSocket streaming
to Deepgram. Requires a Deepgram API key (BYOK or managed mode).
Changes:
- client/models.py: Extracted TranscriptionResult into standalone module
so deepgram_transcription.py doesn't transitively import torch
- backend/app_controller.py: Made RealtimeTranscriptionEngine and
DeviceManager imports lazy (only loaded when remote.mode == "local")
- local-transcription-cloud.spec: PyInstaller spec excluding all ML deps
- SidecarSetup.svelte: Added "Cloud Only (Deepgram)" variant option
- build-sidecar-cloud.yml: CI workflow building cloud sidecar for all 3 OS
- sidecar-release.yml: Dispatches cloud build alongside CPU/CUDA builds
Sidecar download options are now:
- Standard (CPU): ~500 MB - local Whisper on any computer
- GPU Accelerated (CUDA): ~2 GB - local Whisper with NVIDIA GPU
- Cloud Only (Deepgram): ~50 MB - requires API key, no local models
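The lazy-import change in app_controller.py can be illustrated like this (module paths and class names follow the commit's description but are a sketch, not the real controller):

```python
import importlib

def load_engine_class(mode: str):
    """Resolve the engine class for the given remote.mode, importing
    modules lazily so the cloud-only build never touches torch."""
    if mode == "local":
        # Heavy path: transitively imports torch / faster-whisper.
        module_name = "client.realtime_transcription"
        class_name = "RealtimeTranscriptionEngine"
    else:
        # Lightweight path: audio capture + Deepgram WebSocket only.
        module_name = "client.deepgram_transcription"
        class_name = "DeepgramEngine"
    module = importlib.import_module(module_name)
    return getattr(module, class_name)
```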
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1. Dev mode: use `uv run python` instead of bare `python` to ensure
the project venv is used. Also use CARGO_MANIFEST_DIR to find the
project root reliably.
2. Engine reload: changing remote.mode (local/managed/byok) now
triggers a full engine reload. Previously only model and device
changes triggered reload, so switching to Deepgram had no effect
until the app was restarted.
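The reload decision, roughly (key names follow the commit; a sketch, not the real controller logic):

```python
def needs_engine_reload(old: dict, new: dict) -> bool:
    """Decide whether applying new settings requires a full engine
    reload. remote.mode is now included alongside model and device,
    so switching to Deepgram takes effect without a restart."""
    reload_keys = ("model", "device", "remote.mode")
    return any(old.get(k) != new.get(k) for k in reload_keys)
```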
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Test suite covering all three layers:
Python backend (25 tests):
- AppController: state machine, start/stop, callbacks, settings reload
- API server: REST endpoints, config CRUD, status, devices
- Config: dot-notation get/set, persistence, nested paths
- Main headless: ready event port format validation
Svelte frontend (14 tests via Vitest):
- Backend store: exported properties/methods, port derivation, URLs
- Config store: method names (fetchConfig not loadConfig), defaults
- Transcriptions store: add/clear/plaintext
- File extension regression: ensures $state runes appear only in
  .svelte.ts files
Rust sidecar (24 tests via cargo test):
- Platform/arch detection, asset name construction
- Ready event deserialization (with extra fields tolerance)
- Path construction, version read/write, old version cleanup
- Zip extraction, SidecarManager lifecycle
CI workflow (.gitea/workflows/test.yml):
- Runs on push to main and PRs
- Three parallel jobs: Python, Frontend, Rust
Also fixes three bugs found during test planning:
- Settings: /api/check-updates -> GET /api/check-update
- Settings: /api/remote/login -> /api/login
- Settings: /api/remote/register -> /api/register
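The dot-notation get/set behavior the config tests cover can be illustrated with a toy class (not the real Config):

```python
class DotConfig:
    """Minimal dot-notation config: 'remote.mode' walks nested dicts,
    creating intermediate levels on set and returning a default on a
    missing path."""

    def __init__(self):
        self._data = {}

    def set(self, path: str, value):
        node = self._data
        keys = path.split(".")
        for key in keys[:-1]:
            node = node.setdefault(key, {})
        node[keys[-1]] = value

    def get(self, path: str, default=None):
        node = self._data
        for key in path.split("."):
            if not isinstance(node, dict) or key not in node:
                return default
            node = node[key]
        return node
```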
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Three issues fixed:
1. Port mismatch: The sidecar reported the OBS port (8080) in the
ready event, but the frontend needs the API port (8081). It now
reports the API port so the WebSocket and REST clients connect to the
right place.
2. Broadcast from wrong thread: Engine init fires state_changed from
a background thread, but _broadcast_control used get_event_loop()
which returns the wrong loop. Now captures the uvicorn event loop
at startup via on_event("startup").
3. Missed ready state: If the engine finishes before the WebSocket
client connects, the "ready" state_changed was never received.
Added status polling (GET /api/status) on WebSocket connect that
retries every 2s while appState is "initializing".
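Fix 2 can be sketched like this, with a toy broadcaster standing in for _broadcast_control (names are illustrative):

```python
import asyncio
import threading

class Broadcaster:
    """Capture the server's event loop at startup, then use
    run_coroutine_threadsafe for broadcasts from engine threads;
    get_event_loop() in a background thread would return (or create)
    the wrong loop."""

    def __init__(self):
        self.loop = None
        self.messages = []

    def on_startup(self):
        # Runs inside the server's loop, e.g. via @app.on_event("startup").
        self.loop = asyncio.get_running_loop()

    async def _send(self, msg):
        self.messages.append(msg)

    def broadcast_from_thread(self, msg):
        # Safe to call from the engine's background thread.
        future = asyncio.run_coroutine_threadsafe(self._send(msg), self.loop)
        future.result(timeout=5)
```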
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Scaffold the cross-platform rewrite from PySide6/Qt to Tauri + Svelte,
following the same architecture as voice-to-notes. The Python backend
runs headless as a sidecar, with a FastAPI control API that the Svelte
frontend connects to via REST and WebSocket.
New files:
- backend/app_controller.py: Headless orchestration (extracted from MainWindow)
- backend/api_server.py: FastAPI control endpoints + /ws/control WebSocket
- backend/main_headless.py: Headless entry point for sidecar mode
- src-tauri/: Tauri v2 Rust shell with sidecar and dialog plugins
- src/: Svelte 5 frontend (App, Settings, Controls, TranscriptionDisplay)
- src/lib/stores/: Reactive stores for backend connection, config, transcriptions
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>