Add cloud-only sidecar variant (~50MB vs 500MB-2GB) · 3d3d7ec3c5 - local-transcription

Add cloud-only sidecar variant (~50MB vs 500MB-2GB)

All checks were successful

Tests / Python Backend Tests (push) Successful in 6s

Details

Tests / Frontend Tests (push) Successful in 7s

Details

Tests / Rust Sidecar Tests (push) Successful in 1m59s

Details

Lightweight Deepgram-only sidecar that excludes PyTorch, faster-whisper,
RealtimeSTT, and CUDA. Only includes audio capture + WebSocket streaming
to Deepgram. Requires a Deepgram API key (BYOK or managed mode).

Changes:
- client/models.py: Extracted TranscriptionResult into standalone module
  so deepgram_transcription.py doesn't transitively import torch
- backend/app_controller.py: Made RealtimeTranscriptionEngine and
  DeviceManager imports lazy (only loaded when remote.mode == "local")
- local-transcription-cloud.spec: PyInstaller spec excluding all ML deps
- SidecarSetup.svelte: Added "Cloud Only (Deepgram)" variant option
- build-sidecar-cloud.yml: CI workflow building cloud sidecar for all 3 OS
- sidecar-release.yml: Dispatches cloud build alongside CPU/CUDA builds

Sidecar download options are now:
- Standard (CPU): ~500 MB - local Whisper on any computer
- GPU Accelerated (CUDA): ~2 GB - local Whisper with NVIDIA GPU
- Cloud Only (Deepgram): ~50 MB - requires API key, no local models

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

This commit is contained in:

Developer

2026-04-07 16:57:43 -07:00

parent bb039399fc

commit 3d3d7ec3c5

10 changed files with 469 additions and 42 deletions

									
										2

client/deepgram_transcription.py
									
												View File
												
				@@ -17,7 +17,7 @@ from datetime import datetime

				from queue import Queue, Empty

				from typing import Optional, Callable

				from client.transcription_engine_realtime import TranscriptionResult

				from client.models import TranscriptionResult

				logger = logging.getLogger(__name__)

Add cloud-only sidecar variant (~50MB vs 500MB-2GB) All checks were successful Tests / Python Backend Tests (push) Successful in 6s Details Tests / Frontend Tests (push) Successful in 7s Details Tests / Rust Sidecar Tests (push) Successful in 1m59s Details

2 client/deepgram_transcription.py Unescape Escape View File

Add cloud-only sidecar variant (~50MB vs 500MB-2GB)

All checks were successful

Tests / Python Backend Tests (push) Successful in 6s

Details

Tests / Frontend Tests (push) Successful in 7s

Details

Tests / Rust Sidecar Tests (push) Successful in 1m59s

Details

2

client/deepgram_transcription.py

View File