Initial commit: Local Transcription App v1.0
Phase 1 Complete - Standalone Desktop Application

Features:
- Real-time speech-to-text with Whisper (faster-whisper)
- PySide6 desktop GUI with settings dialog
- Web server for OBS browser source integration
- Audio capture with automatic sample rate detection and resampling
- Noise suppression with Voice Activity Detection (VAD)
- Configurable display settings (font, timestamps, fade duration)
- Settings apply without restart (with automatic model reloading)
- Auto-fade for web display transcriptions
- CPU/GPU support with automatic device detection
- Standalone executable builds (PyInstaller)
- CUDA build support (works on systems without CUDA hardware)

Components:
- Audio capture with sounddevice
- Noise reduction with noisereduce + webrtcvad
- Transcription with faster-whisper
- GUI with PySide6
- Web server with FastAPI + WebSocket
- Configuration system with YAML

Build System:
- Standard builds (CPU-only): build.sh / build.bat
- CUDA builds (universal): build-cuda.sh / build-cuda.bat
- Comprehensive BUILD.md documentation
- Cross-platform support (Linux, Windows)

Documentation:
- README.md with project overview and quick start
- BUILD.md with detailed build instructions
- NEXT_STEPS.md with future enhancement roadmap
- INSTALL.md with setup instructions

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
40
config/default_config.yaml
Normal file
@@ -0,0 +1,40 @@
+user:
+  name: "User"
+  id: ""
+
+audio:
+  input_device: "default"
+  sample_rate: 16000
+  chunk_duration: 3.0
+
+noise_suppression:
+  enabled: true
+  strength: 0.7
+  method: "noisereduce"
+
+transcription:
+  model: "base"
+  device: "auto"
+  language: "en"
+  task: "transcribe"
+
+processing:
+  use_vad: true
+  min_confidence: 0.5
+
+server_sync:
+  enabled: false
+  url: "ws://localhost:8000"
+  api_key: ""
+
+display:
+  show_timestamps: true
+  max_lines: 100
+  font_family: "Courier"
+  font_size: 12
+  theme: "dark"
+  fade_after_seconds: 10  # Time before transcriptions fade out (0 = never fade)
+
+web_server:
+  port: 8080
+  host: "127.0.0.1"
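The application code that reads this file is not part of this diff, so the following is only a minimal sketch of how such defaults might be consumed. It assumes the YAML has already been parsed into nested dictionaries (for example with PyYAML's `yaml.safe_load`). The helper names `merge_config`, `chunk_samples`, and `fade_enabled` are hypothetical and do not come from the repository.

```python
from copy import deepcopy

# A subset of the defaults from config/default_config.yaml, written as the
# nested dict a YAML parser would produce (assumption: parsed elsewhere).
DEFAULTS = {
    "audio": {"input_device": "default", "sample_rate": 16000, "chunk_duration": 3.0},
    "display": {"show_timestamps": True, "fade_after_seconds": 10},
    "web_server": {"port": 8080, "host": "127.0.0.1"},
}


def merge_config(defaults, overrides):
    """Return a copy of `defaults` with `overrides` applied section by section."""
    merged = deepcopy(defaults)
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            # Recurse so that overriding one key keeps the section's other keys.
            merged[key] = merge_config(merged[key], value)
        else:
            merged[key] = deepcopy(value)
    return merged


def chunk_samples(config):
    """Number of audio samples in one chunk: sample_rate * chunk_duration."""
    audio = config["audio"]
    return int(round(audio["sample_rate"] * audio["chunk_duration"]))


def fade_enabled(config):
    """Per the config comment, fade_after_seconds == 0 means never fade."""
    return config["display"]["fade_after_seconds"] > 0


# Hypothetical user override: change only the web server port.
config = merge_config(DEFAULTS, {"web_server": {"port": 9000}})
print(chunk_samples(config))   # 16000 Hz * 3.0 s = 48000 samples
print(config["web_server"])    # host is kept from the defaults
```

Merging recursively rather than replacing whole sections means a user file can override a single key such as `web_server.port` without restating `host`. With the default `sample_rate` of 16000 and `chunk_duration` of 3.0, each chunk holds 48000 samples.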