Initial commit: Local Transcription App v1.0

Phase 1 Complete - Standalone Desktop Application

Features:
- Real-time speech-to-text with Whisper (faster-whisper)
- PySide6 desktop GUI with settings dialog
- Web server for OBS browser source integration
- Audio capture with automatic sample rate detection and resampling
- Noise suppression with Voice Activity Detection (VAD)
- Configurable display settings (font, timestamps, fade duration)
- Settings apply without restart (with automatic model reloading)
- Auto-fade for web display transcriptions
- CPU/GPU support with automatic device detection
- Standalone executable builds (PyInstaller)
- CUDA build support (works on systems without CUDA hardware)

Components:
- Audio capture with sounddevice
- Noise reduction with noisereduce + webrtcvad
- Transcription with faster-whisper
- GUI with PySide6
- Web server with FastAPI + WebSocket
- Configuration system with YAML

Build System:
- Standard builds (CPU-only): build.sh / build.bat
- CUDA builds (universal): build-cuda.sh / build-cuda.bat
- Comprehensive BUILD.md documentation
- Cross-platform support (Linux, Windows)

Documentation:
- README.md with project overview and quick start
- BUILD.md with detailed build instructions
- NEXT_STEPS.md with future enhancement roadmap
- INSTALL.md with setup instructions

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
2025-12-25 18:48:23 -08:00
commit 472233aec4
31 changed files with 5116 additions and 0 deletions

23
requirements.txt Normal file
View File

@@ -0,0 +1,23 @@
# Core Dependencies
numpy>=1.24.0
pyyaml>=6.0
# Audio Processing
sounddevice>=0.4.6
scipy>=1.10.0
# Noise Suppression
noisereduce>=3.0.0
webrtcvad>=2.0.10
# Transcription - Using faster-whisper for better real-time performance
faster-whisper>=0.10.0
torch>=2.0.0
# GUI - Using CustomTkinter for modern look
customtkinter>=5.2.0
pillow>=10.0.0
# Optional: Server sync dependencies (will move to requirements-server.txt later)
# websockets>=12.0
# requests>=2.31.0