Initial commit: Local Transcription App v1.0
Phase 1 Complete - Standalone Desktop Application

Features:
- Real-time speech-to-text with Whisper (faster-whisper)
- PySide6 desktop GUI with settings dialog
- Web server for OBS browser source integration
- Audio capture with automatic sample rate detection and resampling
- Noise suppression with Voice Activity Detection (VAD)
- Configurable display settings (font, timestamps, fade duration)
- Settings apply without restart (with automatic model reloading)
- Auto-fade for web display transcriptions
- CPU/GPU support with automatic device detection
- Standalone executable builds (PyInstaller)
- CUDA build support (works on systems without CUDA hardware)

Components:
- Audio capture with sounddevice
- Noise reduction with noisereduce + webrtcvad
- Transcription with faster-whisper
- GUI with PySide6
- Web server with FastAPI + WebSocket
- Configuration system with YAML

Build System:
- Standard builds (CPU-only): build.sh / build.bat
- CUDA builds (universal): build-cuda.sh / build-cuda.bat
- Comprehensive BUILD.md documentation
- Cross-platform support (Linux, Windows)

Documentation:
- README.md with project overview and quick start
- BUILD.md with detailed build instructions
- NEXT_STEPS.md with future enhancement roadmap
- INSTALL.md with setup instructions

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
23
requirements.txt
Normal file
@@ -0,0 +1,23 @@
# Core Dependencies
numpy>=1.24.0
pyyaml>=6.0

# Audio Processing
sounddevice>=0.4.6
scipy>=1.10.0

# Noise Suppression
noisereduce>=3.0.0
webrtcvad>=2.0.10

# Transcription - Using faster-whisper for better real-time performance
faster-whisper>=0.10.0
torch>=2.0.0

# GUI - Using CustomTkinter for modern look
customtkinter>=5.2.0
pillow>=10.0.0

# Optional: Server sync dependencies (will move to requirements-server.txt later)
# websockets>=12.0
# requests>=2.31.0