Initial commit: Local Transcription App v1.0

Phase 1 Complete - Standalone Desktop Application

Features:
- Real-time speech-to-text with Whisper (faster-whisper)
- PySide6 desktop GUI with settings dialog
- Web server for OBS browser source integration
- Audio capture with automatic sample rate detection and resampling
- Noise suppression with Voice Activity Detection (VAD)
- Configurable display settings (font, timestamps, fade duration)
- Settings apply without restart (with automatic model reloading)
- Auto-fade for web display transcriptions
- CPU/GPU support with automatic device detection
- Standalone executable builds (PyInstaller)
- CUDA build support (works on systems without CUDA hardware)

Components:
- Audio capture with sounddevice
- Noise reduction with noisereduce + webrtcvad
- Transcription with faster-whisper
- GUI with PySide6
- Web server with FastAPI + WebSocket
- Configuration system with YAML

Build System:
- Standard builds (CPU-only): build.sh / build.bat
- CUDA builds (universal): build-cuda.sh / build-cuda.bat
- Comprehensive BUILD.md documentation
- Cross-platform support (Linux, Windows)

Documentation:
- README.md with project overview and quick start
- BUILD.md with detailed build instructions
- NEXT_STEPS.md with future enhancement roadmap
- INSTALL.md with setup instructions

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
2025-12-25 18:48:23 -08:00
commit 472233aec4
31 changed files with 5116 additions and 0 deletions

56
build-cuda.bat Normal file
View File

@@ -0,0 +1,56 @@
@echo off
REM Build script for Windows with CUDA support
echo Building Local Transcription with CUDA support...
echo ==================================================
echo.
echo This will create a build that supports both CPU and CUDA GPUs.
echo The executable will be larger (~2-3GB) but will work on any system.
echo.
set /p INSTALL_CUDA="Install PyTorch with CUDA support? (y/n) "
if /i "%INSTALL_CUDA%"=="y" (
echo Installing PyTorch with CUDA 12.1 support...
REM Uninstall CPU-only version if present
uv pip uninstall -y torch
REM Install CUDA-enabled PyTorch
REM This installs PyTorch with bundled CUDA runtime
uv pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
echo CUDA-enabled PyTorch installed
echo.
)
REM Clean previous builds
echo Cleaning previous builds...
if exist build rmdir /s /q build
if exist dist rmdir /s /q dist
REM Build with PyInstaller
echo Running PyInstaller...
uv run pyinstaller local-transcription.spec
REM Check if build succeeded
if exist "dist\LocalTranscription" (
echo.
echo Build successful!
echo Executable location: dist\LocalTranscription\LocalTranscription.exe
echo.
echo CUDA Support: YES (falls back to CPU if CUDA not available^)
echo.
echo To run the application:
echo cd dist\LocalTranscription
echo LocalTranscription.exe
echo.
echo To create a distributable package:
echo - Compress the dist\LocalTranscription folder to a ZIP file
echo - Name it: LocalTranscription-Windows-CUDA.zip
echo.
echo Note: This build will work on systems with or without NVIDIA GPUs.
) else (
echo.
echo Build failed!
exit /b 1
)