Phase 3: Speaker diarization and full transcription pipeline

- Implement DiarizeService with pyannote.audio speaker detection
- Build PipelineService combining transcribe → diarize → merge with
  overlap-based speaker assignment per segment
- Add pipeline.start and diarize.start IPC handlers
- Add run_pipeline Tauri command for full pipeline execution
- Wire frontend to use pipeline: speakers auto-created with colors,
  segments assigned to detected speakers
- Build SpeakerManager with rename support (double-click or edit button)
- Add speaker color coding throughout transcript display
- Add pyannote.audio dependency
- Tests: 24 Python (including merge logic), 6 Rust, 0 Svelte errors

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

This commit is contained in:

Josh Knapp

2026-02-26 16:09:48 -08:00

parent 842f8d5f90

commit 44480906a4

12 changed files with 806 additions and 24 deletions

									
										1

python/pyproject.toml
									
												View File
												
				@@ -11,6 +11,7 @@ license = "MIT"

				dependencies = [

				    "faster-whisper>=1.1.0",

				    "pyannote.audio>=3.1.0",

				]

				[project.optional-dependencies]

Phase 3: Speaker diarization and full transcription pipeline

1 python/pyproject.toml Unescape Escape View File

1

python/pyproject.toml

View File