Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions diarization/getting-started.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -11,6 +11,8 @@ description: Speaker diarization — identify who spoke when in audio.

## Quick Start

Pyannote 3.1 online/streaming pipeline:

```swift
import FluidAudio

Expand Down
2 changes: 1 addition & 1 deletion diarization/streaming.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@ description: Real-time speaker diarization for live audio streams.

## Overview

Process audio in chunks for real-time speaker labeling. Use this when you need speaker labels while transcription is happening. For most use cases, the [offline pipeline](/diarization/offline-pipeline) is more accurate.
Pyannote 3.1 pipeline for online/streaming diarization. Process audio in chunks for real-time speaker labeling. Use this when you need speaker labels while transcription is happening. For most use cases, the [offline pipeline](/diarization/offline-pipeline) is more accurate.

## Quick Start

Expand Down
2 changes: 1 addition & 1 deletion mobius/getting-started.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -54,7 +54,7 @@ These models have been converted and published to [Hugging Face](https://hugging
| **STT** | Parakeet TDT v2 0.6B | [NVIDIA](https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2) | [FluidInference](https://huggingface.co/FluidInference/parakeet-tdt-0.6b-v2-coreml) |
| **STT** | Parakeet EOU 120M | [NVIDIA](https://huggingface.co/nvidia/parakeet-tdt_ctc-110m) | [FluidInference](https://huggingface.co/FluidInference/parakeet-eou-120m-coreml) |
| **VAD** | Silero VAD v6 | [Silero](https://github.com/snakers4/silero-vad) | [FluidInference](https://huggingface.co/FluidInference/silero-vad-coreml) |
| **Diarization** | Pyannote Community 1 | [Pyannote](https://huggingface.co/pyannote/speaker-diarization-community-1) | [FluidInference](https://huggingface.co/FluidInference/speaker-diarization-coreml) |
| **Diarization** | Pyannote 3.1 (online) + Community-1 (offline) | [Pyannote 3.1](https://huggingface.co/pyannote/speaker-diarization-3.1) / [Community-1](https://huggingface.co/pyannote/speaker-diarization-community-1) | [FluidInference](https://huggingface.co/FluidInference/speaker-diarization-coreml) |
| **TTS** | Kokoro 82M | [Hexgrad](https://huggingface.co/hexgrad/Kokoro-82M) | [FluidInference](https://huggingface.co/FluidInference/kokoro-82m-coreml) |
| **TTS** | PocketTTS 155M | [Kyutai](https://huggingface.co/kyutai/pocket-tts) | [FluidInference](https://huggingface.co/FluidInference/pocket-tts-coreml) |
| **Embedding** | CAM++ | [3D-Speaker](https://github.com/alibaba-damo-academy/3D-Speaker) | [FluidInference](https://huggingface.co/FluidInference/cam-plusplus-coreml) |
Expand Down