Docs
Roadmap
Roadmap
We disclose part of our roadmap in this section. We have a good-faith intent to deliver on this roadmap but this intent does not imply a contractual commitment.
⏳ indicates active development with a less than 3 month expected release date.
| Open-source SDK | Pro SDK | Pro is planned to be | |
|---|---|---|---|
| WhisperKit Features | |||
| File Transcription | |||
| Language Detection | |||
| Word Timestamps | |||
| Custom Vocabulary | ⏳ | Multilingual | |
| SRT & VTT Output Formats | |||
| Real-time Transcription | |||
| Fast Model Load | |||
| SpeakerKit Features | |||
| Voice Activity Detection | |||
| Speaker Diarization | |||
| RTTM Output Format | |||
| Diarized Transcription | |||
| Speaker identification | |||
| Real-time Diarization | ⏳ |
We are also working on on-device inference kits for language, text-to-speech and OCR models.