CoreML Speech Models
Collection
Speech AI models for Apple Neural Engine via CoreML. iOS/macOS ready. ASR, TTS, VAD, diarization. β’ 12 items β’ Updated
Real-time speech enhancement model for Apple Silicon. Removes background noise from speech audio.
| Duration | Time | RTF |
|---|---|---|
| 5s | 0.65s | 0.13 |
| 10s | 1.2s | 0.12 |
| 20s | 4.8s | 0.24 |
import SpeechEnhancement
let enhancer = try await SpeechEnhancer.fromPretrained()
let clean = try enhancer.enhance(audio: noisyAudio, sampleRate: 48000)
swift run audio denoise noisy.wav --output clean.wav
DeepFilterNet3.mlpackage β Core ML FP16 model (Neural Engine)auxiliary.npz β ERB filterbank, Vorbis window, normalization states