Running natively via ONNX. No cloud. No API calls. Complete privacy.
Low-latency, real-time synthesis across desktop, browser, mobile, and edge. Fast enough to turn an entire webpage into audio in under a second.
Synthesize directly from text across 31 languages, or pass lang="na" for language-agnostic processing.
99M-parameter open-weight model: a fraction of the size of larger TTS systems, for faster downloads and a lower memory footprint.
Runs locally on desktop, mobile, browsers, and resource-constrained hardware like Raspberry Pi or e-readers with zero network dependency.
Outputs studio-grade 44.1kHz 16-bit WAV directly, ready for production playback without any external upsampler.
10 inline tags (e.g. <laugh>, <breath>, <sigh>) bring natural human nuance into generated speech.
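Because tags are written inline in the input text, it can help to scan a script for them before synthesis. The following is a minimal sketch, assuming tags take the `<name>` form shown above. The `find_inline_tags` helper is hypothetical and not part of the supertonic package, and it only knows the three tags named in this section, since the full set of 10 is not enumerated here.

```python
import re

# Only the three tags named in this section; the remaining tags are not listed here.
KNOWN_TAGS = {"laugh", "breath", "sigh"}

# Assumption: a tag is a lowercase word wrapped in angle brackets, e.g. <laugh>.
TAG_PATTERN = re.compile(r"<([a-z_]+)>")


def find_inline_tags(text: str) -> tuple[list[str], list[str]]:
    """Return (named, other) tag names in order of appearance.

    "named" holds tags listed in KNOWN_TAGS; "other" holds anything else
    that looks like a tag and should be checked against the full tag list.
    """
    named, other = [], []
    for name in TAG_PATTERN.findall(text):
        (named if name in KNOWN_TAGS else other).append(name)
    return named, other


script = "Well <sigh> that was close. <laugh> Let me catch my <breath>."
named, other = find_inline_tags(script)
print("tags named in this section:", named)
print("other tag-like tokens:", other)
```

A token landing in the second list is not necessarily invalid; it only means it is not one of the three examples given here.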
Evaluated on the Minimax-MLS-test benchmark, Supertonic 3 stays within a competitive WER/CER range against much larger open TTS models while preserving a lightweight on-device deployment path.
Supertonic 3 runs fast on CPU, even compared with larger baselines measured on A100 GPU, and uses substantially less memory.
At about 99M parameters across the public ONNX assets, Supertonic 3 is much smaller than 0.7B to 2B class open TTS systems.
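For readers unfamiliar with the metrics named above: WER and CER are conventionally the edit distance between a reference transcript and the recognized transcript, divided by the reference length, counted in words and characters respectively. The sketch below shows that standard definition only. It is not the benchmark's scoring script, and text normalization (casing, punctuation), which real evaluations apply, is left out.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion from the reference
                curr[j - 1] + 1,     # insertion into the reference
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edits divided by reference word count."""
    ref_words = reference.split()
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: character-level edits divided by reference length."""
    return edit_distance(reference, hypothesis) / len(reference)


reference = "supertonic runs on device"
hypothesis = "supertonic run on the device"
print(f"WER: {wer(reference, hypothesis):.2f}")
print(f"CER: {cer(reference, hypothesis):.2f}")
```

In this toy pair, one substituted word plus one inserted word over four reference words gives a WER of 0.50. Both functions assume a non-empty reference.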
Not sure which language your text is in? Pass lang="na" and Supertonic will handle it automatically.
On the first run, Supertonic downloads the model assets automatically.
pip install supertonic
from supertonic import TTS
# First run downloads the model automatically
tts = TTS(auto_download=True)
style = tts.get_voice_style(voice_name="M1")
text = "Supertonic is a lightning fast, on-device TTS system."
wav, duration = tts.synthesize(
    text=text,
    lang="en",  # Language code
    voice_style=style,  # Voice style
    total_steps=8,  # Quality: 5-12
    speed=1.05,  # Speed: 0.7-2.0
)
tts.save_audio(wav, "output.wav")
print(f"Generated {duration[0]:.2f}s of audio")
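To confirm that a saved file has the 44.1 kHz, 16-bit format described above, the WAV header can be read with Python's standard `wave` module. In this sketch a short synthetic mono tone stands in for `output.wav` so the snippet runs without the model. The `write_test_tone` and `describe_wav` helpers and the `tone.wav` file name are illustrative, and the channel count of real Supertonic output is not assumed.

```python
import math
import struct
import wave

SAMPLE_RATE = 44_100  # Hz
SAMPLE_WIDTH = 2      # bytes per sample, i.e. 16-bit PCM


def write_test_tone(path: str, seconds: float = 0.5, freq: float = 440.0) -> None:
    """Write a mono sine tone as a stand-in for synthesized speech."""
    n_frames = int(SAMPLE_RATE * seconds)
    frames = b"".join(
        struct.pack("<h", int(0.3 * 32767 * math.sin(2 * math.pi * freq * i / SAMPLE_RATE)))
        for i in range(n_frames)
    )
    with wave.open(path, "wb") as wf:
        wf.setnchannels(1)
        wf.setsampwidth(SAMPLE_WIDTH)
        wf.setframerate(SAMPLE_RATE)
        wf.writeframes(frames)


def describe_wav(path: str) -> dict:
    """Read format fields from a WAV header."""
    with wave.open(path, "rb") as wf:
        return {
            "sample_rate": wf.getframerate(),
            "bit_depth": wf.getsampwidth() * 8,
            "channels": wf.getnchannels(),
            "duration_s": wf.getnframes() / wf.getframerate(),
        }


write_test_tone("tone.wav")
info = describe_wav("tone.wav")
print(info)
```

Pointing `describe_wav` at `output.wav` from the example above should report the file's actual sample rate and bit depth.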
Run Supertonic as a local HTTP service for integration with other tools.
pip install 'supertonic[serve]'
supertonic serve --host 127.0.0.1 --port 7788
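Once the server is running, any HTTP client can call it. The sketch below builds a request for the OpenAI-compatible `/v1/audio/speech` route using only the standard library. The JSON fields follow the shape of the OpenAI speech API (`model`, `input`, `voice`, `response_format`). Which values the Supertonic server accepts is not stated here, so the `"supertonic"` model name and the reuse of `"M1"` as the voice are assumptions to check against the server's `/docs` page.

```python
import json
import urllib.request

BASE_URL = "http://127.0.0.1:7788"  # host and port from the serve command


def build_speech_request(text: str, voice: str = "M1") -> urllib.request.Request:
    """Build (but do not send) a POST request for the OpenAI-compatible route."""
    payload = {
        "model": "supertonic",     # assumption: accepted model name not confirmed
        "input": text,
        "voice": voice,            # assumption: voice names mirror the Python API
        "response_format": "wav",  # assumption: WAV is an accepted format value
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_speech_request("Hello from a local server.")
print(request.get_method(), request.full_url)

# With the server running, send the request and save the returned audio bytes:
# with urllib.request.urlopen(request) as response:
#     with open("speech.wav", "wb") as f:
#         f.write(response.read())
```

Building the request separately from sending it keeps the snippet runnable without a live server and makes the payload easy to inspect.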
POST /v1/tts (Native)
POST /v1/audio/speech (OpenAI compatible)
GET /docs (API Documentation)
Real-time text-to-speech on Raspberry Pi, demonstrating on-device performance.
Experience Supertonic on an Onyx Boox e-reader in airplane mode with zero network dependency.
Turn any webpage into audio in under one second with complete privacy.
Try Supertonic directly in your browser with our interactive demo. Experience real-time synthesis with different languages and voice styles.
Ready-to-use examples through ONNX Runtime across multiple platforms.
Additional SDKs are available for Rust, iOS, and Flutter:
Rust: memory-safe implementation
iOS: native iOS apps
Flutter: cross-platform apps
Turn your voice into a deployable, edge-native TTS with permanent ownership. Create custom voice profiles for both Supertonic 2 and Supertonic 3.
Need more voices or enterprise features? Check out our commercial offerings: