Voxtral TTS Demo
Generate realistic speech from text with custom or preset voices
Generate realistic speech from text with custom or preset voices
Transcribe audio clips to text in many languages
Run Cohere Transcribe locally in your browser on WebGPU.
Chat with a Victorianโera language model chatbot
Generate audio for a video using a text prompt
World-first embodied AI world model
VFig converts any diagram image into editable SVG code.
Portrait animation & lipsync with LTX 2.3
generate a video from an image with a text prompt
FireRed-Image-Edit ร Qwen-Image-Edit-Rapid (Transformers)
Edit image camera angle with interactive 3D controls
Chat with a multimodal AI using text, images, audio, or video
Generate high-quality images from text prompts
High-quality voice cloning TTS for 600+ languages
text to video, image to video, video extend
Turn any image into a DLSS 5 meme (using FLUX.2-klein-9b-kv)
generate a video from an image with a text prompt
Generate realistic speech from text with custom or preset voices
Image edit, text to image, image upscale, remove watermark
Run Gemma 4 locally in-browser on WebGPU w/ Transformers.js
Demo of the Collection of Qwen Image Edit LoRAs
Generate speech from text with custom voice, cloning, or presets
Chat with a multimodal AI using text, image, audio, or video
Portrait animation & lipsync with LTX 2.3
High-fidelity 3D Generation from images
Chat with a Victorianโera language model chatbot
Run Cohere Transcribe locally in your browser on WebGPU.
Transcribe audio clips to text in many languages
Embedding Leaderboard
Based 'Z-IMAGE TURBO'
Generate high-quality motions from text prompts
Create cinematic videos with audio from text prompts