Generate speech from text using a reference voice
Transcribe audio files to text instantly
Generate detailed prompts from any image