Japanese SFT/DPO data converted to speech via TTS, and audio caption data generated by Qwen3-Omni. All datasets are available for commercial use.
Ayuto Tsutsumi
Atotti
Recent Activity
liked a model about 9 hours ago
kyutai/ARC4_Encoder_Llama
liked a dataset 7 days ago
sbintuitions/voicebench-ja
liked a model 28 days ago
ACE-Step/ace-step-v1.5-1d-vae-stable-audio-format