TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation

Gihyun Kwon, Jong Chul Ye

Vision & Animation ICLR 2025

DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors

Keon Lee, Dong Won Kim, Jaehyeon Kim, Seungjun Chung, Jaewoong Cho

Voice Synthesis ICLR 2025

CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech

Jaehyeon Kim, Keon Lee, Seungjun Chung, Jaewoong Cho

Voice Synthesis ICLR 2024

DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech

Keon Lee, Kyumin Park, Daeyoung Kim

Voice Synthesis ICASSP 2023

RedPen: Region- and Reason-Annotated Dataset of Unnatural Speech

Kyumin Park, Keon Lee, Daeyoung Kim, Dongyeop Kang

Voice Synthesis arXiv 2022
