Total Size: 1914.71M

Dataset Overview

Dataset Type

Conversation

Language

Japanese

Speech Style

Conversational

Content

N/A

Audio Parameters

16 kHz, 16 bits

File Format

WAV

Recording Equipment

mobile

Recording Environment

indoor

License

MAGIC DATA OPEN-SOURCE LICENSE

Open Source

Conversation

10.35 hours

Japanese Duplex Conversation Training Dataset

This dataset focuses on processing Japanese conversational speech in real-world settings. Designed in a conversation-based style, it captures the interactive and complex nature of everyday communication, thereby enhancing model performance in authentic conversational environments. Recordings were made using mobile devices, a choice that closely mirrors actual usage scenarios and highlights the dataset’s practical relevance. With a total duration of 10 hours, the dataset offers a diverse and realistic collection of conversational speech samples.

Sample：

Two-speaker conversation with separate tracks:

The dataset is not for commercial use. The open-source dataset may be used for academic research and must be properly cited with the source.

Citation Format：Japanese Duplex Conversation Training Dataset. 2025. https://magichub.com/datasets/japanese-duplex-conversation-training-dataset/. Beijing Magic Data Technology Co., Ltd.

For more commercial datasets, please contact business@magicdatatech.com.

Recording Environment

备案号: 京ICP备18008050号-6号

京公网安备 11010802035822号

Your IP is: 216.73.216.82

SIGN IN

SIGN UP

Total Size: 1914.71M

Dataset Overview

Dataset Type

Language

Speech Style

Content

Audio Parameters

File Format

Recording Equipment

Recording Environment

License

MAGIC DATA OPEN-SOURCE LICENSE

Japanese Duplex Conversation Training Dataset

Dataset Overview

Dataset Type

Language

Speech Style

Content

Audio Parameters

File Format

Recording Equipment

Recording Environment

License

MAGIC DATA OPEN-SOURCE LICENSE

京公网安备 11010802035822号

SIGN IN

SIGN UP

Total Size: 1914.71M

Dataset Overview

Dataset Type

Language

Speech Style

Content

Audio Parameters

File Format

Recording Equipment

Recording Environment

License

MAGIC DATA OPEN-SOURCE LICENSE

Japanese Duplex Conversation Training Dataset

Dataset Overview

Dataset Type

Language

Speech Style

Content

Audio Parameters

File Format

Recording Equipment

Recording Environment

License

MAGIC DATA OPEN-SOURCE LICENSE

京公网安备 11010802035822号

Verifying Email