MagicData

Total Size: 282M

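Overview

Dataset type: ASR dataset
Language: English
Speech type: N/A
Content: N/A
Audio parameters: 16 kHz, 16-bit
File format: WAV (PCM)
Recording device: Mobile phone
Recording environment: Mobile phone

Open-source dataset
ASR dataset
5 hours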

Multi-stream Spontaneous Conversation Training Datasets_English

The multi-stream conversation dataset developed by MagicData records each speaker on a separate audio track and labels each speaker separately, which preserves the natural interruptions, interactions, and other dynamics of conversation. Isolating each speaker's audio yields clearer and more accurate training data, which helps models understand and respond to natural conversational exchanges more effectively. To make the dataset more accessible, we have released a 5-hour sample as part of our open-source initiative: "Multi-stream Spontaneous Conversation Training Datasets_English".

For more commercial datasets, please contact business@magicdatatech.com.
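
After downloading the sample, it can be useful to confirm that each per-speaker track matches the audio parameters listed on this page (16 kHz, 16-bit, WAV PCM). The following is a minimal sketch using Python's standard `wave` module. The function names `describe_wav` and `matches_listing` are illustrative, and the archive's actual file names and directory layout are not documented here, so any path passed in is an assumption.

```python
import wave

EXPECTED_RATE_HZ = 16_000    # "16 kHz" as stated in the listing
EXPECTED_SAMPLE_WIDTH = 2    # "16 bits" = 2 bytes per sample


def describe_wav(path):
    """Return the basic PCM parameters of a WAV file."""
    with wave.open(path, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_rate_hz": rate,
            "sample_width_bytes": wav.getsampwidth(),
            "duration_s": frames / rate if rate else 0.0,
        }


def matches_listing(info):
    """Check sample rate and bit depth against the values stated on this page."""
    return (
        info["sample_rate_hz"] == EXPECTED_RATE_HZ
        and info["sample_width_bytes"] == EXPECTED_SAMPLE_WIDTH
    )
```

For example, `matches_listing(describe_wav("speaker_a.wav"))` would return `True` for a conforming track, where `speaker_a.wav` is a hypothetical file name. The channel count is reported but not checked, because the listing does not state how many channels each file contains.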
