Total Size: 104 MB

Dataset Overview

Dataset Type

speech corpus for TTS

Language

cmn-Tianjin, Jin Chinese (Tianjin, China)

Speech Style

scripted monologue

Content

daily use sentences

Audio Parameters

48 kHz, 24 bits, mono

File Format

WAV (PCM)
TXT (UTF-8)

Recording Equipment

Neumann U87-Neve 1073-RME Fireface

Recording Environment

recording studio
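
The audio parameters listed above (48 kHz, 24 bits, mono, PCM WAV) can be checked on a downloaded file with a few lines of Python. This is a minimal sketch using only the standard-library `wave` module; the function names and the `EXPECTED` dictionary are illustrative and not part of the dataset.

```python
import wave

# Values taken from the "Audio Parameters" entry: 48 kHz, 24 bits, mono.
# 24 bits corresponds to a sample width of 3 bytes.
EXPECTED = {"channels": 1, "sample_width_bytes": 3, "sample_rate_hz": 48000}


def read_wav_params(path):
    """Return channel count, sample width (bytes), and sample rate of a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        return {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "sample_rate_hz": wav.getframerate(),
        }


def matches_spec(path, expected=EXPECTED):
    """True if the file's header agrees with the expected audio parameters."""
    return read_wav_params(path) == expected
```

Calling `matches_spec("some_file.wav")` on each recording gives a quick sanity check before training. Note that `wave` only reads uncompressed PCM, and Python versions before 3.12 may reject files whose header uses the WAVE_FORMAT_EXTENSIBLE layout.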

Tianjin Dialect Speech Corpus for TTS

200 annotated sentences of male voices in the Tianjin dialect, suitable for text-to-speech synthesis

This open-source dataset consists of 200 annotated sentences spoken by male voices in the Tianjin dialect and is suitable for text-to-speech synthesis.

Sample:

能陪产就陪产,老婆生孩子太不容易了。
neng2 pei2 can3 jiu4 pei2 can3
lao3 po5 seng1 hai2 zii5 tai4 bu4 yong2 yi4 le5
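
The sample above pairs a sentence with pinyin-style annotation in which each syllable ends in a tone digit. A rough sketch of how such an annotation might be parsed and checked against the text follows. It assumes, based only on this one sample, that syllables are lowercase letters followed by a digit from 1 to 5 and that each Chinese character maps to exactly one syllable; the function names are hypothetical, and the real annotation files may follow a different layout.

```python
import re

# Assumed token shape, inferred from the sample: letters plus a tone digit 1-5.
SYLLABLE = re.compile(r"^([a-z]+)([1-5])$")
# Common CJK Unified Ideographs; punctuation is deliberately excluded.
HANZI = re.compile(r"[\u4e00-\u9fff]")


def parse_pinyin(line):
    """Split an annotation line into (syllable, tone) pairs."""
    pairs = []
    for token in line.split():
        match = SYLLABLE.match(token)
        if match is None:
            raise ValueError(f"unexpected token: {token!r}")
        pairs.append((match.group(1), int(match.group(2))))
    return pairs


def is_aligned(text, pinyin_lines):
    """True if the number of Chinese characters equals the number of syllables."""
    n_hanzi = len(HANZI.findall(text))
    n_syllables = sum(len(parse_pinyin(line)) for line in pinyin_lines)
    return n_hanzi == n_syllables


sample_text = "能陪产就陪产,老婆生孩子太不容易了。"
sample_pinyin = [
    "neng2 pei2 can3 jiu4 pei2 can3",
    "lao3 po5 seng1 hai2 zii5 tai4 bu4 yong2 yi4 le5",
]
print(is_aligned(sample_text, sample_pinyin))
```

A check like this can flag entries where the text and annotation have drifted apart before the data is fed to a TTS front end.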

The dataset is provided on an “As Is” basis, and no warranty, either express or implied, is given. Your use of the dataset is at your sole risk. You expressly understand and agree that MagicHub and/or Beijing Magic Data Technology Co., Ltd. shall not be liable for any direct, indirect, incidental, special or consequential damages, including but not limited to damages for loss of profits, goodwill, use, data or other intangible losses related to the dataset.

Copyright © 2021 Beijing Magic Data Technology Co., Ltd. All rights reserved.
