Dataset Overview

Dataset Type

speech corpus for TTS

Language

zh-CN, Mandarin Chinese (China)

Speech Style

scripted monologue

Content

daily use sentences, fables, and stories

Audio Format

44.1 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF-8)

Recording Equipment

Philips K38003

Recording Environment

quiet indoor environment

Category

TTS Corpus

Mandarin Chinese Speech Corpus for TTS – Children Speech

224 annotated utterances from a female child speaker in Mandarin Chinese, suitable for text-to-speech synthesis

This open-source dataset contains 15 minutes of annotated Mandarin Chinese speech suitable for text-to-speech synthesis. It comprises 224 utterances recorded from a five-year-old girl.
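The overview lists the audio as 44.1 kHz, 16-bit, mono WAV (PCM). A minimal sketch for checking a downloaded file against those values with Python's standard `wave` module follows. The function name `describe_wav` and the constant names are illustrative only and are not part of the dataset.

```python
import wave

# Expected values taken from the dataset overview: 44.1 kHz, 16 bits, mono.
EXPECTED_RATE = 44100
EXPECTED_SAMPLE_WIDTH = 2  # bytes per sample, i.e. 16 bits
EXPECTED_CHANNELS = 1


def describe_wav(path):
    """Return (matches_spec, duration_seconds) for a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        matches = (
            wav.getframerate() == EXPECTED_RATE
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH
            and wav.getnchannels() == EXPECTED_CHANNELS
        )
        # Duration is the frame count divided by the sampling rate.
        duration = wav.getnframes() / wav.getframerate()
    return matches, duration
```

Summing the returned durations over all files should come to roughly the 15 minutes stated above, which gives a quick sanity check on a download.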

Sample:

小刺猬#1向#1妈妈#1敬礼#4。
xiao3 ci4 wei5 xiang4 ma1 ma5 jing4 li3
小刺猬/n 向/p 妈妈/n 敬礼/v
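
The three sample lines above can be read programmatically. The sketch below rests on assumptions the page does not confirm: that `#1` to `#4` mark prosodic boundary levels, that the trailing digit in each pinyin syllable is a tone number (with 5 as the neutral tone), and that `/n`, `/p`, and `/v` are part-of-speech tags. All function names are hypothetical.

```python
import re


def parse_prosody(line):
    """Return (chunk, level) pairs from text annotated with #N markers."""
    # Punctuation is excluded from chunks so the final "。" is dropped.
    return [(chunk, int(level))
            for chunk, level in re.findall(r"([^#。，！？、]+)#(\d)", line)]


def parse_pinyin(line):
    """Return (syllable, tone) pairs from tone-numbered pinyin."""
    pairs = []
    for token in line.split():
        match = re.fullmatch(r"([a-z:]+)([1-5])", token)
        if match is None:
            raise ValueError(f"unexpected pinyin token: {token!r}")
        pairs.append((match.group(1), int(match.group(2))))
    return pairs


def parse_pos(line):
    """Return (word, tag) pairs from word/tag tokens."""
    return [tuple(token.rsplit("/", 1)) for token in line.split()]


sample = [
    "小刺猬#1向#1妈妈#1敬礼#4。",
    "xiao3 ci4 wei5 xiang4 ma1 ma5 jing4 li3",
    "小刺猬/n 向/p 妈妈/n 敬礼/v",
]
prosody = parse_prosody(sample[0])
pinyin = parse_pinyin(sample[1])
pos = parse_pos(sample[2])

# Consistency checks: both text lines should spell the same sentence, and
# (assuming one syllable per character) the counts should agree.
plain = "".join(chunk for chunk, _ in prosody)
assert plain == "".join(word for word, _ in pos)
assert len(plain) == len(pinyin)
```

The one-syllable-per-character check holds for this sample but may need relaxing for utterances with erhua or other merged syllables.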

The dataset is provided on an “As Is” basis, and no warranty, either expressed or implied, is given. Your use of the dataset is at your sole risk. You expressly understand and agree that MagicHub and/or Beijing Magic Data Technology Co., Ltd. shall not be liable for any direct, indirect, incidental, special or consequential damages; including but not limited to, damages for loss of profits, goodwill, use, data or other intangible losses related to the datasets.

Copyright © 2021 Beijing Magic Data Technology Co., Ltd. All rights reserved.
