
Overview

Dataset type

Automatic speech recognition (ASR) audio dataset

Language

zh-CN, Mandarin Chinese (China)

Speech type

scripted monologue

Content

Commands and queries in vehicle-related scenes

Audio parameters

44.1 kHz, 16 bits, dual channel

File format

WAV (PCM)
TXT (UTF-8)

Recording device

microphone

Recording environment

in-vehicle environment

Category

ASR Corpus

Mandarin Chinese Scripted Speech Audio Dataset: In-Vehicle Scenes

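A total of 6.13 hours of Mandarin Chinese scripted speech audio with transcriptions.
The content consists mainly of command-and-control utterances for in-vehicle scenes.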

This open-source dataset consists of 6.13 hours of transcribed Mandarin Chinese scripted speech, focusing on commands and queries in vehicle-related scenes. It contains 5,948 utterances contributed by ten speakers.
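As a rough sanity check on these figures, the short Python sketch below derives the average utterance length and the average number of utterances per speaker. It assumes the 6.13 hours is the total duration across all 5,948 utterances, counted once per utterance; the page does not say how the total is measured, so treat the result as an estimate.

```python
# Figures quoted in the dataset description.
TOTAL_HOURS = 6.13
UTTERANCES = 5948
SPEAKERS = 10


def mean_utterance_seconds(total_hours, utterances):
    """Average utterance length in seconds, assuming the total covers each utterance once."""
    return total_hours * 3600 / utterances


avg_seconds = mean_utterance_seconds(TOTAL_HOURS, UTTERANCES)
per_speaker = UTTERANCES / SPEAKERS

print(f"about {avg_seconds:.2f} s per utterance")
print(f"about {per_speaker:.1f} utterances per speaker")
```

Under that assumption the utterances average a little under four seconds each, which is consistent with short spoken commands and queries.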

A noteworthy feature is that two microphones were used during recording: one at the sun visor and another near the mouth of the speaker, who sat in the front passenger seat. As a result, two synchronized audio tracks were recorded.
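The sketch below shows one way to check the stated audio parameters and to separate the two microphone signals, using only the Python standard library. It assumes that each WAV file stores both microphones as the two channels of a single 16-bit PCM file. The page does not confirm this layout, nor which channel corresponds to which microphone, and the function names `describe_wav` and `split_stereo_16bit` are hypothetical helpers introduced for illustration.

```python
import struct
import wave


def describe_wav(path):
    """Return basic PCM parameters of a WAV file."""
    with wave.open(path, "rb") as wav:
        return {
            "sample_rate_hz": wav.getframerate(),
            "bits_per_sample": wav.getsampwidth() * 8,
            "channels": wav.getnchannels(),
            "duration_s": wav.getnframes() / wav.getframerate(),
        }


def split_stereo_16bit(path):
    """Split a 16-bit two-channel WAV into two lists of samples.

    Assumption: the two microphones are interleaved as channel 0 and
    channel 1 of one file. Which microphone is which is not documented.
    """
    with wave.open(path, "rb") as wav:
        if wav.getnchannels() != 2 or wav.getsampwidth() != 2:
            raise ValueError("expected a 16-bit, two-channel WAV file")
        raw = wav.readframes(wav.getnframes())
    # WAV PCM samples are little-endian signed 16-bit integers,
    # interleaved frame by frame: ch0, ch1, ch0, ch1, ...
    samples = struct.unpack("<%dh" % (len(raw) // 2), raw)
    return list(samples[0::2]), list(samples[1::2])
```

For a file that matches the description, `describe_wav` should report a 44100 Hz sample rate, 16 bits per sample, and 2 channels. If the dataset instead ships one mono file per microphone, `split_stereo_16bit` raises a `ValueError` and the two files can simply be read separately.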

Sample:

“去珠江发展中心的最快路线” (“The fastest route to Zhujiang Development Center”)

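This dataset is provided on an "as is" basis, without warranties of any kind, express or implied. You use the dataset entirely at your own risk. You expressly understand and agree that MagicHub and/or 北京爱数智慧科技有限公司 shall not be liable for any direct, indirect, incidental, special, or consequential damages, including but not limited to damages for loss of profits, goodwill, use, data, or other intangible losses related to this dataset.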

Copyright © 2021 北京爱数智慧科技有限公司. All rights reserved.

More datasets of this kind are available. If you have any questions or data requirements, please feel free to contact us.
