字幕翻译

German Scripted Speech Corpus - Command and Query
Category : ASR Corpus
Datasets Source : MagicData
Language : de-DE,
German (Germany)
Content : commands and queries
Tags : German
Size : 62 MB
File Format : WAV (PCM)
TXT (UTF8)
License : Magic Data
open-source license
Zhengzhou Dialect Scripted Speech Corpus - Daily Use Sentence
Category : ASR Corpus
Datasets Source : MagicData
Language : cmn-Zhengzhou,
Mandarin Chinese (Zhengzhou, China)
Content : daily use sentences
Tags : 中文方言
Size : 437 MB
File Format : WAV (PCM)
TXT (UTF8)
License : Magic Data
open-source license
Zhengzhou Dialect Conversational Speech Corpus
Category : ASR Corpus
Datasets Source : MagicData
Language : cmn-Zhengzhou,
Mandarin Chinese (Zhengzhou, China)
Content : 给定主题的对话
Tags : 中文方言
Size : 308 MB
File Format : WAV (PCM)
TXT (UTF8)
License : Magic Data
open-source license
Mandarin Chinese Conversational Speech Corpus - Web Meeting
Category : ASR Corpus
Datasets Source : MagicData
Language : zh-CN,
Mandarin Chinese (China)
Content : conversations
(web meetings)
Tags : 中文普通话
Size : 202 MB
File Format : WAV (PCM)
TXT (UTF8)
License : Magic Data
open-source license
English and Czech telephone converation data from Vystadial
Category : ASR Corpus
Datasets Source : Vystadial
Language : English and Czech
Content : 给定主题的对话
Tags : English, Czech
Size : 4.2G
File Format : WAV (PCM) TXT (UTF8)
License : Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0 US)
Chinese Mandarin Speech Corpus from Aishell
Category : ASR Corpus
Datasets Source : Aishell
Language : Mandarin Chinese (China)
Content : daily use sentences
Tags : 中文普通话
Size : 15GB
File Format : WAV (PCM) TXT (UTF8)
License : Apache License v.2.0
马来语对话音频数据集
Category : ASR Corpus
Datasets Source : MagicData
Language : ms-MY, Malay (Malaysia)
Content : 给定主题的对话
Tags : 马来西亚语
Size : 429 MB
File Format : WAV (PCM)
TXT (UTF8)
License : Magic Data
open-source license
印尼语对话音频数据集
Category : ASR Corpus
Datasets Source : MagicData
Language : id-ID,印尼语(印度尼西亚)
Content : 给定主题的对话
Tags : indonesian
Size : 322 MB
File Format : WAV (PCM)
TXT (UTF8)
License : Magic Data
open-source license
Mandarin Chinese Scripted Speech Corpus - Daily Use Sentence / Command and Query / SMS
Category : ASR Corpus
Datasets Source : MagicData
Language : zh-CN, Mandarin Chinese (China)
Content : daily use sentences,
commands and queries,
SMS
Tags : 中文普通话
Size : 59 GB
File Format : WAV (PCM)
TXT (UTF8)
License : Magic Data
open-source license
Mandarin Chinese Scripted Speech Corpus - in-Vehicle Scene
Category : ASR Corpus
Datasets Source : MagicData
Language : zh-CN, Mandarin Chinese (China)
Content : commands and queries
in vehicle-related scenes
Tags : 中文普通话
Size : 3.09 GB
File Format : WAV (PCM)
TXT (UTF8)
License : Magic Data
open-source license