英语数据集

Chinese-English Parallel Corpus - Finance
Category : NLP 语料库
Datasets Source : MagicData
Language : zh & en,
Chinese and English
Content : Chinese-English
parallel corpus
on finance-related
daily use sentences
Tags : Mandarin Chinese, English
Size : 8 KB
File Format : TXT (UTF8)
License : Magic Data
open-source license
English and Czech telephone converation data from Vystadial
Category : ASR Corpus
Datasets Source : Vystadial
Language : English and Czech
Content : 给定主题的对话
Tags : English, Czech
Size : 4.2G
File Format : WAV (PCM) TXT (UTF8)
License : Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0 US)
English Customer Service Scenario Text Corpus - Healthcare
Category : NLP 语料库
Datasets Source : MagicData
Language : en, English
Content : dialogical texts on
healthcare-related
customer service
Tags : 英语
Size : 29 KB
File Format : TXT (UTF8)
License : Magic Data
open-source license
Chinese English Scripted Speech Corpus - Children
Category : ASR Corpus
Datasets Source : MagicData
Language : en-CN,
English (China)
Content : words, phrases, and daily use sentences
Tags : 英语
Size : 151 MB
File Format : WAV (PCM)
TXT (UTF8)
License : Magic Data
open-source license
English Conversational Speech Corpus - Telephony
Category : ASR Corpus
Datasets Source : MagicData
Language : en,
英语
Content : conversations
Tags : 英语
Size : 214 MB
File Format : WAV (PCM)
TXT (UTF8)
License : Magic Data
open-source license