Open Dataset

500 hours of speech recordings, with speaker demographics

12.06G

Social Science,Music,Languages Classification

README.md

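General Information

Common Voice is a corpus of speech data read aloud by users on the Common Voice website (http://voice.mozilla.org/). The prompts are based on text from a number of public domain sources, such as user-submitted blog posts, old books, movies, and other public speech corpora. Its primary purpose is to enable the training and testing of automatic speech recognition (ASR) systems.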

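Structure

The corpus is split into several parts for convenience. Subsets with "valid" in their name contain audio clips that at least 2 people have listened to, where the majority of those listeners said the audio matches the text. Subsets with "invalid" in their name contain clips that at least 2 listeners have heard, where the majority said the audio does not match the text. All other clips, meaning those with fewer than 2 votes or with an equal number of valid and invalid votes, have "other" in their name.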
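The voting rule above can be sketched as a small function. This is an illustration only: the name `classify_clip` is hypothetical, and the sketch assumes the listener count equals `up_votes + down_votes`.

```python
def classify_clip(up_votes: int, down_votes: int) -> str:
    """Return 'valid', 'invalid', or 'other' for a clip's vote counts.

    Assumption: every listener casts exactly one vote, so the number of
    listeners is up_votes + down_votes.
    """
    total = up_votes + down_votes
    # Fewer than 2 votes, or a tie, means the clip is undecided.
    if total < 2 or up_votes == down_votes:
        return "other"
    return "valid" if up_votes > down_votes else "invalid"


print(classify_clip(2, 0))
print(classify_clip(1, 3))
print(classify_clip(1, 1))
```

A clip with a single vote falls into "other" regardless of which way that vote went.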

The "valid" and "other" subsets are further divided into 3 groups:

dev - for development and experimentation
train - for use in speech recognition training
test - for testing word error rate
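For readers unfamiliar with the metric the test group is meant for, word error rate is commonly computed as the word-level edit distance between a reference transcript and a recognizer's output, divided by the number of reference words. The following is a minimal sketch of that common definition, not a scoring tool shipped with the corpus; the function name `word_error_rate` and the whitespace tokenization are assumptions.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: words are separated by whitespace and compared exactly
    (no case folding or punctuation stripping).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j words of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


print(word_error_rate("the cat sat", "the dog sat"))
```

Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference.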

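Organization and Conventions

Each data subset has a corresponding CSV file that follows this naming convention:

"cv-{type}-{group}.csv"

Here "type" is one of {valid, invalid, other} and "group" is one of {dev, train, test}. Note that the invalid set is not divided into groups.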
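As a rough sketch, the metadata file names implied by this convention can be enumerated as follows. The helper name `metadata_filenames` is hypothetical, and the name used for the ungrouped invalid set, "cv-invalid.csv", is an assumption, since the text only states that the invalid set has no groups.

```python
from typing import List

GROUPED_TYPES = ("valid", "other")
GROUPS = ("dev", "train", "test")


def metadata_filenames() -> List[str]:
    """List the CSV names implied by the cv-{type}-{group}.csv pattern."""
    names = [f"cv-{t}-{g}.csv" for t in GROUPED_TYPES for g in GROUPS]
    # Assumption: the invalid set, which is not split into groups,
    # uses the pattern without the group part.
    names.append("cv-invalid.csv")
    return names


for name in metadata_filenames():
    print(name)
```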

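Each row of a CSV file represents a single audio clip and contains the following information:

filename - relative path of the audio file
text - supposed transcription of the audio
up_votes - number of people who said the audio matches the text
down_votes - number of people who said the audio does not match the text
age - age of the speaker, if the speaker reported it

teens: '< 19'
twenties: '19 - 29'
thirties: '30 - 39'
fourties: '40 - 49'
fifties: '50 - 59'
sixties: '60 - 69'
seventies: '70 - 79'
eighties: '80 - 89'
nineties: '> 89'

gender - gender of the speaker, if the speaker reported it

male
female
other

accent - accent of the speaker, if the speaker reported it

us: 'United States English'
australia: 'Australian English'
england: 'England English'
canada: 'Canadian English'
philippines: 'Filipino'
hongkong: 'Hong Kong English'
indian: 'India and South Asia (India, Pakistan, Sri Lanka)'
ireland: 'Irish English'
malaysia: 'Malaysian English'
newzealand: 'New Zealand English'
scotland: 'Scottish English'
singapore: 'Singaporean English'
southatlandtic: 'South Atlantic (Falkland Islands, Saint Helena)'
african: 'Southern Africa (South Africa, Zimbabwe, Namibia)'
wales: 'Welsh English'
bermuda: 'West Indies and Bermuda (Bahamas, Bermuda, Jamaica, Trinidad)'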
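The row layout above could be read with Python's standard `csv` module along these lines. This is a sketch under stated assumptions: the CSV is assumed to have a header row using the field names listed above, unreported demographics are assumed to appear as empty cells, and the `Clip` class, the `read_clips` helper, and the two sample rows are made up for illustration.

```python
import csv
import io
from dataclasses import dataclass
from typing import Iterator, Optional, TextIO


@dataclass
class Clip:
    filename: str
    text: str
    up_votes: int
    down_votes: int
    age: Optional[str]      # e.g. "twenties"; None if not reported
    gender: Optional[str]   # "male", "female", "other"; None if not reported
    accent: Optional[str]   # e.g. "us"; None if not reported


def read_clips(handle: TextIO) -> Iterator[Clip]:
    """Yield one Clip per CSV row, mapping empty demographics to None."""
    for row in csv.DictReader(handle):
        yield Clip(
            filename=row["filename"],
            text=row["text"],
            up_votes=int(row["up_votes"]),
            down_votes=int(row["down_votes"]),
            age=row.get("age") or None,
            gender=row.get("gender") or None,
            accent=row.get("accent") or None,
        )


# Hypothetical sample data standing in for a real metadata file.
sample = io.StringIO(
    "filename,text,up_votes,down_votes,age,gender,accent\n"
    "cv-valid-dev/sample-000000.mp3,hello world,2,0,twenties,female,us\n"
    "cv-valid-dev/sample-000001.mp3,good morning,3,1,,,\n"
)
clips = list(read_clips(sample))
for clip in clips:
    print(clip.filename, clip.up_votes, clip.age)
```

To read a real file, pass an open file handle instead of the `io.StringIO` object, for example `open("cv-valid-dev.csv", newline="", encoding="utf-8")`.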

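The audio clips for each subset are stored as MP3 files in a folder that follows the same naming convention as its corresponding CSV file. For example, all audio data from the valid train set is kept in the "cv-valid-train" folder, alongside the "cv-valid-train.csv" metadata file.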
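The folder convention can be expressed as a pair of small helpers. Both function names are hypothetical, and the check assumes that a clip's `filename` value includes the subset folder as a path component, which the text does not state explicitly.

```python
from pathlib import PurePosixPath


def clip_folder_for(csv_name: str) -> str:
    """Return the folder name that shares the CSV file's base name."""
    return PurePosixPath(csv_name).stem


def belongs_to_subset(csv_name: str, clip_filename: str) -> bool:
    """Check that a clip is an MP3 located under the subset's folder.

    Assumption: clip_filename is a relative path that contains the
    subset folder, e.g. "cv-valid-train/<clip>.mp3".
    """
    path = PurePosixPath(clip_filename)
    in_folder = clip_folder_for(csv_name) in path.parts[:-1]
    return in_folder and path.suffix.lower() == ".mp3"


print(clip_folder_for("cv-valid-train.csv"))
```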

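Acknowledgments

This dataset was compiled by Michael Henretty, Tilman Kamp, Kelly Davis, and the Common Voice Team, who included the following acknowledgments:

We sincerely thank all of the people who donated their voice on the Common Voice website and app. You are the foundation of this project, and we are grateful to you for making it possible!

We also thank our communities on Discourse (https://discourse.mozilla-community.org/c/voice) and Github (https://github.com/mozilla/voice-web). Thanks to you, this project keeps getting better.

Finally, special thanks go to Mycroft, SNIPS.ai, Mythic, Tatoeba.org, Bangor University, and SAP. Thank you for joining us on this journey. We look forward to collaborating further in the future.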

Data usage instructions:

I. Data Source and Display Explanation:

1. The data originates from internet data collection or provided by service providers, and this platform offers users the ability to view and browse datasets.

2. This platform serves only as a basic information display for datasets, including but not limited to image, text, video, and audio file types.

3. Basic dataset information comes from the original data source or the information provided by the data provider. If there are discrepancies in the dataset description, please refer to the original data source or service provider's address.

II. Ownership Explanation:

1. All datasets on this site are copyrighted by their original publishers or data providers.

III. Data Reposting Explanation:

1. If you need to repost data from this site, please retain the original data source URL and related copyright notices.

IV. Infringement and Handling Explanation:

1. If any data on this site involves infringement, please contact us promptly, and we will arrange for the data to be taken offline.
