Open Dataset

AISHELL - 1オープンソース中国語音声データベース

14.5G

852 hits

0 likes

2 downloads

0 discuss

Music Analysis Audio

Introduction
Data file
Related papers
Code
Discuss(0)
Instructions

Data Structure ? 14.5G

*The above analysis is the result extracted and analyzed by the system, and the specific actual data shall prevail.

README.md

ヒルシェル貝ックの中国語北京語オープンソース音声データベースAISHELL - ASR0009 - OS1の録音時間は178時間で、ヒルシェル貝ックの中国語北京語音声データベースAISHELL - ASR0009の一部です。AISHELL - ASR0009の録音テキストは、スマートホーム、無人運転、工業生産など11の分野に関わっています。録音は静かな室内環境で行われ、同時に3種類の異なる機器を使用しました。高忠実度マイク（44.1kHz、16ビット）；Androidシステムの携帯電話（16kHz、16ビット）；iOSシステムの携帯電話（16kHz、16ビット）。

高忠実度マイクで録音された音声は16kHzにダウンサンプリングされ、AISHELL - ASR0009 - OS1の制作に使用されます。中国のさまざまなアクセント地域から来た400人の話者が録音に参加しました。専門の音声校正担当者による書き起こしと注釈付けを行い、厳格な品質検査を通過したこのデータベースのテキスト正解率は95％以上です。訓練セット、開発セット、テストセットに分けられています。（学術研究をサポートしますが、許可なく商用利用は禁止されています。）

このオープンソースの北京語音声コーパスAISHELL - ASR0009 - OS1の長さは178時間です。これはAISHELL - ASR0009の一部で、その発話内容はスマートホーム、自動運転、工業生産など11の分野を含んでいます。全ての録音は静かな室内環境で行われ、同時に3種類の異なる機器を使用しました。高忠実度マイク（44.1kHz、16ビット）；Androidシステムの携帯電話（16kHz、16ビット）；iOSシステムの携帯電話（16kHz、16ビット）。高忠実度の音声は16kHzに再サンプリングされ、AISHELL - ASR0009 - OS1を構築するために使用されました。中国のさまざまなアクセント地域から400人の話者が録音に参加するよう招待されました。専門の音声注釈付けと厳格な品質検査を通じて、手動書き起こしの正解率は95％以上です。このコーパスは訓練セット、開発セット、テストセットに分けられています。（このデータベースは学術研究には無料で利用できますが、許可なく商用利用はできません。）

No content available at the moment

Share your thoughts

Go share your ideas~~

ALL

Welcome to exchange and share

Your sharing can help others better utilize data.

Data usage instructions:

I. Data Source and Display Explanation:

1. The data originates from internet data collection or provided by service providers, and this platform offers users the ability to view and browse datasets.

2. This platform serves only as a basic information display for datasets, including but not limited to image, text, video, and audio file types.

3. Basic dataset information comes from the original data source or the information provided by the data provider. If there are discrepancies in the dataset description, please refer to the original data source or service provider's address.

II. Ownership Explanation:

1. All datasets on this site are copyrighted by their original publishers or data providers.

III. Data Reposting Explanation:

1. If you need to repost data from this site, please retain the original data source URL and related copyright notices.

IV. Infringement and Handling Explanation:

1. If any data on this site involves infringement, please contact us promptly, and we will arrange for the data to be taken offline.

Points：

0 Go earn points？

852
2
0
collect
Share

Select Language

AI Technology Community

Today search ranking

month_search_ranking

Dataset Category

Open Dataset

AISHELL - 1オープンソース中国語音声データベース

Data Structure ? 14.5G

Data Structure ?

*The above analysis is the result extracted and analyzed by the system, and the specific actual data shall prevail.

README.md

Similar Data

The dataset is currently being organized and other channels have been prepared for you. Please use them

The dataset is currently being organized and other channels have been prepared for you. Please use them

ALL

I. Data Source and Display Explanation:

II. Ownership Explanation:

III. Data Reposting Explanation:

IV. Infringement and Handling Explanation: