Open Dataset
Data Structure ?
699.36M
Data Structure ?
*The above analysis is the result extracted and analyzed by the system, and the specific actual data shall prevail.
README.md
# 背景
人類創作音樂已有數萬年的歷史[1]。如今,創作音樂比以往任何時候都更加簡單和便捷。過去幾十年的技術發展讓人們能夠在電腦上模擬演奏幾乎所有能想象到的樂器。音頻序列器使用戶能夠逐樣本地在時間軸上排列他們的歌曲。數字音頻工作站(DAW)隨附虛擬樂器和合成器,使用戶能夠在自己的臥室裡虛擬演奏整個樂隊或管弦樂團。
使用DAW時的一個挑戰是以結構化的方式組織樣本和錄音,以便用戶能夠輕鬆訪問它們。除了自己的錄音外,許多用戶還會下載樣本。瀏覽樣本集以找到完美的聲音非常耗時,而且可能會阻礙用戶的創作流程[2]。此外,手動命名和標記錄音是一項耗時且乏味的任務,因此沒有太多用戶會這樣做[3]。結果是,在恰當的時刻找到合適的聲音成為了一個具有挑戰性的問題[4]。
對聲音的聲學內容和語義描述之間的關係進行建模,可以讓用戶使用文本查詢來檢索聲音。這個數據集是為了支持基於內容的音頻檢索系統的研究而收集的,重點是創作情境中使用的聲音。
# 內容
這個數據集於2016年6月從[Freesound](http://freesound.org) [5]收集而來。它包含了約23萬個聲音的基於幀的MFCC(Mel頻率倒譜係數)以及相關的標籤。
- `sounds.json`:最初從Freesound API下載的聲音元數據。這個文件包含每個聲音的`id`、相關的`標籤`、`預覽`鏈接以及包含基於幀的低層特徵的`analysis_frames`文件的鏈接。
- `preprocessed_tags.csv`:預處理過的標籤。只包含與至少0.01%的聲音相關聯的標籤。此外,標籤按連字符拆分並進行了詞幹提取。包含數字的標籤和少於三個字符的短標籤被移除。
- `queries.csv`:2016年5月11日至11月24日期間收集的針對Freesound數據庫的真實用戶查詢的聚合查詢日誌。
- `preprocessed_queries.csv`:查詢以與標籤相同的方式進行了預處理。
- `*_mfccs.csv.bz2`:每個聲音的原始MFCC,從`sounds.json`的`analysis_frames`字段中提供的URL提取,分為十個文件。
- `cb_{512|1024|2048|4096}_sparse.pkl`:聲音的碼本表示,保存為稀疏的`pd.DataFrame`。每個聲音的13個MFCC的一階和二階導數被附加到MFCC特徵向量上。使用K-Means(Mini-Batch K-Means)對所有幀進行聚類,以找到{512|1024|2048|4096}個聚類中心。然後,每個幀被分配到最接近的聚類中心,並使用計數將一個聲音表示為一個單一的{512|1024|2048|4096}維向量。
# 致謝
感謝巴塞羅那*龐培法布拉大學*的音樂技術小組創建和維護了Freesound [5]數據庫,並提供了聚合查詢日誌。
# 啟發
誰能創建出在*k*取值為{1, ..., 20}時以*k*精度和平均精度均值衡量的最佳基於內容的音頻檢索系統呢?
# 入門指南
以下是配套的GitHub存儲庫:https://github.com/dschwertfeger/cbar
# 參考文獻
[1] N. L. WallinとB. Merker,《The Origins of Music》。MIT出版社,2001年。
[2] M. Csikszentmihalyi,《Flow: The Psychology of Optimal Experience》。紐約:哈珀久負現代經典叢書,2008年。
[3] E. Pampalk、A. RauberおよびD. Merkl,「Content-based organization and visualization of music archives」
×
The dataset is currently being organized and other channels have been prepared for you. Please use them
The dataset is currently being organized and other channels have been prepared for you. Please use them
Note: Some data is currently being processed and cannot be directly downloaded. We kindly ask for your understanding and support.
No content available at the moment
No content available at the moment
- Share your thoughts
Go share your ideas~~
ALL
Welcome to exchange and share
Your sharing can help others better utilize data.
Data usage instructions: h1>
I. Data Source and Display Explanation:
- 1. The data originates from internet data collection or provided by service providers, and this platform offers users the ability to view and browse datasets.
- 2. This platform serves only as a basic information display for datasets, including but not limited to image, text, video, and audio file types.
- 3. Basic dataset information comes from the original data source or the information provided by the data provider. If there are discrepancies in the dataset description, please refer to the original data source or service provider's address.
II. Ownership Explanation:
- 1. All datasets on this site are copyrighted by their original publishers or data providers.
III. Data Reposting Explanation:
- 1. If you need to repost data from this site, please retain the original data source URL and related copyright notices.
IV. Infringement and Handling Explanation:
- 1. If any data on this site involves infringement, please contact us promptly, and we will arrange for the data to be taken offline.
- 1. The data originates from internet data collection or provided by service providers, and this platform offers users the ability to view and browse datasets.
- 2. This platform serves only as a basic information display for datasets, including but not limited to image, text, video, and audio file types.
- 3. Basic dataset information comes from the original data source or the information provided by the data provider. If there are discrepancies in the dataset description, please refer to the original data source or service provider's address.
- 1. All datasets on this site are copyrighted by their original publishers or data providers.
- 1. If you need to repost data from this site, please retain the original data source URL and related copyright notices.
- 1. If any data on this site involves infringement, please contact us promptly, and we will arrange for the data to be taken offline.