Select Language

Open Dataset

複数のテキスト分類データセット

複数のテキスト分類データセット

692 hits
0 likes
3 downloads
0 discuss
MNIST Classification

Data Structure ? 0M

    Data Structure ?

    *The above analysis is the result extracted and analyzed by the system, and the specific actual data shall prevail.

    README.md

    テキスト分類データセット:テキスト分類用のデータセットで、テキスト分類に使用できる8つのサブデータセットを含み、サンプルサイズは12万から360万、問題のレベル範囲は2レベルから14レベルで、データはDBPedia、Amazon、Yelp、Yahoo!、Sogou、AGから収集されています。

    ファイル

    グローバルネットワークにアクセスする必要があります。アクセス先:Google Drive

    8つのファイルが含まれており、ファイル名とサイズはそれぞれ以下の通りです。

    ag_news_csv.tar.gz 11MB
    amazon_review_full_csv.tar.gz 614MB
    amazon_review_polarity_csv.tar.gz 656MB
    DBPedia_csv.tar.gz 65MB
    sogou_news_csv.tar.gz 366MB
    yahoo_answers_csv.tar.gz 187MB
    yelp_review_polarity_csv.tar.gz 159MB

    関連論文

    1. Joachims T. Transductive Inference for Text Classification using Support Vector Machines[C]// Sixteenth International Conference on Machine Learning. Morgan Kaufmann Publishers Inc. 1999:200 - 209. 2. Joulin A, Grave E, Bojanowski P, et al. Bag of Tricks for Efficient Text Classification[J]. 2016:427 - 431. 3. Zhang Y, Wallace B. A Sensitivity Analysis of (and Practitioners’ Guide to) Convolutional Neural Networks for Sentence Classification[J]. Computer Science, 2015. 4. Ji Y L, Dernoncourt F. Sequential Short - Text Classification with Recurrent and Convolutional Neural Networks[J]. 2016:515 - 520. 5. Chen G, Ye D, Xing Z, et al. Ensemble application of convolutional and recurrent neural networks for multi - label text categorization[C]// International Joint Conference on Neural Networks. IEEE, 2017:2377 - 2383.

    ×

    The dataset is currently being organized and other channels have been prepared for you. Please use them

    The dataset is currently being organized and other channels have been prepared for you. Please use them

    Note: Some data is currently being processed and cannot be directly downloaded. We kindly ask for your understanding and support.
    No content available at the moment
    No content available at the moment
    • Share your thoughts
    Go share your ideas~~

    ALL

      Welcome to exchange and share
      Your sharing can help others better utilize data.
    Points:0 Go earn points?
    • 692
    • 3
    • 0
    • collect
    • Share