Compilation of datasets for machine learning models



  • Lists of datasets are collected and kept in this topic. To add a link, please edit the existing post.



    • https://www.kaggle.com/averkij/russian-chinese-parallel-corpora
    • https://archive.ics.uci.edu/ml/index.php
    • https://docs.google.com/spreadsheets/d/1ZSLP1McnXv0FtOd9t7dMp3AfaiusvaGwWV0F9g2pbho/edit#gid=0
    • https://my-cccp.ru/wp-content/uploads/2014/03/opredelenie-nacionalnosti.jpg
    • https://github.com/heartexlabs/awesome-data-labeling
    • https://www.kaggle.com/datasets
    • https://github.com/mathsyouth/awesome-text-summarization
    • https://www.apple.com/covid19/mobility
    • https://github.com/CSSEGISandData/COVID-19
    • https://drive.google.com/open?id=1hZpvoN-pEmpzYzEUYSJufUYSXXfXgEaY
    • https://www.crowdai.org/challenges/mapping-challenge https://drive.google.com/file/d/1EGw9EJLGik3ekAagBr6-SyMq8mcPJlQR/view?usp=drive_open
    • https://drive.google.com/file/d/19fN9R4lrIu2ANT6OajTbDTRIaO4j6Qku/view
    • https://foto.pamyat-naroda.ru/ Data on veterans from the Ministry of Defense website. It is expected to contain about a million records.
    • https://russe.nlpub.org/downloads/
    • https://tatianashavrina.github.io/taiga_site/
    • http://tpc.at.ispras.ru/prakticheskoe-zadanie-2015/
    • https://www.kaggle.com/blackmoon/russian-language-toxic-comments
    • https://github.com/X-zhangyang/Real-World-Masked-Face-Dataset
    • https://data.mendeley.com/datasets/8b8ygpt596/2
    • https://covid19faq.ru/l/ru/article/smgcguuguh-hand-collected-coronavirus-data-sources
    • https://www.kaggle.com/tapakah68/impressive-dataset
    • https://lionbridge.ai/datasets/best-portuguese-language-datasets-for-machine-learning/


