Files
awesome-awesomeness/terminal/datasets
2024-04-23 15:17:38 +02:00

88 lines
7.7 KiB
Plaintext
Raw Blame History

This file contains invisible Unicode characters
This file contains invisible Unicode characters that are indistinguishable to humans but may be processed differently by a computer. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
awesome-datasets
================
A curated list of awesome datasets for papers/experiments/validation.
- Awesome Datasets (#awesome-datasets)
- **Classification** (#classification) 
- **Semi-Supervised** (#semi-supervised) 
- **Regression** (#regression) 
- **Time-Series** (#time-series) 
- **Unsupervised (clustering)** (#unsupervised) 
- **Face Recognition** (#face-recognition) 
- **Image Processing** (#image-processing) 
- **Handwriting Recognition** (#handwriting-recognition)
- **Text Classification** (#text-classification) 
Classification
Datasets for classification.
⟡ KEEL - General (http://sci2s.ugr.es/keel/category.php?cat=clas) - General classification datasets.
⟡ KEEL - Missing-values (http://sci2s.ugr.es/keel/missing.php) - Missing values datasets.
⟡ KEEL - Imbalanced datasets (http://sci2s.ugr.es/keel/imbalanced.php) - Imbalanced datasets for classification.
⟡ KEEL - Multi-label (http://sci2s.ugr.es/keel/multilabel.php) - Multi-label datasets.
⟡ KEEL - Class noise (http://sci2s.ugr.es/keel/classNoise.php) - Datasets with class noise.
⟡ KEEL - Attribute noise (http://sci2s.ugr.es/keel/attributeNoise.php) - Datasets with attribute noise.
Semi-Supervised
Datasets for semi-supervised applications.
⟡ KEEL - semi-supervised (http://sci2s.ugr.es/keel/semisupervised.php) - Datasets for semi-supervised experiments.
⟡ KEEL - semi-supervised (http://sci2s.ugr.es/keel/semisupervised.php) - Datasets for semi-supervised experiments.
Regression
Datasets for regression applications.
⟡ KEEL - regression (http://sci2s.ugr.es/keel/category.php?cat=reg) - Datasets for regression experiments.
Time series
Datasets for time-series problems.
⟡ KEEL - time-series (http://sci2s.ugr.es/keel/category.php?cat=reg) - Datasets for time-series experiments.
Face Recognition
Face Recognition datasets.
⟡ JAFFE (http://kasrl.org/jaffe.html) - The Japanese Female Facial Expression (JAFFE) Database.
⟡ Carnegie Mellon (http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-8/faceimages/) - Datasets from theo-8 projects at Carnegie Mellon University.
⟡ Yale Face Database (http://vision.ucsd.edu/content/yale-face-database) - Datasets for facial expression (happy, sad, angry...) recognition.
⟡ Cohn-Kanade (http://www.pitt.edu/~emotion/ck-spread.htm) - The Cohn-Kanade AU-Coded Facial Expression Database is for research in automatic facial image analysis and synthesis and for perceptual studies.
⟡ AR face Database (http://www2.ece.ohio-state.edu/~aleix/ARdatabase.html) - Different facial expressions, illumination conditions and occlusions.
⟡ Face Detection CBCL (http://cbcl.mit.edu/software-datasets/FaceData2.html) - Face Detection Data from MIT.
⟡ Face Recognition LFW (http://vis-www.cs.umass.edu/lfw/) - Face Recognition from UMASS.
⟡ Face Recognition ORL (http://www.cl.cam.ac.uk/research/dtg/attarchive/facedatabase.html) - Face Recognition from AT&T.
Image Processing
Image Processing.
⟡ Microsoft - Salient Object Database (http://research.microsoft.com/en-us/um/people/jiansun/SalientObject/salient_object.htm) - MSRA Salient Object Database.
⟡ IVRG - Salient Object Database (http://ivrgwww.epfl.ch/supplementary_material/RK_CVPR09/) - Frequency-tuned Salient Region Detection.
⟡ ICDAR - Robust Reading (http://dag.cvc.uab.es/icdar2013competition/?com=introduction) - Robust Reading Competition.
⟡ Brodatz - Texture Recognition (http://www.ux.uis.no/~tranden/brodatz.html) - Texture Recognition.
⟡ Vistex - Texture Recognition (http://vismod.media.mit.edu/vismod/imagery/VisionTexture/vistex.html) - Texture Recognition.
⟡ Caltech - Object Categorization (http://www.vision.caltech.edu/Image_Datasets/Caltech101/) - Object Categorization from Caltech101.
⟡ Marcel - Gesture Recognition (http://www.idiap.ch/resource/gestures/) - Gesture Recognition from Marcel.
⟡ RPPDI - Gesture Recognition (http://rppdi.ecomp.poli.br/gesture/database/) - Gesture Recognition from RPPDI.
Handwriting Recognition
Handwriting Recognition
⟡ MNIST - Database of Handwritten Digits (http://yann.lecun.com/exdb/mnist/) - THE MNIST DATABASE of handwritten digits.
Text Classification
Text Classification
⟡ 20 Newsgroups (http://qwone.com/~jason/20Newsgroups/) - The 20 newsgroups text dataset.
⟡ Reuters-21578 (https://archive.ics.uci.edu/ml/datasets/Reuters-21578+Text+Categorization+Collection) - Reuters-21578 Text Categorization Collection Data Set