awesome-datasets ================ A curated list of awesome datasets for papers/experiments/validation. - [Awesome Datasets](#awesome-datasets) - [Classification](#classification) - [Semi-Supervised](#semi-supervised) - [Regression](#regression) - [Time-Series](#time-series) - [Unsupervised (clustering)](#unsupervised) - [Face Recognition](#face-recognition) - [Image Processing](#image-processing) - [Handwriting Recognition](#handwriting-recognition) - [Text Classification](#text-classification) ## Classification *Datasets for classification.* * [KEEL - General](http://sci2s.ugr.es/keel/category.php?cat=clas) - General classification datasets. * [KEEL - Missing-values](http://sci2s.ugr.es/keel/missing.php) - Missing values datasets. * [KEEL - Imbalanced datasets](http://sci2s.ugr.es/keel/imbalanced.php) - Imbalanced datasets for classification. * [KEEL - Multi-label](http://sci2s.ugr.es/keel/multilabel.php) - Multi-label datasets. * [KEEL - Class noise](http://sci2s.ugr.es/keel/classNoise.php) - Datasets with class noise. * [KEEL - Attribute noise](http://sci2s.ugr.es/keel/attributeNoise.php) - Datasets with attribute noise. ## Semi-Supervised *Datasets for semi-supervised applications.* * [KEEL - semi-supervised](http://sci2s.ugr.es/keel/semisupervised.php) - Datasets for semi-supervised experiments. * [KEEL - semi-supervised](http://sci2s.ugr.es/keel/semisupervised.php) - Datasets for semi-supervised experiments. ## Regression *Datasets for regression applications.* * [KEEL - regression](http://sci2s.ugr.es/keel/category.php?cat=reg) - Datasets for regression experiments. ## Time series *Datasets for time-series problems.* * [KEEL - time-series](http://sci2s.ugr.es/keel/category.php?cat=reg) - Datasets for time-series experiments. ## Face Recognition *Face Recognition datasets.* * [JAFFE](http://kasrl.org/jaffe.html) - The Japanese Female Facial Expression (JAFFE) Database. * [Carnegie Mellon](http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-8/faceimages/) - Datasets from theo-8 projects at Carnegie Mellon University. * [Yale Face Database](http://vision.ucsd.edu/content/yale-face-database) - Datasets for facial expression (happy, sad, angry...) recognition. * [Cohn-Kanade](http://www.pitt.edu/~emotion/ck-spread.htm) - The Cohn-Kanade AU-Coded Facial Expression Database is for research in automatic facial image analysis and synthesis and for perceptual studies. * [AR face Database](http://www2.ece.ohio-state.edu/~aleix/ARdatabase.html) - Different facial expressions, illumination conditions and occlusions. * [Face Detection CBCL](http://cbcl.mit.edu/software-datasets/FaceData2.html) - Face Detection Data from MIT. * [Face Recognition LFW](http://vis-www.cs.umass.edu/lfw/) - Face Recognition from UMASS. * [Face Recognition ORL](http://www.cl.cam.ac.uk/research/dtg/attarchive/facedatabase.html) - Face Recognition from AT&T. ## Image Processing *Image Processing.* * [Microsoft - Salient Object Database](http://research.microsoft.com/en-us/um/people/jiansun/SalientObject/salient_object.htm) - MSRA Salient Object Database. * [IVRG - Salient Object Database](http://ivrgwww.epfl.ch/supplementary_material/RK_CVPR09/) - Frequency-tuned Salient Region Detection. * [ICDAR - Robust Reading](http://dag.cvc.uab.es/icdar2013competition/?com=introduction) - Robust Reading Competition. * [Brodatz - Texture Recognition](http://www.ux.uis.no/~tranden/brodatz.html) - Texture Recognition. * [Vistex - Texture Recognition](http://vismod.media.mit.edu/vismod/imagery/VisionTexture/vistex.html) - Texture Recognition. * [Caltech - Object Categorization](http://www.vision.caltech.edu/Image_Datasets/Caltech101/) - Object Categorization from Caltech101. * [Marcel - Gesture Recognition](http://www.idiap.ch/resource/gestures/) - Gesture Recognition from Marcel. * [RPPDI - Gesture Recognition](http://rppdi.ecomp.poli.br/gesture/database/) - Gesture Recognition from RPPDI. ## Handwriting Recognition *Handwriting Recognition* * [MNIST - Database of Handwritten Digits](http://yann.lecun.com/exdb/mnist/) - THE MNIST DATABASE of handwritten digits. ## Text Classification *Text Classification* * [20 Newsgroups](http://qwone.com/~jason/20Newsgroups/) - The 20 newsgroups text dataset. * [Reuters-21578](https://archive.ics.uci.edu/ml/datasets/Reuters-21578+Text+Categorization+Collection) - Reuters-21578 Text Categorization Collection Data Set