Files
awesome-awesomeness/terminal/datasets
2024-04-22 21:54:39 +02:00

8.4 KiB

awesome-datasets
================
 
A curated list of awesome datasets for papers/experiments/validation.
 
- Awesome Datasets (#awesome-datasets)
- **Classification** (#classification)
- **Semi-Supervised** (#semi-supervised)
- **Regression** (#regression)
- **Time-Series** (#time-series)
- **Unsupervised (clustering)** (#unsupervised)
- **Face Recognition** (#face-recognition)
- **Image Processing** (#image-processing)
- **Handwriting Recognition** (#handwriting-recognition)
- **Text Classification** (#text-classification)
 
Classification
 
Datasets for classification.
 
KEEL - General (http://sci2s.ugr.es/keel/category.php?cat=clas) - General classification datasets.
KEEL - Missing-values (http://sci2s.ugr.es/keel/missing.php) - Missing values datasets.
KEEL - Imbalanced datasets (http://sci2s.ugr.es/keel/imbalanced.php) - Imbalanced datasets for classification.
KEEL - Multi-label (http://sci2s.ugr.es/keel/multilabel.php) - Multi-label datasets.
KEEL - Class noise (http://sci2s.ugr.es/keel/classNoise.php) - Datasets with class noise.
KEEL - Attribute noise (http://sci2s.ugr.es/keel/attributeNoise.php) - Datasets with attribute noise.
 
Semi-Supervised
 
Datasets for semi-supervised applications.
 
KEEL - semi-supervised (http://sci2s.ugr.es/keel/semisupervised.php) - Datasets for semi-supervised experiments.
KEEL - semi-supervised (http://sci2s.ugr.es/keel/semisupervised.php) - Datasets for semi-supervised experiments.
 
Regression
 
Datasets for regression applications.
 
KEEL - regression (http://sci2s.ugr.es/keel/category.php?cat=reg) - Datasets for regression experiments.
 
 
Time series
 
Datasets for time-series problems.
 
KEEL - time-series (http://sci2s.ugr.es/keel/category.php?cat=reg) - Datasets for time-series experiments.
 
Face Recognition
 
Face Recognition datasets.
 
JAFFE (http://kasrl.org/jaffe.html) - The Japanese Female Facial Expression (JAFFE) Database.
Carnegie Mellon (http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-8/faceimages/) - Datasets from theo-8 projects at Carnegie Mellon University.
Yale Face Database (http://vision.ucsd.edu/content/yale-face-database) - Datasets for facial expression (happy, sad, angry...) recognition.
Cohn-Kanade (http://www.pitt.edu/~emotion/ck-spread.htm) - The Cohn-Kanade AU-Coded Facial Expression Database is for research in automatic facial image analysis and synthesis and for
perceptual studies.
AR face Database (http://www2.ece.ohio-state.edu/~aleix/ARdatabase.html) - Different facial expressions, illumination conditions and occlusions.
Face Detection CBCL (http://cbcl.mit.edu/software-datasets/FaceData2.html) - Face Detection Data from MIT.
Face Recognition LFW (http://vis-www.cs.umass.edu/lfw/) - Face Recognition from UMASS.
Face Recognition ORL (http://www.cl.cam.ac.uk/research/dtg/attarchive/facedatabase.html) - Face Recognition from AT&T.
 
 
Image Processing
 
Image Processing.
 
Microsoft - Salient Object Database (http://research.microsoft.com/en-us/um/people/jiansun/SalientObject/salient_object.htm) - MSRA Salient Object Database.
IVRG - Salient Object Database (http://ivrgwww.epfl.ch/supplementary_material/RK_CVPR09/) - Frequency-tuned Salient Region Detection.
ICDAR - Robust Reading (http://dag.cvc.uab.es/icdar2013competition/?com=introduction) - Robust Reading Competition.
Brodatz - Texture Recognition (http://www.ux.uis.no/~tranden/brodatz.html) - Texture Recognition.
Vistex - Texture Recognition (http://vismod.media.mit.edu/vismod/imagery/VisionTexture/vistex.html) - Texture Recognition.
Caltech - Object Categorization (http://www.vision.caltech.edu/Image_Datasets/Caltech101/) - Object Categorization from Caltech101.
Marcel - Gesture Recognition (http://www.idiap.ch/resource/gestures/) - Gesture Recognition from Marcel.
RPPDI - Gesture Recognition (http://rppdi.ecomp.poli.br/gesture/database/) - Gesture Recognition from RPPDI.
 
 
Handwriting Recognition
 
Handwriting Recognition
 
MNIST - Database of Handwritten Digits (http://yann.lecun.com/exdb/mnist/) - THE MNIST DATABASE of handwritten digits.
 
Text Classification
 
Text Classification
 
20 Newsgroups (http://qwone.com/~jason/20Newsgroups/) - The 20 newsgroups text dataset.
Reuters-21578 (https://archive.ics.uci.edu/ml/datasets/Reuters-21578+Text+Categorization+Collection) - Reuters-21578 Text Categorization Collection Data Set