2025-07-18 23:13:11 +02:00

awesome-datasets
================
A curated list of awesome datasets for papers/experiments/validation.
- Awesome Datasets (#awesome-datasets)
- **Classification** (#classification) 
- **Semi-Supervised** (#semi-supervised) 
- **Regression** (#regression) 
- **Time-Series** (#time-series) 
- **Unsupervised (clustering)** (#unsupervised) 
- **Face Recognition** (#face-recognition) 
- **Image Processing** (#image-processing) 
- **Handwriting Recognition** (#handwriting-recognition)
- **Text Classification** (#text-classification) 
Classification
Datasets for classification.
⟡ KEEL - General (http://sci2s.ugr.es/keel/category.php?cat=clas) - General classification datasets.
⟡ KEEL - Missing-values (http://sci2s.ugr.es/keel/missing.php) - Datasets with missing values.
⟡ KEEL - Imbalanced datasets (http://sci2s.ugr.es/keel/imbalanced.php) - Imbalanced datasets for classification.
⟡ KEEL - Multi-label (http://sci2s.ugr.es/keel/multilabel.php) - Multi-label datasets.
⟡ KEEL - Class noise (http://sci2s.ugr.es/keel/classNoise.php) - Datasets with class noise.
⟡ KEEL - Attribute noise (http://sci2s.ugr.es/keel/attributeNoise.php) - Datasets with attribute noise.
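The KEEL downloads above are typically plain-text `.dat` files with an ARFF-like header (`@relation`, `@attribute`, `@inputs`, `@outputs`) followed by comma-separated rows after `@data`. A minimal sketch of reading one, assuming that layout; the `parse_keel` helper and the toy sample are hypothetical illustrations, not part of KEEL, and missing-value markers are not handled:

```python
import re

def parse_keel(text):
    """Parse KEEL-style text into (attribute_names, rows of strings).

    Assumes an ARFF-like header and comma-separated data rows.
    """
    names, rows, in_data = [], [], False
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("%"):
            continue  # skip blank lines and comments
        lowered = line.lower()
        if in_data:
            rows.append([field.strip() for field in line.split(",")])
        elif lowered.startswith("@attribute"):
            rest = line[len("@attribute"):].strip()
            # The name ends at the first whitespace, '{' or '['.
            names.append(re.split(r"[\s{\[]", rest, maxsplit=1)[0])
        elif lowered.startswith("@data"):
            in_data = True
    return names, rows

# Hypothetical toy file, written in the assumed layout.
SAMPLE = """\
@relation toy
@attribute width real [0.0, 10.0]
@attribute height real [0.0, 10.0]
@attribute class {small, large}
@inputs width, height
@outputs class
@data
1.0, 2.0, small
8.5, 9.1, large
"""

names, rows = parse_keel(SAMPLE)
print(names)
print(rows)
```

Values are kept as strings so that the caller decides how to convert numeric and nominal attributes.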
Semi-Supervised
Datasets for semi-supervised applications.
⟡ KEEL - semi-supervised (http://sci2s.ugr.es/keel/semisupervised.php) - Datasets for semi-supervised experiments.
Regression
Datasets for regression applications.
⟡ KEEL - regression (http://sci2s.ugr.es/keel/category.php?cat=reg) - Datasets for regression experiments.
Time-Series
Datasets for time-series problems.
⟡ KEEL - time-series (http://sci2s.ugr.es/keel/category.php?cat=reg) - Datasets for time-series experiments.
Face Recognition
Datasets for face recognition.
⟡ JAFFE (http://kasrl.org/jaffe.html) - The Japanese Female Facial Expression (JAFFE) Database.
⟡ Carnegie Mellon (http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-8/faceimages/) - Datasets from theo-8 projects at Carnegie Mellon University.
⟡ Yale Face Database (http://vision.ucsd.edu/content/yale-face-database) - Datasets for facial expression (happy, sad, angry...) recognition.
⟡ Cohn-Kanade (http://www.pitt.edu/~emotion/ck-spread.htm) - The Cohn-Kanade AU-Coded Facial Expression Database is for research in automatic facial image analysis and synthesis and for perceptual studies.
⟡ AR face Database (http://www2.ece.ohio-state.edu/~aleix/ARdatabase.html) - Different facial expressions, illumination conditions and occlusions.
⟡ Face Detection CBCL (http://cbcl.mit.edu/software-datasets/FaceData2.html) - Face Detection Data from MIT.
⟡ Face Recognition LFW (http://vis-www.cs.umass.edu/lfw/) - Face Recognition from UMASS.
⟡ Face Recognition ORL (http://www.cl.cam.ac.uk/research/dtg/attarchive/facedatabase.html) - Face Recognition from AT&T.
Image Processing
Datasets for image processing.
⟡ Microsoft - Salient Object Database (http://research.microsoft.com/en-us/um/people/jiansun/SalientObject/salient_object.htm) - MSRA Salient Object Database.
⟡ IVRG - Salient Object Database (http://ivrgwww.epfl.ch/supplementary_material/RK_CVPR09/) - Frequency-tuned Salient Region Detection.
⟡ ICDAR - Robust Reading (http://dag.cvc.uab.es/icdar2013competition/?com=introduction) - Robust Reading Competition.
⟡ Brodatz - Texture Recognition (http://www.ux.uis.no/~tranden/brodatz.html) - Texture Recognition.
⟡ Vistex - Texture Recognition (http://vismod.media.mit.edu/vismod/imagery/VisionTexture/vistex.html) - Texture Recognition.
⟡ Caltech - Object Categorization (http://www.vision.caltech.edu/Image_Datasets/Caltech101/) - Object Categorization from Caltech101.
⟡ Marcel - Gesture Recognition (http://www.idiap.ch/resource/gestures/) - Gesture Recognition from Marcel.
⟡ RPPDI - Gesture Recognition (http://rppdi.ecomp.poli.br/gesture/database/) - Gesture Recognition from RPPDI.
Handwriting Recognition
Datasets for handwriting recognition.
⟡ MNIST - Database of Handwritten Digits (http://yann.lecun.com/exdb/mnist/) - The MNIST database of handwritten digits.
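MNIST is distributed in the IDX binary format: two zero bytes, a one-byte type code (`0x08` for unsigned bytes), a one-byte dimension count, then one big-endian 32-bit size per dimension, followed by the raw values. A minimal sketch of decoding such a buffer, assuming unsigned-byte data only; the `parse_idx` helper and the synthetic buffers are hypothetical illustrations, and the real files are gzip-compressed, so they would need to be decompressed (for example with `gzip.open`) first:

```python
import struct

def parse_idx(data):
    """Decode an unsigned-byte IDX buffer into (dims, flat list of values)."""
    zero, dtype, ndim = struct.unpack(">HBB", data[:4])
    if zero != 0 or dtype != 0x08:
        raise ValueError("not an unsigned-byte IDX buffer")
    # One big-endian 32-bit size per dimension follows the magic number.
    dims = struct.unpack(">" + "I" * ndim, data[4:4 + 4 * ndim])
    offset = 4 + 4 * ndim
    count = 1
    for size in dims:
        count *= size
    values = list(data[offset:offset + count])
    if len(values) != count:
        raise ValueError("buffer is shorter than its header claims")
    return dims, values

# Synthetic label buffer: 1 dimension, 3 labels.
label_bytes = struct.pack(">HBBI", 0, 8, 1, 3) + bytes([7, 2, 1])
label_dims, labels = parse_idx(label_bytes)

# Synthetic image buffer: 1 image of 2 x 2 pixels.
image_bytes = struct.pack(">HBBIII", 0, 8, 3, 1, 2, 2) + bytes([0, 255, 128, 64])
image_dims, pixels = parse_idx(image_bytes)

print(label_dims, labels)
print(image_dims, pixels)
```

The pixel values come back as a flat list in row-major order; reshaping by `dims` is left to the caller.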
Text Classification
Datasets for text classification.
⟡ 20 Newsgroups (http://qwone.com/~jason/20Newsgroups/) - The 20 newsgroups text dataset.
⟡ Reuters-21578 (https://archive.ics.uci.edu/ml/datasets/Reuters-21578+Text+Categorization+Collection) - Reuters-21578 Text Categorization Collection Data Set.
GitHub: https://github.com/viisar/awesome-datasets