Awesome Computer Vision: 
A curated list of awesome computer vision resources, inspired by awesome-php.
For a list people in computer vision listed with their academic
genealogy, please visit here
Contributing
Please feel free to send me pull
requests or email (jbhuang@vt.edu) to add links.
Table of Contents
Awesome Lists
Books
Computer Vision
- Computer Vision:
Models, Learning, and Inference - Simon J. D. Prince 2012
- Computer Vision: Theory and
Application - Rick Szeliski 2010
- Computer
Vision: A Modern Approach (2nd edition) - David Forsyth and Jean
Ponce 2011
- Multiple View
Geometry in Computer Vision - Richard Hartley and Andrew Zisserman
2004
- Computer
Vision - Linda G. Shapiro 2001
- Vision
Science: Photons to Phenomenology - Stephen E. Palmer 1999
- Visual
Object Recognition synthesis lecture - Kristen Grauman and Bastian
Leibe 2011
- Computer Vision for Visual
Effects - Richard J. Radke, 2012
- High
dynamic range imaging: acquisition, display, and image-based
lighting - Reinhard, E., Heidrich, W., Debevec, P., Pattanaik, S.,
Ward, G., Myszkowski, K 2010
- Numerical
Algorithms: Methods for Computer Vision, Machine Learning, and
Graphics - Justin Solomon 2015
- Image
Processing and Analysis - Stan Birchfield 2018
- Computer Vision,
From 3D Reconstruction to Recognition - Silvio Savarese 2018
OpenCV Programming
Machine Learning
Fundamentals
Courses
Computer Vision
Computational Photography
Machine Learning and
Statistical Learning
- Machine
Learning - Andrew Ng (Stanford University)
- Learning from
Data - Yaser S. Abu-Mostafa (Caltech)
- Statistical
Learning - Trevor Hastie and Rob Tibshirani (Stanford
University)
- Statistical Learning
Theory and Applications - Tomaso Poggio, Lorenzo Rosasco, Carlo
Ciliberto, Charlie Frogner, Georgios Evangelopoulos, Ben Deen (MIT)
- Statistical
Learning - Genevera Allen (Rice University)
- Practical
Machine Learning - Michael Jordan (UC Berkeley)
- Course
on Information Theory, Pattern Recognition, and Neural Networks -
David MacKay (University of Cambridge)
- Methods for
Applied Statistics: Unsupervised Learning - Lester Mackey
(Stanford)
- Machine
Learning - Andrew Zisserman (University of Oxford)
- Intro
to Machine Learning - Sebastian Thrun (Stanford University)
- Machine
Learning - Charles Isbell, Michael Littman (Georgia Tech)
- (Convolutional) Neural Networks
for Visual Recognition - Fei-Fei Li, Andrej Karphaty, Justin Johnson
(Stanford University)
- Machine
Learning for Computer Vision - Rudolph Triebel (TU Munich)
Optimization
Papers
Conference papers on the web
Survey Papers
## Pre-trained Computer Vision Models * List of
Computer Vision models These models are trained on custom
objects
Tutorials and talks
Computer Vision
Recent Conference Talks
3D Computer Vision
Internet Vision
Computational Photography
Learning and Vision
Object Recognition
Graphical Models
Machine Learning
Optimization
Deep Learning
Software
External Resource Links
General Purpose
Computer Vision Library
Multiple-view Computer
Vision
Feature Detection and
Extraction
- VLFeat
- SIFT
- David G. Lowe, “Distinctive image features from scale-invariant
keypoints,” International Journal of Computer Vision, 60, 2 (2004),
pp. 91-110.
- SIFT++
- BRISK
- Stefan Leutenegger, Margarita Chli and Roland Siegwart, “BRISK:
Binary Robust Invariant Scalable Keypoints”, ICCV 2011
- SURF
- Herbert Bay, Andreas Ess, Tinne Tuytelaars, Luc Van Gool, “SURF:
Speeded Up Robust Features”, Computer Vision and Image Understanding
(CVIU), Vol. 110, No. 3, pp. 346–359, 2008
- FREAK
- A. Alahi, R. Ortiz, and P. Vandergheynst, “FREAK: Fast Retina
Keypoint”, CVPR 2012
- AKAZE
- Pablo F. Alcantarilla, Adrien Bartoli and Andrew J. Davison, “KAZE
Features”, ECCV 2012
- Local Binary
Patterns
High Dynamic Range Imaging
Semantic Segmentation
Low-level Vision
Stereo Vision
Optical Flow
Image Denoising
BM3D, KSVD,
Super-resolution
- Multi-frame
image super-resolution
- Pickup, L. C. Machine Learning in Multi-frame Image
Super-resolution, PhD thesis 2008
- Markov
Random Fields for Super-Resolution
- W. T Freeman and C. Liu. Markov Random Fields for Super-resolution
and Texture Synthesis. In A. Blake, P. Kohli, and C. Rother, eds.,
Advances in Markov Random Fields for Vision and Image Processing,
Chapter 10. MIT Press, 2011
- Sparse
regression and natural image prior
- K. I. Kim and Y. Kwon, “Single-image super-resolution using sparse
regression and natural image prior”, IEEE Trans. Pattern Analysis and
Machine Intelligence, vol. 32, no. 6, pp. 1127-1133, 2010.
- Single-Image
Super Resolution via a Statistical Model
- T. Peleg and M. Elad, A Statistical Prediction Model Based on Sparse
Representations for Single Image Super-Resolution, IEEE Transactions on
Image Processing, Vol. 23, No. 6, Pages 2569-2582, June 2014
- Sparse
Coding for Super-Resolution
- R. Zeyde, M. Elad, and M. Protter On Single Image Scale-Up using
Sparse-Representations, Curves & Surfaces, Avignon-France, June
24-30, 2010 (appears also in Lecture-Notes-on-Computer-Science -
LNCS).
- Patch-wise
Sparse Recovery
- Jianchao Yang, John Wright, Thomas Huang, and Yi Ma. Image
super-resolution via sparse representation. IEEE Transactions on Image
Processing (TIP), vol. 19, issue 11, 2010.
- Neighbor
embedding
- H. Chang, D.Y. Yeung, Y. Xiong. Super-resolution through neighbor
embedding. Proceedings of the IEEE Computer Society Conference on
Computer Vision and Pattern Recognition (CVPR), vol.1, pp.275-282,
Washington, DC, USA, 27 June - 2 July 2004.
- Deformable
Patches
- Yu Zhu, Yanning Zhang and Alan Yuille, Single Image Super-resolution
using Deformable Patches, CVPR 2014
- SRCNN
- Chao Dong, Chen Change Loy, Kaiming He, Xiaoou Tang, Learning a Deep
Convolutional Network for Image Super-Resolution, in ECCV 2014
- A+:
Adjusted Anchored Neighborhood Regression
- R. Timofte, V. De Smet, and L. Van Gool. A+: Adjusted Anchored
Neighborhood Regression for Fast Super-Resolution, ACCV 2014
- Transformed
Self-Exemplars
- Jia-Bin Huang, Abhishek Singh, and Narendra Ahuja, Single Image
Super-Resolution using Transformed Self-Exemplars, IEEE Conference on
Computer Vision and Pattern Recognition, 2015
Image Deblurring
Non-blind deconvolution * Spatially
variant non-blind deconvolution * Handling
Outliers in Non-blind Image Deconvolution * Hyper-Laplacian
Priors * From
Learning Models of Natural Image Patches to Whole Image Restoration
* Deep Convolutional Neural
Network for Image Deconvolution * Neural
Deconvolution
Blind deconvolution * Removing
Camera Shake From A Single Photograph * High-quality
motion deblurring from a single image * Two-Phase
Kernel Estimation for Robust Motion Deblurring * Blur
kernel estimation using the radon transform * Fast
motion deblurring * Blind
Deconvolution Using a Normalized Sparsity Measure * Blur-kernel
estimation from spectral irregularities * Efficient
marginal likelihood optimization in blind deconvolution * Unnatural L0
Sparse Representation for Natural Image Deblurring * Edge-based
Blur Kernel Estimation Using Patch Priors * Blind
Deblurring Using Internal Patch Recurrence
Non-uniform Deblurring * Non-uniform
Deblurring for Shaken Images * Single
Image Deblurring Using Motion Density Functions * Image
Deblurring using Inertial Measurement Sensors * Fast
Removal of Non-uniform Camera Shake
Image Completion
Image Retargeting
Alpha Matting
Image Pyramid
Edge-preserving image
processing
Intrinsic Images
Contour Detection and
Image Segmentation
Interactive Image
Segmentation
Video Segmentation
Camera calibration
Simultaneous localization
and mapping
Tracking/Odometry:
Graph Optimization:
Loop Closure:
Localization & Mapping:
Single-view Spatial
Understanding
Object Detection
Nearest Neighbor Search
General purpose nearest
neighbor search
Nearest Neighbor Field
Estimation
Visual Tracking
Saliency Detection
Attributes
Action Reconition
Egocentric cameras
Human-in-the-loop systems
Image Captioning
Optimization
- Ceres Solver - Nonlinear
least-square problem and unconstrained optimization solver
- NLopt-
Nonlinear least-square problem and unconstrained optimization
solver
- OpenGM -
Factor graph based discrete optimization and inference solver
- GTSAM -
Factor graph based lease-square optimization solver
Deep Learning
Machine Learning
Datasets
External Dataset Link
Collection
Low-level Vision
Stereo Vision
Optical Flow
Video Object Segmentation
Change Detection
Image Super-resolutions
Intrinsic Images
Material Recognition
Multi-view Reconsturction
Saliency Detection
Visual Tracking
Visual Surveillance
Saliency Detection
Change detection
Visual Recognition
Image Classification
Self-supervised Learning
Scene Recognition
Object Detection
Semantic labeling
Multi-view Object Detection
Fine-grained Visual
Recognition
Pedestrian Detection
Action Recognition
Image-based
Video-based
Image Deblurring
Image Captioning
Scene Understanding
# SUN RGB-D - A RGB-D
Scene Understanding Benchmark Suite # NYU depth
v2 - Indoor Segmentation and Support Inference from RGBD Images
Aerial images
# Aerial
Image Segmentation - Learning Aerial Image Segmentation From Online
Maps
Resources for students
Resource link collection
Writing
Presentation
Research
Time Management
Blogs
Links
Licenses
License

To the extent possible under law, Jia-Bin Huang has waived all copyright
and related or neighboring rights to this work.
computervision.md
Github