awesome-awesomeness/html/mlops.html

<h1 id="awesome-mlops-awesome">Awesome MLOps <a
href="https://github.com/sindresorhus/awesome"><img
src="https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg"
alt="Awesome" /></a></h1>
<p>A curated list of awesome MLOps tools.</p>
<p>Inspired by <a
href="https://github.com/vinta/awesome-python">awesome-python</a>.</p>
<ul>
<li><a href="#awesome-mlops">Awesome MLOps</a>
<ul>
<li><a href="#automl">AutoML</a></li>
<li><a href="#cicd-for-machine-learning">CI/CD for Machine
Learning</a></li>
<li><a href="#cron-job-monitoring">Cron Job Monitoring</a></li>
<li><a href="#data-catalog">Data Catalog</a></li>
<li><a href="#data-enrichment">Data Enrichment</a></li>
<li><a href="#data-exploration">Data Exploration</a></li>
<li><a href="#data-management">Data Management</a></li>
<li><a href="#data-processing">Data Processing</a></li>
<li><a href="#data-validation">Data Validation</a></li>
<li><a href="#data-visualization">Data Visualization</a></li>
<li><a href="#drift-detection">Drift Detection</a></li>
<li><a href="#feature-engineering">Feature Engineering</a></li>
<li><a href="#feature-store">Feature Store</a></li>
<li><a href="#hyperparameter-tuning">Hyperparameter Tuning</a></li>
<li><a href="#knowledge-sharing">Knowledge Sharing</a></li>
<li><a href="#machine-learning-platform">Machine Learning
Platform</a></li>
<li><a href="#model-fairness-and-privacy">Model Fairness and
Privacy</a></li>
<li><a href="#model-interpretability">Model Interpretability</a></li>
<li><a href="#model-lifecycle">Model Lifecycle</a></li>
<li><a href="#model-serving">Model Serving</a></li>
<li><a href="#model-testing--validation">Model Testing &amp;
Validation</a></li>
<li><a href="#optimization-tools">Optimization Tools</a></li>
<li><a href="#simplification-tools">Simplification Tools</a></li>
<li><a href="#visual-analysis-and-debugging">Visual Analysis and
Debugging</a></li>
<li><a href="#workflow-tools">Workflow Tools</a></li>
</ul></li>
<li><a href="#resources">Resources</a>
<ul>
<li><a href="#articles">Articles</a></li>
<li><a href="#books">Books</a></li>
<li><a href="#events">Events</a></li>
<li><a href="#other-lists">Other Lists</a></li>
<li><a href="#podcasts">Podcasts</a></li>
<li><a href="#slack">Slack</a></li>
<li><a href="#websites">Websites</a></li>
</ul></li>
<li><a href="#contributing">Contributing</a></li>
</ul>
<hr />
<h2 id="automl">AutoML</h2>
<p><em>Tools for performing AutoML.</em></p>
<ul>
<li><a href="https://github.com/awslabs/autogluon">AutoGluon</a> -
Automated machine learning for image, text, tabular, time-series, and
multi-modal data.</li>
<li><a href="https://github.com/keras-team/autokeras">AutoKeras</a> -
AutoKeras goal is to make machine learning accessible for everyone.</li>
<li><a href="https://github.com/automl/Auto-PyTorch">AutoPyTorch</a> -
Automatic architecture search and hyperparameter optimization for
PyTorch.</li>
<li><a href="https://github.com/automl/auto-sklearn">AutoSKLearn</a> -
Automated machine learning toolkit and a drop-in replacement for a
scikit-learn estimator.</li>
<li><a href="https://github.com/alteryx/evalml">EvalML</a> - A library
that builds, optimizes, and evaluates ML pipelines using domain-specific
functions.</li>
<li><a href="https://github.com/microsoft/FLAML">FLAML</a> - Finds
accurate ML models automatically, efficiently and economically.</li>
<li><a href="https://h2o.ai/platform/h2o-automl">H2O AutoML</a> -
Automates ML workflow, which includes automatic training and tuning of
models.</li>
<li><a href="https://github.com/mindsdb/mindsdb">MindsDB</a> - AI layer
for databases that allows you to effortlessly develop, train and deploy
ML models.</li>
<li><a href="https://github.com/AxeldeRomblay/MLBox">MLBox</a> - MLBox
is a powerful Automated Machine Learning python library.</li>
<li><a href="https://github.com/google/model_search">Model Search</a> -
Framework that implements AutoML algorithms for model architecture
search at scale.</li>
<li><a href="https://github.com/microsoft/nni">NNI</a> - An open source
AutoML toolkit for automate machine learning lifecycle.</li>
</ul>
<h2 id="cicd-for-machine-learning">CI/CD for Machine Learning</h2>
<p><em>Tools for performing CI/CD for Machine Learning.</em></p>
<ul>
<li><a href="https://github.com/allegroai/clearml">ClearML</a> -
Auto-Magical CI/CD to streamline your ML workflow.</li>
<li><a href="https://github.com/iterative/cml">CML</a> - Open-source
library for implementing CI/CD in machine learning projects.</li>
<li><a href="https://github.com/jozu-ai/kitops">KitOps</a> – Open source
MLOps project that eases model handoffs between data scientist and
DevOps.</li>
</ul>
<h2 id="cron-job-monitoring">Cron Job Monitoring</h2>
<p><em>Tools for monitoring cron jobs (recurring jobs).</em></p>
<ul>
<li><a href="https://cronitor.io/cron-job-monitoring">Cronitor</a> -
Monitor any cron job or scheduled task.</li>
<li><a href="https://healthchecks.io/">HealthchecksIO</a> - Simple and
effective cron job monitoring.</li>
</ul>
<h2 id="data-catalog">Data Catalog</h2>
<p><em>Tools for data cataloging.</em></p>
<ul>
<li><a href="https://www.amundsen.io/">Amundsen</a> - Data discovery and
metadata engine for improving the productivity when interacting with
data.</li>
<li><a href="https://atlas.apache.org">Apache Atlas</a> - Provides open
metadata management and governance capabilities to build a data
catalog.</li>
<li><a href="https://github.com/ckan/ckan">CKAN</a> - Open-source DMS
(data management system) for powering data hubs and data portals.</li>
<li><a href="https://github.com/linkedin/datahub">DataHub</a> -
LinkedIn’s generalized metadata search &amp; discovery tool.</li>
<li><a href="https://github.com/magda-io/magda">Magda</a> - A federated,
open-source data catalog for all your big data and small data.</li>
<li><a href="https://github.com/Netflix/metacat">Metacat</a> - Unified
metadata exploration API service for Hive, RDS, Teradata, Redshift, S3
and Cassandra.</li>
<li><a href="https://open-metadata.org/">OpenMetadata</a> - A Single
place to discover, collaborate and get your data right.</li>
</ul>
<h2 id="data-enrichment">Data Enrichment</h2>
<p><em>Tools and libraries for data enrichment.</em></p>
<ul>
<li><a href="https://github.com/snorkel-team/snorkel">Snorkel</a> - A
system for quickly generating training data with weak supervision.</li>
<li><a href="https://github.com/upgini/upgini">Upgini</a> - Enriches
training datasets with features from public and community shared data
sources.</li>
</ul>
<h2 id="data-exploration">Data Exploration</h2>
<p><em>Tools for performing data exploration.</em></p>
<ul>
<li><a href="https://zeppelin.apache.org/">Apache Zeppelin</a> - Enables
data-driven, interactive data analytics and collaborative
documents.</li>
<li><a href="https://github.com/tkrabel/bamboolib">BambooLib</a> - An
intuitive GUI for Pandas DataFrames.</li>
<li><a href="https://github.com/sfu-db/dataprep">DataPrep</a> - Collect,
clean and visualize your data in Python.</li>
<li><a href="https://colab.research.google.com">Google Colab</a> -
Hosted Jupyter notebook service that requires no setup to use.</li>
<li><a href="https://jupyter.org/">Jupyter Notebook</a> - Web-based
notebook environment for interactive computing.</li>
<li><a href="https://jupyterlab.readthedocs.io">JupyterLab</a> - The
next-generation user interface for Project Jupyter.</li>
<li><a href="https://github.com/mwouts/jupytext">Jupytext</a> - Jupyter
Notebooks as Markdown Documents, Julia, Python or R scripts.</li>
<li><a href="https://github.com/ydataai/pandas-profiling">Pandas
Profiling</a> - Create HTML profiling reports from pandas DataFrame
objects.</li>
<li><a href="https://polynote.org/">Polynote</a> - The polyglot notebook
with first-class Scala support.</li>
</ul>
<h2 id="data-management">Data Management</h2>
<p><em>Tools for performing data management.</em></p>
<ul>
<li><a href="https://www.arrikto.com/">Arrikto</a> - Dead simple, ultra
fast storage for the hybrid Kubernetes world.</li>
<li><a href="https://github.com/BlazingDB/blazingsql">BlazingSQL</a> - A
lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS
cuDF.</li>
<li><a href="https://github.com/delta-io/delta">Delta Lake</a> - Storage
layer that brings scalable, ACID transactions to Apache Spark and other
engines.</li>
<li><a href="https://github.com/dolthub/dolt">Dolt</a> - SQL database
that you can fork, clone, branch, merge, push and pull just like a git
repository.</li>
<li><a href="https://github.com/kevin-hanselman/dud">Dud</a> - A
lightweight CLI tool for versioning data alongside source code and
building data pipelines.</li>
<li><a href="https://dvc.org/">DVC</a> - Management and versioning of
datasets and machine learning models.</li>
<li><a href="https://git-lfs.github.com">Git LFS</a> - An open source
Git extension for versioning large files.</li>
<li><a href="https://github.com/activeloopai/Hub">Hub</a> - A dataset
format for creating, storing, and collaborating on AI datasets of any
size.</li>
<li><a href="https://github.com/intake/intake">Intake</a> - A
lightweight set of tools for loading and sharing data in data science
projects.</li>
<li><a href="https://github.com/treeverse/lakeFS">lakeFS</a> -
Repeatable, atomic and versioned data lake on top of object
storage.</li>
<li><a href="https://github.com/MarquezProject/marquez">Marquez</a> -
Collect, aggregate, and visualize a data ecosystem’s metadata.</li>
<li><a href="https://github.com/milvus-io/milvus/">Milvus</a> - An open
source embedding vector similarity search engine powered by Faiss,
NMSLIB and Annoy.</li>
<li><a href="https://www.pinecone.io">Pinecone</a> - Managed and
distributed vector similarity search used with a lightweight SDK.</li>
<li><a href="https://github.com/qdrant/qdrant">Qdrant</a> - An open
source vector similarity search engine with extended filtering
support.</li>
<li><a href="https://github.com/quiltdata/quilt">Quilt</a> - A
self-organizing data hub with S3 support.</li>
</ul>
<h2 id="data-processing">Data Processing</h2>
<p><em>Tools related to data processing and data pipelines.</em></p>
<ul>
<li><a href="https://airflow.apache.org/">Airflow</a> - Platform to
programmatically author, schedule, and monitor workflows.</li>
<li><a href="https://github.com/azkaban/azkaban">Azkaban</a> - Batch
workflow job scheduler created at LinkedIn to run Hadoop jobs.</li>
<li><a href="https://github.com/dagster-io/dagster">Dagster</a> - A data
orchestrator for machine learning, analytics, and ETL.</li>
<li><a href="https://hadoop.apache.org/">Hadoop</a> - Framework that
allows for the distributed processing of large data sets across
clusters.</li>
<li><a href="https://github.com/OpenRefine/OpenRefine">OpenRefine</a> -
Power tool for working with messy data and improving it.</li>
<li><a href="https://spark.apache.org/">Spark</a> - Unified analytics
engine for large-scale data processing.</li>
</ul>
<h2 id="data-validation">Data Validation</h2>
<p><em>Tools related to data validation.</em></p>
<ul>
<li><a href="https://github.com/pyeve/cerberus">Cerberus</a> -
Lightweight, extensible data validation library for Python.</li>
<li><a href="https://github.com/cleanlab/cleanlab">Cleanlab</a> - Python
library for data-centric AI and machine learning with messy, real-world
data and labels.</li>
<li><a href="https://greatexpectations.io">Great Expectations</a> - A
Python data validation framework that allows to test your data against
datasets.</li>
<li><a href="https://json-schema.org/">JSON Schema</a> - A vocabulary
that allows you to annotate and validate JSON documents.</li>
<li><a href="https://github.com/tensorflow/data-validation">TFDV</a> -
An library for exploring and validating machine learning data.</li>
</ul>
<h2 id="data-visualization">Data Visualization</h2>
<p><em>Tools for data visualization, reports and dashboards.</em></p>
<ul>
<li><a href="https://count.co">Count</a> - SQL/drag-and-drop querying
and visualisation tool based on notebooks.</li>
<li><a href="https://github.com/plotly/dash">Dash</a> - Analytical Web
Apps for Python, R, Julia, and Jupyter.</li>
<li><a href="https://datastudio.google.com">Data Studio</a> - Reporting
solution for power users who want to go beyond the data and dashboards
of GA.</li>
<li><a href="https://github.com/PAIR-code/facets">Facets</a> -
Visualizations for understanding and analyzing machine learning
datasets.</li>
<li><a href="https://grafana.com/grafana/">Grafana</a> - Multi-platform
open source analytics and interactive visualization web
application.</li>
<li><a href="https://github.com/lux-org/lux">Lux</a> - Fast and easy
data exploration by automating the visualization and data analysis
process.</li>
<li><a href="https://www.metabase.com/">Metabase</a> - The simplest,
fastest way to get business intelligence and analytics to everyone.</li>
<li><a href="https://redash.io/">Redash</a> - Connect to any data
source, easily visualize, dashboard and share your data.</li>
<li><a href="https://github.com/CloudOrc/SolidUI">SolidUI</a> -
AI-generated visualization prototyping and editing platform, support 2D
and 3D models.</li>
<li><a href="https://superset.incubator.apache.org/">Superset</a> -
Modern, enterprise-ready business intelligence web application.</li>
<li><a href="https://www.tableau.com">Tableau</a> - Powerful and fastest
growing data visualization tool used in the business intelligence
industry.</li>
</ul>
<h2 id="drift-detection">Drift Detection</h2>
<p><em>Tools and libraries related to drift detection.</em></p>
<ul>
<li><a href="https://github.com/SeldonIO/alibi-detect">Alibi Detect</a>
- An open source Python library focused on outlier, adversarial and
drift detection.</li>
<li><a href="https://github.com/IFCA/frouros">Frouros</a> - An open
source Python library for drift detection in machine learning
systems.</li>
<li><a href="https://github.com/torchdrift/torchdrift/">TorchDrift</a> -
A data and concept drift library for PyTorch.</li>
</ul>
<h2 id="feature-engineering">Feature Engineering</h2>
<p><em>Tools and libraries related to feature engineering.</em></p>
<ul>
<li><a href="https://github.com/feature-engine/feature_engine">Feature
Engine</a> - Feature engineering package with SKlearn like
functionality.</li>
<li><a href="https://github.com/alteryx/featuretools">Featuretools</a> -
Python library for automated feature engineering.</li>
<li><a href="https://github.com/blue-yonder/tsfresh">TSFresh</a> -
Python library for automatic extraction of relevant features from time
series.</li>
</ul>
<h2 id="feature-store">Feature Store</h2>
<p><em>Feature store tools for data serving.</em></p>
<ul>
<li><a href="https://github.com/quintoandar/butterfree">Butterfree</a> -
A tool for building feature stores. Transform your raw data into
beautiful features.</li>
<li><a href="https://github.com/bytehub-ai/bytehub">ByteHub</a> - An
easy-to-use feature store. Optimized for time-series data.</li>
<li><a href="https://feast.dev/">Feast</a> - End-to-end open source
feature store for machine learning.</li>
<li><a href="https://github.com/linkedin/feathr">Feathr</a> - An
enterprise-grade, high performance feature store.</li>
<li><a href="https://github.com/featureform/featureform">Featureform</a>
- A Virtual Feature Store. Turn your existing data infrastructure into a
feature store.</li>
<li><a href="https://www.tecton.ai/">Tecton</a> - A fully-managed
feature platform built to orchestrate the complete lifecycle of
features.</li>
</ul>
<h2 id="hyperparameter-tuning">Hyperparameter Tuning</h2>
<p><em>Tools and libraries to perform hyperparameter tuning.</em></p>
<ul>
<li><a href="https://github.com/tobegit3hub/advisor">Advisor</a> -
Open-source implementation of Google Vizier for hyper parameters
tuning.</li>
<li><a href="https://github.com/maxpumperla/hyperas">Hyperas</a> - A
very simple wrapper for convenient hyperparameter optimization.</li>
<li><a href="https://github.com/hyperopt/hyperopt">Hyperopt</a> -
Distributed Asynchronous Hyperparameter Optimization in Python.</li>
<li><a href="https://github.com/kubeflow/katib">Katib</a> -
Kubernetes-based system for hyperparameter tuning and neural
architecture search.</li>
<li><a href="https://github.com/keras-team/keras-tuner">KerasTuner</a> -
Easy-to-use, scalable hyperparameter optimization framework.</li>
<li><a href="https://optuna.org/">Optuna</a> - Open source
hyperparameter optimization framework to automate hyperparameter
search.</li>
<li><a href="https://github.com/scikit-optimize/scikit-optimize">Scikit
Optimize</a> - Simple and efficient library to minimize expensive and
noisy black-box functions.</li>
<li><a href="https://github.com/autonomio/talos">Talos</a> -
Hyperparameter Optimization for TensorFlow, Keras and PyTorch.</li>
<li><a href="https://docs.ray.io/en/latest/tune.html">Tune</a> - Python
library for experiment execution and hyperparameter tuning at any
scale.</li>
</ul>
<h2 id="knowledge-sharing">Knowledge Sharing</h2>
<p><em>Tools for sharing knowledge to the entire team/company.</em></p>
<ul>
<li><a href="https://github.com/airbnb/knowledge-repo">Knowledge
Repo</a> - Knowledge sharing platform for data scientists and other
technical professions.</li>
<li><a href="https://kyso.io/">Kyso</a> - One place for data insights so
your entire team can learn from your data.</li>
</ul>
<h2 id="machine-learning-platform">Machine Learning Platform</h2>
<p><em>Complete machine learning platform solutions.</em></p>
<ul>
<li><a href="https://www.veritone.com/aiware/aiware-os/">aiWARE</a> -
aiWARE helps MLOps teams evaluate, deploy, integrate, scale &amp;
monitor ML models.</li>
<li><a href="https://algorithmia.com/">Algorithmia</a> - Securely govern
your machine learning operations with a healthy ML lifecycle.</li>
<li><a href="https://allegro.ai/">Allegro AI</a> - Transform ML/DL
research into products. Faster.</li>
<li><a href="https://bodywork.readthedocs.io/en/latest/">Bodywork</a> -
Deploys machine learning projects developed in Python, to
Kubernetes.</li>
<li><a href="https://cnvrg.io/">CNVRG</a> - An end-to-end machine
learning platform to build and deploy AI models at scale.</li>
<li><a href="https://dagshub.com/">DAGsHub</a> - A platform built on
open source tools for data, model and pipeline management.</li>
<li><a href="https://www.dataiku.com/">Dataiku</a> - Platform
democratizing access to data and enabling enterprises to build their own
path to AI.</li>
<li><a href="https://www.datarobot.com/">DataRobot</a> - AI platform
that democratizes data science and automates the end-to-end ML at
scale.</li>
<li><a href="https://www.dominodatalab.com/">Domino</a> - One place for
your data science tools, apps, results, models, and knowledge.</li>
<li><a href="https://edgeimpulse.com/">Edge Impulse</a> - Platform for
creating, optimizing, and deploying AI/ML algorithms for edge
devices.</li>
<li><a href="https://github.com/tensorchord/envd">envd</a> - Machine
learning development environment for data science and AI/ML engineering
teams.</li>
<li><a href="https://fedml.ai/">FedML</a> - Simplifies the workflow of
federated learning anywhere at any scale.</li>
<li><a href="https://gradient.paperspace.com/">Gradient</a> - Multicloud
CI/CD and MLOps platform for machine learning teams.</li>
<li><a href="https://www.h2o.ai/">H2O</a> - Open source leader in AI
with a mission to democratize AI for everyone.</li>
<li><a href="https://www.hopsworks.ai/">Hopsworks</a> - Open-source
platform for developing and operating machine learning models at
scale.</li>
<li><a href="https://www.iguazio.com/">Iguazio</a> - Data science
platform that automates MLOps with end-to-end machine learning
pipelines.</li>
<li><a href="https://katonic.ai/">Katonic</a> - Automate your cycle of
intelligence with Katonic MLOps Platform.</li>
<li><a href="https://www.knime.com/">Knime</a> - Create and
productionize data science using one easy and intuitive
environment.</li>
<li><a href="https://www.kubeflow.org/">Kubeflow</a> - Making
deployments of ML workflows on Kubernetes simple, portable and
scalable.</li>
<li><a href="https://lynxkite.com/">LynxKite</a> - A complete graph data
science platform for very large graphs and other datasets.</li>
<li><a href="https://github.com/ml-tooling/ml-workspace">ML
Workspace</a> - All-in-one web-based IDE specialized for machine
learning and data science.</li>
<li><a href="https://github.com/MLReef/mlreef">MLReef</a> - Open source
MLOps platform that helps you collaborate, reproduce and share your ML
work.</li>
<li><a href="https://www.modzy.com/">Modzy</a> - Deploy, connect, run,
and monitor machine learning (ML) models in the enterprise and at the
edge.</li>
<li><a href="https://neu.ro">Neu.ro</a> - MLOps platform that integrates
open-source and proprietary tools into client-oriented systems.</li>
<li><a href="https://www.omniml.ai">Omnimizer</a> - Simplifies and
accelerates MLOps by bridging the gap between ML models and edge
hardware.</li>
<li><a href="https://www.pachyderm.com/">Pachyderm</a> - Combines data
lineage with end-to-end pipelines on Kubernetes, engineered for the
enterprise.</li>
<li><a href="https://www.github.com/polyaxon/polyaxon/">Polyaxon</a> - A
platform for reproducible and scalable machine learning and deep
learning on kubernetes.</li>
<li><a href="https://aws.amazon.com/sagemaker/">Sagemaker</a> - Fully
managed service that provides the ability to build, train, and deploy ML
models quickly.</li>
<li><a href="https://www.sas.com/en_us/software/viya.html">SAS Viya</a>
- Cloud native AI, analytic and data management platform that supports
the analytics life cycle.</li>
<li><a href="https://sematic.dev">Sematic</a> - An open-source
end-to-end pipelining tool to go from laptop prototype to cloud in no
time.</li>
<li><a href="https://sigopt.com/">SigOpt</a> - A platform that makes it
easy to track runs, visualize training, and scale hyperparameter
tuning.</li>
<li><a href="https://www.truefoundry.com">TrueFoundry</a> - A
Cloud-native MLOps Platform over Kubernetes to simplify training and
serving of ML Models.</li>
<li><a href="https://valohai.com/">Valohai</a> - Takes you from POC to
production while managing the whole model lifecycle.</li>
</ul>
<h2 id="model-fairness-and-privacy">Model Fairness and Privacy</h2>
<p><em>Tools for performing model fairness and privacy in
production.</em></p>
<ul>
<li><a href="https://github.com/Trusted-AI/AIF360">AIF360</a> - A
comprehensive set of fairness metrics for datasets and machine learning
models.</li>
<li><a href="https://github.com/fairlearn/fairlearn">Fairlearn</a> - A
Python package to assess and improve fairness of machine learning
models.</li>
<li><a href="https://github.com/pytorch/opacus">Opacus</a> - A library
that enables training PyTorch models with differential privacy.</li>
<li><a href="https://github.com/tensorflow/privacy">TensorFlow
Privacy</a> - Library for training machine learning models with privacy
for training data.</li>
</ul>
<h2 id="model-interpretability">Model Interpretability</h2>
<p><em>Tools for performing model
interpretability/explainability.</em></p>
<ul>
<li><a href="https://github.com/SeldonIO/alibi">Alibi</a> - Open-source
Python library enabling ML model inspection and interpretation.</li>
<li><a href="https://github.com/pytorch/captum">Captum</a> - Model
interpretability and understanding library for PyTorch.</li>
<li><a href="https://github.com/eli5-org/eli5">ELI5</a> - Python package
which helps to debug machine learning classifiers and explain their
predictions.</li>
<li><a href="https://github.com/interpretml/interpret">InterpretML</a> -
A toolkit to help understand models and enable responsible machine
learning.</li>
<li><a href="https://github.com/marcotcr/lime">LIME</a> - Explaining the
predictions of any machine learning classifier.</li>
<li><a href="https://github.com/tensorflow/lucid">Lucid</a> - Collection
of infrastructure and tools for research in neural network
interpretability.</li>
<li><a href="https://github.com/iancovert/sage">SAGE</a> - For
calculating global feature importance using Shapley values.</li>
<li><a href="https://github.com/slundberg/shap">SHAP</a> - A game
theoretic approach to explain the output of any machine learning
model.</li>
</ul>
<h2 id="model-lifecycle">Model Lifecycle</h2>
<p><em>Tools for managing model lifecycle (tracking experiments,
parameters and metrics).</em></p>
<ul>
<li><a href="https://github.com/quant-aq/aeromancy">Aeromancy</a> - A
framework for performing reproducible AI and ML for Weights and
Biases.</li>
<li><a href="https://github.com/aimhubio/aim">Aim</a> - A super-easy way
to record, search and compare 1000s of ML training runs.</li>
<li><a href="https://github.com/Oxid15/cascade">Cascade</a> - Library of
ML-Engineering tools for rapid prototyping and experiment
management.</li>
<li><a href="https://github.com/comet-ml/comet-examples">Comet</a> -
Track your datasets, code changes, experimentation history, and
models.</li>
<li><a href="https://guild.ai/">Guild AI</a> - Open source experiment
tracking, pipeline automation, and hyperparameter tuning.</li>
<li><a href="https://github.com/replicate/keepsake">Keepsake</a> -
Version control for machine learning with support to Amazon S3 and
Google Cloud Storage.</li>
<li><a href="https://losswise.com">Losswise</a> - Makes it easy to track
the progress of a machine learning project.</li>
<li><a href="https://mlflow.org/">MLflow</a> - Open source platform for
the machine learning lifecycle.</li>
<li><a href="https://github.com/VertaAI/modeldb/">ModelDB</a> - Open
source ML model versioning, metadata, and experiment management.</li>
<li><a href="https://neptune.ai/">Neptune AI</a> - The most lightweight
experiment management tool that fits any workflow.</li>
<li><a href="https://github.com/IDSIA/sacred">Sacred</a> - A tool to
help you configure, organize, log and reproduce experiments.</li>
<li><a href="https://github.com/wandb/client">Weights and Biases</a> - A
tool for visualizing and tracking your machine learning
experiments.</li>
</ul>
<h2 id="model-serving">Model Serving</h2>
<p><em>Tools for serving models in production.</em></p>
<ul>
<li><a href="https://banana.dev">Banana</a> - Host your ML inference
code on serverless GPUs and integrate it into your app with one line of
code.</li>
<li><a href="https://beam.cloud">Beam</a> - Develop on serverless GPUs,
deploy highly performant APIs, and rapidly prototype ML models.</li>
<li><a href="https://github.com/bentoml/BentoML">BentoML</a> -
Open-source platform for high-performance ML model serving.</li>
<li><a href="https://github.com/ebhy/budgetml">BudgetML</a> - Deploy a
ML inference service on a budget in less than 10 lines of code.</li>
<li><a href="https://github.com/replicate/cog">Cog</a> - Open-source
tool that lets you package ML models in a standard, production-ready
container.</li>
<li><a href="https://www.cortex.dev/">Cortex</a> - Machine learning
model serving infrastructure.</li>
<li><a href="https://docs.geniusrise.ai">Geniusrise</a> - Host inference
APIs, bulk inference and fine tune text, vision, audio and multi-modal
models.</li>
<li><a href="https://github.com/gradio-app/gradio">Gradio</a> - Create
customizable UI components around your models.</li>
<li><a href="https://oracle.github.io/graphpipe">GraphPipe</a> - Machine
learning model deployment made simple.</li>
<li><a
href="https://github.com/Hydrospheredata/hydro-serving">Hydrosphere</a>
- Platform for deploying your Machine Learning to production.</li>
<li><a href="https://github.com/kubeflow/kfserving">KFServing</a> -
Kubernetes custom resource definition for serving ML models on arbitrary
frameworks.</li>
<li><a href="https://github.com/mudler/LocalAI">LocalAI</a> - Drop-in
replacement REST API that’s compatible with OpenAI API specifications
for inferencing.</li>
<li><a href="https://github.com/gojek/merlin">Merlin</a> - A platform
for deploying and serving machine learning models.</li>
<li><a href="https://github.com/iterative/mlem">MLEM</a> - Version and
deploy your ML models following GitOps principles.</li>
<li><a href="https://github.com/ml-tooling/opyrator">Opyrator</a> -
Turns your ML code into microservices with web API, interactive GUI, and
more.</li>
<li><a href="https://github.com/apache/predictionio">PredictionIO</a> -
Event collection, deployment of algorithms, evaluation, querying
predictive results via APIs.</li>
<li><a href="https://quix.io">Quix</a> - Serverless platform for
processing data streams in real-time with machine learning models.</li>
<li><a href="https://github.com/hotg-ai/rune">Rune</a> - Provides
containers to encapsulate and deploy EdgeML pipelines and
applications.</li>
<li><a href="https://www.seldon.io/">Seldon</a> - Take your ML projects
from POC to production with maximum efficiency and minimal risk.</li>
<li><a href="https://github.com/streamlit/streamlit">Streamlit</a> -
Lets you create apps for your ML projects with deceptively simple Python
scripts.</li>
<li><a href="https://www.tensorflow.org/tfx/guide/serving">TensorFlow
Serving</a> - Flexible, high-performance serving system for ML models,
designed for production.</li>
<li><a href="https://github.com/pytorch/serve">TorchServe</a> - A
flexible and easy to use tool for serving PyTorch models.</li>
<li><a href="https://github.com/triton-inference-server/server">Triton
Inference Server</a> - Provides an optimized cloud and edge inferencing
solution.</li>
<li><a href="https://github.com/vespa-engine/vespa">Vespa</a> - Store,
search, organize and make machine-learned inferences over big data at
serving time.</li>
<li><a href="https://wallaroo.ai/">Wallaroo.AI</a> - A platform for
deploying, serving, and optimizing ML models in both cloud and edge
environments.</li>
</ul>
<h2 id="model-testing-validation">Model Testing &amp; Validation</h2>
<p><em>Tools for testing and validating models.</em></p>
<ul>
<li><a href="https://github.com/deepchecks/deepchecks">Deepchecks</a> -
Open-source package for validating ML models &amp; data, with various
checks and suites.</li>
<li><a href="https://github.com/star-whale/starwhale">Starwhale</a> - An
MLOps/LLMOps platform for model building, evaluation, and
fine-tuning.</li>
<li><a href="https://github.com/trubrics/trubrics-sdk">Trubrics</a> -
Validate machine learning with data science and domain expert
feedback.</li>
</ul>
<h2 id="optimization-tools">Optimization Tools</h2>
<p><em>Optimization tools related to model scalability in
production.</em></p>
<ul>
<li><a href="https://github.com/huggingface/accelerate">Accelerate</a> -
A simple way to train and use PyTorch models with multi-GPU, TPU,
mixed-precision.</li>
<li><a href="https://dask.org/">Dask</a> - Provides advanced parallelism
for analytics, enabling performance at scale for the tools you
love.</li>
<li><a href="https://github.com/microsoft/DeepSpeed">DeepSpeed</a> -
Deep learning optimization library that makes distributed training easy,
efficient, and effective.</li>
<li><a href="https://uber.github.io/fiber/">Fiber</a> - Python
distributed computing library for modern computer clusters.</li>
<li><a href="https://github.com/horovod/horovod">Horovod</a> -
Distributed deep learning training framework for TensorFlow, Keras,
PyTorch, and Apache MXNet.</li>
<li><a href="https://mahout.apache.org/">Mahout</a> - Distributed linear
algebra framework and mathematically expressive Scala DSL.</li>
<li><a href="https://spark.apache.org/mllib/">MLlib</a> - Apache Spark’s
scalable machine learning library.</li>
<li><a href="https://github.com/modin-project/modin">Modin</a> - Speed
up your Pandas workflows by changing a single line of code.</li>
<li><a href="https://github.com/nebuly-ai/nebullvm">Nebullvm</a> -
Easy-to-use library to boost AI inference.</li>
<li><a href="https://github.com/nebuly-ai/nos">Nos</a> - Open-source
module for running AI workloads on Kubernetes in an optimized way.</li>
<li><a href="https://github.com/uber/petastorm">Petastorm</a> - Enables
single machine or distributed training and evaluation of deep learning
models.</li>
<li><a href="https://rapids.ai/index.html">Rapids</a> - Gives the
ability to execute end-to-end data science and analytics pipelines
entirely on GPUs.</li>
<li><a href="https://github.com/ray-project/ray">Ray</a> - Fast and
simple framework for building and running distributed applications.</li>
<li><a href="http://singa.apache.org/en/index.html">Singa</a> - Apache
top level project, focusing on distributed training of DL and ML
models.</li>
<li><a href="https://github.com/EpistasisLab/tpot">Tpot</a> - Automated
ML tool that optimizes machine learning pipelines using genetic
programming.</li>
</ul>
<h2 id="simplification-tools">Simplification Tools</h2>
<p><em>Tools related to machine learning simplification and
standardization.</em></p>
<ul>
<li><a href="https://chassisml.io">Chassis</a> - Turns models into
ML-friendly containers that run just about anywhere.</li>
<li><a href="https://github.com/a3data/hermione">Hermione</a> - Help
Data Scientists on setting up more organized codes, in a quicker and
simpler way.</li>
<li><a href="https://github.com/facebookresearch/hydra">Hydra</a> - A
framework for elegantly configuring complex applications.</li>
<li><a href="https://github.com/databricks/koalas">Koalas</a> - Pandas
API on Apache Spark. Makes data scientists more productive when
interacting with big data.</li>
<li><a href="https://github.com/uber/ludwig">Ludwig</a> - Allows users
to train and test deep learning models without the need to write
code.</li>
<li><a href="https://github.com/aporia-ai/mlnotify">MLNotify</a> - No
need to keep checking your training, just one import line and you’ll
know the second it’s done.</li>
<li><a href="https://pycaret.org/">PyCaret</a> - Open source, low-code
machine learning library in Python.</li>
<li><a href="https://github.com/Kenza-AI/sagify">Sagify</a> - A CLI
utility to train and deploy ML/DL models on AWS SageMaker.</li>
<li><a href="https://github.com/ploomber/soopervisor">Soopervisor</a> -
Export ML projects to Kubernetes (Argo workflows), Airflow, AWS Batch,
and SLURM.</li>
<li><a href="https://github.com/ploomber/soorgeon">Soorgeon</a> -
Convert monolithic Jupyter notebooks into maintainable pipelines.</li>
<li><a
href="https://github.com/jrieke/traingenerator">TrainGenerator</a> - A
web app to generate template code for machine learning.</li>
<li><a href="https://github.com/apple/turicreate">Turi Create</a> -
Simplifies the development of custom machine learning models.</li>
</ul>
<h2 id="visual-analysis-and-debugging">Visual Analysis and
Debugging</h2>
<p><em>Tools for performing visual analysis and debugging of ML/DL
models.</em></p>
<ul>
<li><a href="https://www.aporia.com/">Aporia</a> - Observability with
customized monitoring and explainability for ML models.</li>
<li><a href="https://www.arize.com/">Arize</a> - A free end-to-end ML
observability and model monitoring platform.</li>
<li><a href="https://github.com/evidentlyai/evidently">Evidently</a> -
Interactive reports to analyze ML models during validation or production
monitoring.</li>
<li><a href="https://www.fiddler.ai/">Fiddler</a> - Monitor, explain,
and analyze your AI in production.</li>
<li><a href="https://github.com/uber/manifold">Manifold</a> - A
model-agnostic visual debugging tool for machine learning.</li>
<li><a href="https://github.com/NannyML/nannyml">NannyML</a> - Algorithm
capable of fully capturing the impact of data drift on performance.</li>
<li><a href="https://github.com/lutzroeder/netron">Netron</a> -
Visualizer for neural network, deep learning, and machine learning
models.</li>
<li><a href="https://github.com/comet-ml/opik">Opik</a> - Evaluate,
test, and ship LLM applications with a suite of observability
tools.</li>
<li><a href="https://phoenix.arize.com">Phoenix</a> - MLOps in a
Notebook for troubleshooting and fine-tuning generative LLM, CV, and
tabular models.</li>
<li><a
href="https://github.com/radicalbit/radicalbit-ai-monitoring/">Radicalbit</a>
- The open source solution for monitoring your AI models in
production.</li>
<li><a href="https://www.superwise.ai">Superwise</a> - Fully automated,
enterprise-grade model observability in a self-service SaaS
platform.</li>
<li><a href="https://github.com/whylabs/whylogs">Whylogs</a> - The open
source standard for data logging. Enables ML monitoring and
observability.</li>
<li><a
href="https://github.com/DistrictDataLabs/yellowbrick">Yellowbrick</a> -
Visual analysis and diagnostic tools to facilitate machine learning
model selection.</li>
</ul>
<h2 id="workflow-tools">Workflow Tools</h2>
<p><em>Tools and frameworks to create workflows or pipelines in the
machine learning context.</em></p>
<ul>
<li><a href="https://github.com/argoproj/argo">Argo</a> - Open source
container-native workflow engine for orchestrating parallel jobs on
Kubernetes.</li>
<li><a
href="https://www.veritone.com/applications/automate-studio/">Automate
Studio</a> - Rapidly build &amp; deploy AI-powered workflows.</li>
<li><a href="https://github.com/couler-proj/couler">Couler</a> - Unified
interface for constructing and managing workflows on different workflow
engines.</li>
<li><a href="https://github.com/dstackai/dstack">dstack</a> - An
open-core tool to automate data and training workflows.</li>
<li><a href="https://flyte.org/">Flyte</a> - Easy to create concurrent,
scalable, and maintainable workflows for machine learning.</li>
<li><a href="https://github.com/dagworks-inc/hamilton">Hamilton</a> - A
scalable general purpose micro-framework for defining dataflows.</li>
<li><a href="https://github.com/kubeflow-kale/kale">Kale</a> - Aims at
simplifying the Data Science experience of deploying Kubeflow Pipelines
workflows.</li>
<li><a href="https://github.com/quantumblacklabs/kedro">Kedro</a> -
Library that implements software engineering best-practice for data and
ML pipelines.</li>
<li><a href="https://github.com/spotify/luigi">Luigi</a> - Python module
that helps you build complex pipelines of batch jobs.</li>
<li><a href="https://metaflow.org/">Metaflow</a> - Human-friendly lib
that helps scientists and engineers build and manage data science
projects.</li>
<li><a href="https://github.com/mlrun/mlrun">MLRun</a> - Generic
mechanism for data scientists to build, run, and monitor ML tasks and
pipelines.</li>
<li><a href="https://github.com/orchest/orchest/">Orchest</a> - Visual
pipeline editor and workflow orchestrator with an easy to use UI and
based on Kubernetes.</li>
<li><a href="https://github.com/ploomber/ploomber">Ploomber</a> - Write
maintainable, production-ready pipelines. Develop locally, deploy to the
cloud.</li>
<li><a href="https://docs.prefect.io/">Prefect</a> - A workflow
management system, designed for modern infrastructure.</li>
<li><a href="https://github.com/instill-ai/vdp">VDP</a> - An open-source
tool to seamlessly integrate AI for unstructured data into the modern
data stack.</li>
<li><a href="https://www.wordware.ai">Wordware</a> - A web-hosted IDE
where non-technical domain experts can build task-specific AI
agents.</li>
<li><a href="https://github.com/maiot-io/zenml">ZenML</a> - An
extensible open-source MLOps framework to create reproducible
pipelines.</li>
</ul>
<hr />
<h1 id="resources">Resources</h1>
<p>Where to discover new tools and discuss about existing ones.</p>
<h2 id="articles">Articles</h2>
<ul>
<li><a href="https://martinfowler.com/articles/cd4ml.html">Continuous
Delivery for Machine Learning</a> (Martin Fowler)</li>
<li><a href="https://arxiv.org/abs/2205.02302">Machine Learning
Operations (MLOps): Overview, Definition, and Architecture</a>
(arXiv)</li>
<li><a href="https://www.scaler.com/blog/mlops-roadmap/">MLOps Roadmap:
A Complete MLOps Career Guide</a> (Scaler Blogs)</li>
<li><a
href="https://cloud.google.com/solutions/machine-learning/mlops-continuous-delivery-and-automation-pipelines-in-machine-learning">MLOps:
Continuous delivery and automation pipelines in machine learning</a>
(Google)</li>
<li><a
href="https://towardsdatascience.com/ml-ops-machine-learning-as-an-engineering-discipline-b86ca4874a3f">MLOps:
Machine Learning as an Engineering Discipline</a> (Medium)</li>
<li><a
href="https://developers.google.com/machine-learning/guides/rules-of-ml">Rules
of Machine Learning: Best Practices for ML Engineering</a> (Google)</li>
<li><a
href="https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/aad9f93b86b7addfea4c419b9100c6cdd26cacea.pdf">The
ML Test Score: A Rubric for ML Production Readiness and Technical Debt
Reduction</a> (Google)</li>
<li><a
href="https://blogs.nvidia.com/blog/2020/09/03/what-is-mlops/">What Is
MLOps?</a> (NVIDIA)</li>
</ul>
<h2 id="books">Books</h2>
<ul>
<li><a
href="https://www.amazon.com/Beginning-MLOps-MLFlow-SageMaker-Microsoft/dp/1484265483">Beginning
MLOps with MLFlow</a> (Apress)</li>
<li><a
href="https://www.oreilly.com/library/view/building-machine-learning/9781492053187">Building
Machine Learning Pipelines</a> (O’Reilly)</li>
<li><a
href="https://www.oreilly.com/library/view/building-machine-learning/9781492045106">Building
Machine Learning Powered Applications</a> (O’Reilly)</li>
<li><a href="https://www.amazon.com/gp/product/6180033773">Deep Learning
in Production</a> (AI Summer)</li>
<li><a
href="https://www.oreilly.com/library/view/designing-machine-learning/9781098107956">Designing
Machine Learning Systems</a> (O’Reilly)</li>
<li><a
href="https://www.packtpub.com/product/engineering-mlops/9781800562882">Engineering
MLOps</a> (Packt)</li>
<li><a
href="https://www.oreilly.com/library/view/implementing-mlops-in/9781098136574">Implementing
MLOps in the Enterprise</a> (O’Reilly)</li>
<li><a
href="https://www.oreilly.com/library/view/introducing-mlops/9781492083283">Introducing
MLOps</a> (O’Reilly)</li>
<li><a
href="https://www.oreilly.com/library/view/kubeflow-for-machine/9781492050117">Kubeflow
for Machine Learning</a> (O’Reilly)</li>
<li><a
href="https://www.oreilly.com/library/view/kubeflow-operations-guide/9781492053262">Kubeflow
Operations Guide</a> (O’Reilly)</li>
<li><a
href="https://www.oreilly.com/library/view/machine-learning-design/9781098115777">Machine
Learning Design Patterns</a> (O’Reilly)</li>
<li><a
href="https://www.manning.com/books/machine-learning-engineering-in-action">Machine
Learning Engineering in Action</a> (Manning)</li>
<li><a
href="https://www.oreilly.com/library/view/ml-ops-operationalizing/9781492074663">ML
Ops: Operationalizing Data Science</a> (O’Reilly)</li>
<li><a
href="https://www.manning.com/books/mlops-engineering-at-scale">MLOps
Engineering at Scale</a> (Manning)</li>
<li><a
href="https://link.springer.com/book/10.1007/978-1-4842-9642-4">MLOps
Lifecycle Toolkit</a> (Apress)</li>
<li><a
href="https://www.packtpub.com/product/practical-deep-learning-at-scale-with-mlflow/9781803241333">Practical
Deep Learning at Scale with MLflow</a> (Packt)</li>
<li><a
href="https://www.oreilly.com/library/view/practical-mlops/9781098103002">Practical
MLOps</a> (O’Reilly)</li>
<li><a
href="https://www.packtpub.com/product/production-ready-applied-deep-learning/9781803243665">Production-Ready
Applied Deep Learning</a> (Packt)</li>
<li><a
href="https://www.oreilly.com/library/view/reliable-machine-learning/9781098106218">Reliable
Machine Learning</a> (O’Reilly)</li>
<li><a
href="https://www.packtpub.com/product/the-machine-learning-solutions-architect-handbook/9781801072168">The
Machine Learning Solutions Architect Handbook</a> (Packt)</li>
</ul>
<h2 id="events">Events</h2>
<ul>
<li><a href="https://www.applyconf.com/">apply() - The ML data
engineering conference</a></li>
<li><a
href="https://www.youtube.com/playlist?list=PLH8M0UOY0uy6d_n3vEQe6J_gRBUrISF9m">MLOps
Conference - Keynotes and Panels</a></li>
<li><a href="https://mlopsworld.com/">MLOps World: Machine Learning in
Production Conference</a></li>
<li><a href="https://normconf.com/">NormConf - The Normcore Tech
Conference</a></li>
<li><a href="https://mlsys.stanford.edu/">Stanford MLSys Seminar
Series</a></li>
</ul>
<h2 id="other-lists">Other Lists</h2>
<ul>
<li><a href="https://github.com/eugeneyan/applied-ml">Applied
ML</a></li>
<li><a
href="https://github.com/hibayesian/awesome-automl-papers">Awesome
AutoML Papers</a></li>
<li><a href="https://github.com/windmaple/awesome-AutoML">Awesome
AutoML</a></li>
<li><a href="https://github.com/academic/awesome-datascience">Awesome
Data Science</a></li>
<li><a href="https://github.com/kelvins/awesome-dataops">Awesome
DataOps</a></li>
<li><a
href="https://github.com/ChristosChristofidis/awesome-deep-learning">Awesome
Deep Learning</a></li>
<li><a href="https://github.com/leomaurodesenv/game-datasets">Awesome
Game Datasets</a> (includes AI content)</li>
<li><a
href="https://github.com/josephmisiti/awesome-machine-learning">Awesome
Machine Learning</a></li>
<li><a href="https://github.com/visenger/awesome-mlops">Awesome
MLOps</a></li>
<li><a
href="https://github.com/EthicalML/awesome-production-machine-learning">Awesome
Production Machine Learning</a></li>
<li><a href="https://github.com/vinta/awesome-python">Awesome
Python</a></li>
<li><a
href="https://github.com/ahkarami/Deep-Learning-in-Production">Deep
Learning in Production</a></li>
</ul>
<h2 id="podcasts">Podcasts</h2>
<ul>
<li><a href="https://www.youtube.com/@aistoriespodcast">AI Stories
Podcast</a></li>
<li><a href="https://kubernetespodcast.com/">Kubernetes Podcast from
Google</a></li>
<li><a
href="https://podcasts.google.com/?feed=aHR0cHM6Ly9zb2Z0d2FyZWVuZ2luZWVyaW5nZGFpbHkuY29tL2NhdGVnb3J5L21hY2hpbmUtbGVhcm5pbmcvZmVlZC8">Machine
Learning – Software Engineering Daily</a></li>
<li><a
href="https://podcasts.google.com/?feed=aHR0cHM6Ly9hbmNob3IuZm0vcy8xNzRjYjFiOC9wb2RjYXN0L3Jzcw">MLOps.community</a></li>
<li><a href="https://podcast.zenml.io/">Pipeline Conversation</a></li>
<li><a href="https://changelog.com/practicalai">Practical AI: Machine
Learning, Data Science</a></li>
<li><a href="https://twimlai.com/">This Week in Machine Learning &amp;
AI</a></li>
<li><a
href="https://www.youtube.com/playlist?list=PL4-eEhdXDO5F9Myvh41EeUh7oCgzqFRGk">True
ML Talks</a></li>
</ul>
<h2 id="slack">Slack</h2>
<ul>
<li><a href="https://kubeflow.slack.com/#/">Kubeflow Workspace</a></li>
<li><a href="https://mlops-community.slack.com">MLOps Community
Wokspace</a></li>
</ul>
<h2 id="websites">Websites</h2>
<ul>
<li><a href="https://mlops.swiss-ai-center.ch/">A guide to
MLOps</a></li>
<li><a href="http://featurestore.org/">Feature Stores for ML</a></li>
<li><a href="https://github.com/GokuMohandas/Made-With-ML">Made with
ML</a></li>
<li><a href="https://ml-ops.org/">ML-Ops</a></li>
<li><a href="https://mlops.community/">MLOps Community</a></li>
<li><a href="https://mlops-guide.github.io/">MLOps Guide</a></li>
<li><a href="https://mlopsnow.com">MLOps Now</a></li>
</ul>
<h1 id="contributing">Contributing</h1>
<p>All contributions are welcome! Please take a look at the <a
href="https://github.com/kelvins/awesome-mlops/blob/main/CONTRIBUTING.md">contribution
guidelines</a> first.</p>
<p><a href="https://github.com/kelvins/awesome-mlops">mlops.md
Github</a></p>