This commit is contained in:
2025-07-18 23:13:11 +02:00
parent c9485bf576
commit 652812eed0
2354 changed files with 1266414 additions and 1 deletions

View File

@@ -0,0 +1,442 @@
<h1 id="awesome-cheminformatics-awesome">Awesome Cheminformatics <a
href="https://awesome.re"><img src="https://awesome.re/badge.svg"
alt="Awesome" /></a></h1>
<blockquote>
<p>Cheminformatics (also known as chemoinformatics, chemioinformatics
and chemical informatics) is the use of computer and informational
techniques applied to a range of problems in the field of chemistry.— <a
href="https://en.wikipedia.org/wiki/Cheminformatics">Wikipedia</a></p>
</blockquote>
<p>A curated list of awesome Cheminformatics software, resources, and
libraries. Mostly command line based, and free or open-source. Please
feel free to <a href="CONTRIBUTING.md">contribute</a> !</p>
<h2 id="contents">Contents</h2>
<ul>
<li><a href="#applications">Applications</a>
<ul>
<li><a href="#app-visualization">Visualization</a></li>
<li><a href="#app-cmd">Command Line Tools</a></li>
<li><a href="#app-docking">Docking</a></li>
<li><a href="#app-virtual">Virtual Machine</a></li>
</ul></li>
<li><a href="#libraries">Libraries</a>
<ul>
<li><a href="#lib-general">General Purpose</a></li>
<li><a href="#lib-visualization">Visualization</a></li>
<li><a href="#lib-format">Command Line Tools</a></li>
<li><a href="#lib-dock">Docking</a></li>
<li><a href="#lib-des">Molecular Descriptors</a></li>
<li><a href="#lib-ml">Machine Learning</a></li>
<li><a href="#lib-web">Web APIs</a></li>
<li><a href="#lib-db">Databases</a></li>
<li><a href="#lib-others">Others</a></li>
</ul></li>
<li><a href="#journals">Journals</a></li>
<li><a href="#resources">Resources</a>
<ul>
<li><a href="#courses">Courses</a></li>
<li><a href="#blogs">Blogs</a></li>
<li><a href="#books">Books</a></li>
</ul></li>
<li><a href="#see-also">See Also</a></li>
</ul>
<h2 id="applications">Applications</h2>
<p><a id="app-visualization"></a> ### Visualization</p>
<ul>
<li><a href="https://sourceforge.net/projects/pymol/">PyMOL</a> -
Python-enhanced molecular graphics tool.</li>
<li><a href="http://jmol.sourceforge.net/">Jmol</a> - Browser-based
HTML5 viewer and stand-alone Java viewer for chemical structures in
3D.</li>
<li><a href="http://www.ks.uiuc.edu/Research/vmd/">VMD</a> - Molecular
visualization program for displaying, animating, and analyzing large
biomolecular systems using 3-D graphics and built-in scripting.</li>
<li><a href="https://www.cgl.ucsf.edu/chimera/">Chimera</a> - Highly
extensible program for interactive molecular visualization and analysis.
<a
href="https://www.cgl.ucsf.edu/chimera/docs/sourcecode.html">Source</a>
is available.</li>
<li><a href="https://www.cgl.ucsf.edu/chimerax/">ChimeraX</a> - The
next-generation molecular visualization program, following UCSF Chimera.
Source is available <a
href="https://www.cgl.ucsf.edu/chimerax/docs/devel/conventions.html">here</a>.</li>
<li><a
href="http://www.openmolecules.org/datawarrior/index.html">DataWarrior</a>
- A program for data Visualization and analysis which combines dynamic
graphical views and interactive row filtering with chemical
intelligence.</li>
</ul>
<p><a id="app-cmd"></a> ### Command Line Tools</p>
<ul>
<li><a href="http://openbabel.org/wiki/Main_Page">Open Babel</a> -
Chemical toolbox designed to speak the many languages of chemical
data.</li>
<li><a href="http://www.mayachemtools.org/index.html">MayaChemTools</a>
- Collection of Perl and Python scripts, modules, and classes that
support day-to-day computational discovery needs.</li>
<li><a href="http://m3g.iqm.unicamp.br/packmol/home.shtml">Packmol</a> -
Initial configurations for molecular dynamics simulations by packing
optimization.</li>
<li><a
href="http://meilerlab.org/index.php/bclcommons/show/b_apps_id/1">BCL::Commons</a></li>
</ul>
<p><a id="app-docking"></a> ### Docking</p>
<ul>
<li><a href="http://vina.scripps.edu/">AutoDock Vina</a> - Molecular
docking and virtual screening.</li>
<li><a href="https://sourceforge.net/projects/smina/">smina</a> -
Customized <a href="http://vina.scripps.edu/">AutoDock Vina</a> to
better support scoring function development and high-performance energy
minimization.</li>
</ul>
<p><a id="app-virtual"></a> ### Virtual Machine</p>
<ul>
<li><a
href="http://chembl.blogspot.com/2015/07/mychembl-20-has-landed.html">myChEMBL</a>
- A version of ChEMBL built using Open Source software (Ubuntu,
PostgreSQL, RDKit)</li>
<li><a href="https://github.com/3D-e-Chem/3D-e-Chem-VM">3D e-Chem
Virtual Machine</a> - Virtual machine with all software and sample data
to run 3D-e-Chem Knime workflows</li>
</ul>
<h2 id="libraries">Libraries</h2>
<p><a id="lib-general"></a> ### General Purpose</p>
<ul>
<li><a href="http://www.rdkit.org/">RDKit</a> - Collection of
cheminformatics and machine-learning software written in C++ and
Python.</li>
<li><a href="https://github.com/epam/Indigo">Indigo</a> - Universal
molecular toolkit that can be used for molecular fingerprinting,
substructure search, and molecular visualization written in C++ package,
with Java, C#, and Python wrappers.</li>
<li><a href="https://sourceforge.net/projects/cdk/">CDK (Chemistry
Development Kit)</a> - Algorithms for structural chemo- and
bioinformatics, implemented in Java.</li>
<li><a
href="https://www.bioconductor.org/packages/release/bioc/vignettes/ChemmineR/inst/doc/ChemmineR.html">ChemmineR</a>
- Cheminformatics package for analyzing drug-like small molecule data in
R.</li>
<li><a href="https://github.com/bjodah/chempy">ChemPy</a> - A Python
package useful for chemistry (mainly physical/inorganic/analytical
chemistry)</li>
<li><a
href="https://github.com/mojaie/MolecularGraph.jl">MolecularGraph.jl</a>
- A graph-based molecule modeling and chemoinformatics analysis toolkit
fully implemented in Julia</li>
<li><a href="https://github.com/datamol-org/datamol">datamol</a>: -
Molecular Manipulation Made Easy. A light wrapper build on top of
RDKit.</li>
<li><a href="https://github.com/cimm-kzn/CGRtools">CGRtools</a> -
Toolkit for processing molecules, reactions and condensed graphs of
reactions. Can be used for chemical standardization, MCS search,
tautomers generation with backward compatibility to RDKit and
NetworkX.</li>
</ul>
<p><a id="lib-format"></a> ### Format Checking</p>
<ul>
<li><a
href="https://github.com/chembl/ChEMBL_Structure_Pipeline">ChEMBL_Structure_Pipeline
(formerly standardiser)</a> - Tool designed to provide a simple way of
standardising molecules as a prelude to e.g. molecular modelling
exercises.</li>
<li><a href="https://github.com/mcs07/MolVS">MolVS</a> - Molecule
validation and standardization based on <a
href="http://www.rdkit.org/">RDKit</a>.</li>
<li><a href="https://github.com/PatWalters/rd_filters">rd_filters</a> -
A script to run structural alerts using the RDKit and ChEMBL</li>
<li><a href="https://github.com/haddocking/pdb-tools">pdb-tools</a> - A
swiss army knife for manipulating and editing PDB files.</li>
</ul>
<p><a id="lib-visualization"></a> ### Visualization</p>
<ul>
<li><a href="http://partridgejiang.github.io/Kekule.js/">Kekule.js</a> -
Front-end JavaScript library for providing the ability to represent,
draw, edit, compare and search molecule structures on web browsers.</li>
<li><a href="https://github.com/3dmol/3Dmol.js">3Dmol.js</a> - An
object-oriented, webGL based JavaScript library for online molecular
visualization.</li>
<li><a href="https://github.com/JChemPaint/jchempaint">JChemPaint</a> -
Chemical 2D structure editor application/applet based on the <a
href="https://sourceforge.net/projects/cdk/">Chemistry Development
Kit</a>.</li>
<li><a href="https://github.com/EBjerrum/rdeditor">rdeditor</a> - Simple
RDKit molecule editor GUI using PySide.</li>
<li><a href="http://nglviewer.org/nglview/latest/">nglviewer</a> -
Interactive molecular graphics for Jupyter notebooks.</li>
<li><a href="https://www.npmjs.com/package/@rdkit/rdkit">RDKit.js</a> -
Official JavaScript distribution of cheminformatics functionality from
the RDKit - a C++ library for cheminformatics.</li>
</ul>
<p><a id="lib-des"></a> ### Molecular Descriptors</p>
<ul>
<li><a href="https://github.com/mordred-descriptor/mordred">mordred</a>
- Molecular descriptor calculator based on <a
href="http://www.rdkit.org/">RDKit</a>.</li>
<li><a
href="https://github.com/bp-kelley/descriptastorus">DescriptaStorus</a>
- Descriptor computation(chemistry) and (optional) storage for machine
learning.</li>
<li><a href="https://github.com/samoturk/mol2vec">mol2vec</a> - Vector
representations of molecular substructures.</li>
<li><a
href="http://silicos-it.be.s3-website-eu-west-1.amazonaws.com/software/align-it/1.0.4/align-it.html#alignit-generating-pharmacophore-points">Align-it</a>
- Align molecules according their pharmacophores.</li>
<li><a href="https://nanx.me/Rcpi/index.html">Rcpi</a> - R/Bioconductor
package to generate various descriptors of proteins, compounds and their
interactions.</li>
</ul>
<p><a id="lib-ml"></a> ### Machine Learning</p>
<ul>
<li><a href="https://github.com/deepchem/deepchem">DeepChem</a> - Deep
learning library for Chemistry based on Tensorflow</li>
<li><a href="https://github.com/chemprop/chemprop">Chemprop</a> -
Directed message passing neural networks for property prediction of
molecules and reactions with uncertainty and interpretation.</li>
<li><a href="https://github.com/hachmannlab/chemml">ChemML</a> - ChemML
is a machine learning and informatics program suite for the analysis,
mining, and modeling of chemical and materials data. (based on
Tensorflow)</li>
<li><a
href="https://github.com/Oloren-AI/olorenchemengine">olorenchemengine</a>
- Molecular property prediction with unified API for diverse models and
respresentations, with integrated uncertainty quantification,
interpretability, and hyperparameter/architecture tuning.</li>
<li><a href="https://github.com/Mariewelt/OpenChem">OpenChem</a> -
OpenChem is a deep learning toolkit for Computational Chemistry with
PyTorch backend.</li>
<li><a href="https://github.com/awslabs/dgl-lifesci">DGL-LifeSci</a> -
DGL-LifeSci is a <a href="https://www.dgl.ai/">DGL</a>-based package for
various applications in life science with graph neural network.</li>
<li><a
href="https://github.com/pfnet-research/chainer-chemistry">chainer-chemistry</a>
- A Library for Deep Learning in Biology and Chemistry.</li>
<li><a
href="https://pytorch-geometric.readthedocs.io/en/latest/">pytorch-geometric</a>
- A PyTorch library provides implementation of many graph convolution
algorithms.</li>
<li><a href="https://github.com/jrash/ChemModLab">chemmodlab</a> - A
Cheminformatics Modeling Laboratory for Fitting and Assessing Machine
Learning Models in R.</li>
<li><a href="https://github.com/sustainable-processes/summit">Summit</a>
- A python package for optimizing chemical reactions using machine
learning (contains 10 algorithms + several benchmarks).</li>
</ul>
<p><a id="lib-web"></a> ### Web APIs</p>
<ul>
<li><a href="https://github.com/ropensci/webchem">webchem</a> - Chemical
Information from the Web.</li>
<li><a href="http://pubchempy.readthedocs.io">PubChemPy</a> - Python
wrapper for the PubChem PUG REST API.</li>
<li><a href="http://chemspipy.readthedocs.org">ChemSpiPy</a> - Python
wrapper for the ChemSpider API.</li>
<li><a href="http://cirpy.readthedocs.org/">CIRpy</a> - Python wrapper
for the <a href="https://cactus.nci.nih.gov/chemical/structure">NCI
Chemical Identifier Resolver (CIR)</a>.</li>
<li><a href="https://github.com/chembl/chembl_beaker">Beaker</a> - <a
href="http://www.rdkit.org/">RDKit</a> and <a
href="https://cactus.nci.nih.gov/osra/">OSRA</a> in the <a
href="http://bottlepy.org/docs/dev/">Bottle</a> on <a
href="http://www.tornadoweb.org/en/stable/">Tornado</a>.</li>
<li><a
href="https://github.com/girke-lab/chemminetools">chemminetools</a> -
Open source web framework for small molecule analysis based on
Django.</li>
<li><a href="http://ambit.sourceforge.net/">ambit</a> - offers
chemoinformatics functionality via REST web services.</li>
</ul>
<p><a id="lib-db"></a> ### Databases</p>
<ul>
<li><a href="https://github.com/rvianello/razi">razi</a> -
Cheminformatic extension for the SQLAlchemy database.</li>
<li><a
href="https://bitbucket.org/fiehnlab/fiehnlab-cts/src/master/">Chemical
Translation Service</a> - Source code of the <a
href="https://cts.fiehnlab.ucdavis.edu/">Chemical Translation
Service</a> web service.</li>
</ul>
<p><a id="lib-dock"></a> ### Docking * <a
href="https://www.rosettacommons.org/docs/latest/Home">Rosetta</a> - A
comprehensive software suite for modeling macromolecular structures.
Used larely for protein-protein docking. * <a
href="https://github.com/dockstring/dockstring">DOCKSTRING</a> -
Automates and standardizes ligand preparation for AutoDock Vina.</p>
<p><a id="lib-md"></a> ### Molecular Dynamics</p>
<ul>
<li><a href="http://www.gromacs.org/">Gromacs</a> - Molecular dynamics
package mainly designed for simulations of proteins, lipids and nucleic
acids.</li>
<li><a href="http://openmm.org/">OpenMM</a> - High performance toolkit
for molecular simulation including extensive language bindings for
Python, C, C++, and even Fortran.</li>
<li><a href="https://www.ks.uiuc.edu/Research/namd/">NAMD</a> - a
parallel molecular dynamics code designed for high-performance
simulation of large biomolecular systems.</li>
<li><a href="https://github.com/mdtraj/mdtraj">MDTraj</a> - Analysis of
molecular dynamics trajectories.</li>
<li><a href="https://github.com/cclib/cclib">cclib</a> - Parsers and
algorithms for computational chemistry logfiles.</li>
<li><a href="https://github.com/prody/ProDy">ProDy</a> - A Python
package for protein dynamics analysis</li>
</ul>
<p><a id="lib-others"></a> ### Others</p>
<ul>
<li><a href="https://github.com/girke-lab/eiR">eiR</a> - Accelerated
similarity searching of small molecules</li>
<li><a href="https://github.com/dan2097/opsin">OPSIN</a> - Open Parser
for Systematic IUPAC nomenclature</li>
<li><a href="https://github.com/MolSSI/cookiecutter-cms">Cookiecutter
for Computational Molecular Sciences</a> - Python-centric Cookiecutter
for Molecular Computational Chemistry Packages by <a
href="https://molssi.org/">MolSSL</a></li>
<li><a
href="https://github.com/PrincetonUniversity/auto-qchem">Auto-QChem</a>
- an automated workflow for the generation and storage of DFT
calculations for organic molecules.</li>
<li><a
href="https://git.durrantlab.pitt.edu/jdurrant/gypsum_dl">Gypsum-DL</a>
- a program for converting 2D SMILES strings to 3D models.</li>
<li><a href="https://github.com/connorcoley/rdchiral">RDchiral</a> -
Wrapper for RDKits RunReactants to improve stereochemistry
handling</li>
<li><a href="https://github.com/Et9797/confgen-webapp">confgen</a> -
Webapp for generating conformers</li>
</ul>
<h2 id="journals">Journals</h2>
<ul>
<li><a href="https://jcheminf.biomedcentral.com/">Journal of
Cheminformatics</a></li>
<li><a href="https://pubs.acs.org/journal/jcisd8">Journal of Chemical
Information and Modeling (ACS Publications)</a></li>
</ul>
<h2 id="resources">Resources</h2>
<h3 id="courses">Courses</h3>
<ul>
<li><a
href="http://learncheminformatics.com/">Learncheminformatics.com</a> -
“Cheminformatics: Navigating the world of chemical data” courese at
Indiana University.</li>
<li><a href="https://github.com/Mishima-syk/py4chemoinformatics">Python
for chemoinformatics</a></li>
<li><a
href="https://github.com/volkamerlab/TeachOpenCADD">TeachOpenCADD</a> -
A teaching platform for computer-aided drug design (CADD) using open
source packages and data.</li>
<li><a
href="https://chem.libretexts.org/Courses/Intercollegiate_Courses/Cheminformatics_OLCC_(2019)">Cheminformatics
OLCC</a> - Cheminformatics course of the Collaborative Intercollegiate
Online Chemistry Course (OLCC) course of University of Arkansas at
Little Rock by Robert Belford</li>
<li><a href="http://bigchem.eu/alllectures">BigChem</a> - All lectures
of <a href="http://bigchem.eu/">BigChem</a> (A Horizon 2020 MSC ITN EID
project, which provides innovative education in large chemical data
analysis.)</li>
<li><a href="https://dasher.wustl.edu/chem478/">Molecular modeling
course</a> - by Dr. <a href="https://dasher.wustl.edu/">Jay Ponder</a>,
a professor from WashU St.Louis.</li>
<li><a href="https://dasher.wustl.edu/chem430/">Simulation in Chemistry
and Biochemistry</a> - by Dr. <a href="https://dasher.wustl.edu/">Jay
Ponder</a>, a professor from WashU St.Louis.</li>
</ul>
<h3 id="blogs">Blogs</h3>
<ul>
<li><a
href="https://opensourcemolecularmodeling.github.io/README.html">Open
Source Molecular Modeling</a> - Updateable catalog of open source
molecular modeling software.</li>
<li><a href="https://pubchemblog.ncbi.nlm.nih.gov/">PubChem Blog</a> -
News, updates and tutorials about <a
href="https://pubchem.ncbi.nlm.nih.gov/">PubChem</a>.</li>
<li><a href="http://chembl.blogspot.tw/">The ChEMBL-og blog</a> -
Stories and news from Computational Chemical Biology Group at <a
href="https://www.ebi.ac.uk/">EMBL-EBI</a>.</li>
<li><a href="http://chembl.github.io/">ChEMBL blog</a> - ChEMBL on
GitHub.</li>
<li><a href="http://www.steinbeck-molecular.de/steinblog/">SteinBlog</a>
- Blog of <a
href="http://www.steinbeck-molecular.de/steinblog/index.php/about/">Christoph
Steinbeck</a>, who is the head of cheminformatics and metabolism at the
EMBL-EBI.</li>
<li><a href="http://practicalcheminformatics.blogspot.com/">Practical
Cheminformatics</a> - Blog with in-depth examples of practical
application of cheminformatics.</li>
<li><a href="http://blog.rguha.net/">So much to do, so little time -
Trying to squeeze sense out of chemical data</a> - Bolg of <a
href="http://blog.rguha.net/?page_id=8">Rajarshi Guha</a>, who is a
research scientist at NIH Center for Advancing Translational Science.  *
Some old blogs <a href="https://rguha.wordpress.com/">1</a> <a
href="http://www.rguha.net/index.html">2</a>.</li>
<li><a href="http://baoilleach.blogspot.tw/">Noel OBlog</a> - Blog of
<a href="https://www.redbrick.dcu.ie/~noel/">Noel OBoyle</a>, who is a
Senior Software Engineer at NextMove Software.</li>
<li><a href="http://chem-bla-ics.blogspot.tw/">chem-bla-ics</a> - Blog
of <a href="http://egonw.github.io/">Egon Willighagen</a>, who is an
assistant professor at Maastricht University. <!---
* [Asad's Blog](https://chembioinfo.com/) - Bolg of Syed Asad Rahman, who is a research scientist in the [Thornton group](http://www.ebi.ac.uk/research/thornton) at EMBL-EBI.
--></li>
<li><a href="http://asteeves.github.io/">steeveslab-blog</a> - Some
examples using <a href="http://www.rdkit.org/">RDKit</a>.</li>
<li><a href="http://www.macinchem.org/">Macs in Chemistry</a> - Provide
a resource for chemists using Apple Macintosh computers.</li>
<li><a href="http://www.drugdiscovery.net/">DrugDiscovery.NET</a> - Blog
of <a href="http://www.andreasbender.de/">Andreas Bender</a>, who is a
Reader for Molecular Informatics at University of Cambridge.</li>
<li><a href="https://iwatobipen.wordpress.com/">Is life worth
living?</a> - Some examples for cheminformatics libraries.</li>
<li><a href="https://cheminf20.org/">Cheminformatics 2.0</a> - Blog of
<a href="https://twitter.com/aclarkxyz">Alex M. Clark</a>, a research
scientist at Collaborative Drug Discovery.</li>
<li><a href="https://depth-first.com/">Depth-First</a> - Blog of <a
href="https://depth-first.com/about/">Richard L. Apodaca</a>, a chemist
living in La Jolla, California.</li>
<li><a href="https://www.cheminformania.com">Cheminformania</a> - Blog
of <a
href="https://www.cheminformania.com/about/esben-jannik-bjerrum/">Ph.D,
Esben Jannik Bjerrum</a>, who is a Principle Scientist and a Machine
Learning and AI specialists at AstraZeneca.</li>
</ul>
<h3 id="books">Books</h3>
<ul>
<li><a
href="https://books.google.com/books/about/Computational_Approaches_in_Cheminformat.html?id=bLqV4rYQoYsC">Computational
Approaches in Cheminformatics and Bioinformatics</a> - Include insights
from public (NIH), academic, and industrial sources at the same
time.</li>
<li><a
href="https://onlinelibrary.wiley.com/doi/book/10.1002/9781118742785">Chemoinformatics
for Drug Discovery</a> - Materials about how to use Chemoinformatics
strategies to improve drug discovery results.</li>
<li><a
href="https://onlinelibrary.wiley.com/doi/book/10.1002/9783527628766">Molecular
Descriptors for Chemoinformatics</a> - More than 3300 descriptors and
related terms for chemoinformatic analysis of chemical compound
properties.</li>
</ul>
<p><a id="see-also"></a> ## See Also</p>
<ul>
<li><a
href="https://github.com/hussius/deeplearning-biology#chemoinformatics-and-drug-discovery-">deeplearning-biology</a>
- Chemoinformatics and drug discovery section in deeplearning-biology
repo.</li>
<li><a
href="https://github.com/lmmentel/awesome-python-chemistry">awesome-python-chemistry</a>
- Another list focuses on Python stuff related to Chemistry.</li>
<li><a
href="https://github.com/benb111/awesome-small-molecule-ml">awesome-small-molecule-ml</a>
- A list of papers, data sets, and other resources for machine learning
for small-molecule drug discovery.</li>
<li><a
href="https://github.com/yangnianzu0515/awesome-molecular-docking">awesome-molecular-docking</a>
- A curated list of molecular docking software, datasets, and other
closely related resources.</li>
<li><a href="https://molssi.org/software-search/">MolSSI Molecular
Software Database</a> <a
href="https://fiehnlab.ucdavis.edu/staff/kind/metabolomics">Pages
created by Tobias Kind, PhD</a></li>
</ul>
<h2 id="license">License</h2>
<p><a href="https://creativecommons.org/publicdomain/zero/1.0/"><img
src="http://mirrors.creativecommons.org/presskit/buttons/88x31/svg/cc-zero.svg"
alt="CC0" /></a></p>
<p><a
href="https://github.com/hsiaoyi0504/awesome-cheminformatics">cheminformatics.md
Github</a></p>