<h1 id="awesome-hbase-awesome">Awesome HBase <a
href="https://github.com/sindresorhus/awesome"><img
src="https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg"
alt="Awesome" /></a></h1>
<p><a
href="http://hbase.apache.org/"><img src="https://cdn.rawgit.com/rayokota/awesome-hbase/c197f415/hbase_logo_with_orca-2.png" align="right" width="150"></a></p>
<p>A curated list of awesome HBase projects and resources.</p>
<p><a href="http://hbase.apache.org">HBase</a> is a distributed,
scalable, big data store.</p>
<h2 id="contents">Contents</h2>
<ul>
<li><a href="#projects">Projects</a>
<ul>
<li><a href="#clients">Clients</a></li>
<li><a href="#cloud">Cloud</a></li>
<li><a href="#frameworks">Frameworks</a>
<ul>
<li><a href="#datasets">Datasets</a></li>
<li><a href="#document">Document</a></li>
<li><a href="#entityjpa">Entity/JPA</a></li>
<li><a href="#geospatial">Geospatial</a></li>
<li><a href="#graph">Graph</a></li>
<li><a href="#sqlolap">SQL/OLAP</a></li>
<li><a href="#time-series">Time Series</a></li>
</ul></li>
<li><a href="#infrastructure">Infrastructure</a>
<ul>
<li><a href="#secondary-indices">Secondary Indices</a></li>
<li><a href="#transactions">Transactions</a></li>
</ul></li>
<li><a href="#integrations">Integrations</a></li>
<li><a href="#tools">Tools</a></li>
<li><a href="#miscellaneous">Miscellaneous</a></li>
</ul></li>
<li><a href="#resources">Resources</a>
<ul>
<li><a href="#books">Books</a></li>
<li><a href="#papers">Papers</a></li>
<li><a href="#community">Community</a></li>
</ul></li>
</ul>
<h2 id="projects">Projects</h2>
<h3 id="clients">Clients</h3>
<ul>
<li><a href="https://github.com/OpenTSDB/asynchbase">asynchbase</a> -
Fully asynchronous, non-blocking HBase client.</li>
<li><a href="https://github.com/tsuna/gohbase">gohbase</a> - Pure Go
client for HBase.</li>
<li><a href="https://github.com/wbolster/happybase">happybase</a> -
Python client for HBase.</li>
</ul>
<h3 id="cloud">Cloud</h3>
<ul>
<li><a href="https://aws.amazon.com/emr/">Amazon EMR</a> - Amazon’s
Hadoop/HBase offering on AWS.</li>
<li><a
href="https://azure.microsoft.com/en-us/services/hdinsight/">Azure
HDInsight</a> - Microsoft’s Hadoop/HBase offering on Azure.</li>
<li><a
href="https://www.cloudera.com/products/product-components/cloudera-director.html">Cloudera
Director</a> - Run Hadoop/HBase clusters on AWS, Azure or Google
Cloud.</li>
<li><a href="https://cloud.google.com/bigtable/">Google Cloud
Bigtable</a> - High-performance NoSQL database service accessible via
HBase client API.</li>
<li><a
href="https://hortonworks.com/open-source/cloudbreak/">Hortonworks
Cloudbreak</a> - Provision Hadoop/HBase clusters on AWS, Azure, Google
Cloud, or OpenStack.</li>
</ul>
<h3 id="frameworks">Frameworks</h3>
<h4 id="datasets">Datasets</h4>
<ul>
<li><a href="http://kitesdk.org">Kite</a> - High-level data layer for
Hadoop/HBase.</li>
</ul>
<h4 id="document">Document</h4>
<ul>
<li><a href="https://github.com/rayokota/hdocdb">HDocDB</a> - HBase as a
JSON document database.</li>
</ul>
<h4 id="entityjpa">Entity/JPA</h4>
<ul>
<li><a href="http://www.datanucleus.org">DataNucleus</a> - JPA
persistence layer with support for HBase.</li>
<li><a href="http://gora.apache.org">Gora</a> - Persistence library for
big data with support for HBase.</li>
<li><a href="https://github.com/flipkart-incubator/hbase-orm">HBase
ORM</a> - Production-grade HBase ORM library.</li>
<li><a href="https://github.com/rayokota/hentitydb">HEntityDB</a> -
HBase as an entity database.</li>
<li><a href="https://github.com/impetus-opensource/Kundera">Kundera</a>
- JPA client with support for HBase.</li>
</ul>
<h4 id="geospatial">Geospatial</h4>
<ul>
<li><a href="http://www.geomesa.org/">GeoMesa</a> - Spatio-temporal
database with support for Accumulo, HBase, Cassandra, and Kafka.</li>
</ul>
<h4 id="graph">Graph</h4>
<ul>
<li><a href="https://github.com/dbs-leipzig/gradoop">Gradoop</a> -
Research framework for scalable graph analytics built on Flink and
HBase.</li>
<li><a href="https://github.com/rayokota/hgraphdb">HGraphDB</a> - HBase
as a TinkerPop graph database.</li>
<li><a
href="https://github.com/apache/incubator-hugegraph">HugeGraph</a> -
Graph database that handles more than 10 billion data elements with high
performance and scalability.</li>
<li><a href="http://janusgraph.org/">JanusGraph</a> - Scalable graph
database with support for Cassandra, HBase, Google Cloud Bigtable, and
BerkeleyDB.</li>
<li><a href="https://github.com/vesoft-inc/nebula">NebulaGraph</a> -
High-performance distributed graph database.</li>
<li><a href="http://s2graph.incubator.apache.org">S2Graph</a> -
High-performance distributed graph database built on HBase.</li>
</ul>
<h4 id="sqlolap">SQL/OLAP</h4>
<ul>
<li><a href="http://antsdb.com/">AntsDB</a> - Low-latency,
high-concurrency, MySQL-compliant SQL layer for HBase.</li>
<li><a href="https://esgyn.com/">EsgynDB</a> - Commercial SQL engine
providing ACID transactions and BI analytics on top of Hadoop, based on
Trafodion.</li>
<li><a href="http://kylin.apache.org">Kylin</a> - Extreme OLAP engine
for big data that stores data in HBase.</li>
<li><a href="http://www.leanxcale.com">LeanXcale</a> - Commercial
full-ACID, full-SQL product built on Hadoop/HBase.</li>
<li><a href="https://phoenix.apache.org">Phoenix</a> - SQL layer on top
of HBase.</li>
<li><a href="https://www.splicemachine.com">Splice Machine</a> -
Commercial RDBMS built on top of HBase.</li>
<li><a href="http://trafodion.apache.org">Trafodion</a> - Transactional
SQL-on-Hadoop/HBase.</li>
</ul>
<h4 id="time-series">Time Series</h4>
<ul>
<li><a
href="http://axibase.com/products/axibase-time-series-database/">Axibase</a>
- Distributed time series database built on HBase.</li>
<li><a href="http://opentsdb.net">OpenTSDB</a> - Scalable time series
database built on HBase.</li>
<li><a href="http://www.warp10.io">Warp 10</a> - Time series database
for sensor data.</li>
</ul>
<h3 id="infrastructure">Infrastructure</h3>
<h4 id="secondary-indices">Secondary Indices</h4>
<ul>
<li><a href="https://github.com/Huawei-Hadoop/hindex">hindex</a> -
Secondary index for HBase.</li>
<li><a href="http://ngdata.github.io/hbase-indexer/">Lily HBase
Indexer</a> - Quickly and easily search for content stored in
HBase.</li>
</ul>
<h4 id="transactions">Transactions</h4>
<ul>
<li><a href="https://github.com/VCNC/haeinsa">Haeinsa</a> -
Multi-row/multi-table transaction library for HBase.</li>
<li><a href="https://github.com/algarecu/hbase-0.94.8-qod">HBase-QoD</a>
- Vector-field consistency for HBase fine-grained transactional inter-DC
replication.</li>
<li><a href="https://github.com/apache/incubator-omid">Omid</a> -
Transactional support for HBase.</li>
<li><a href="http://tephra.incubator.apache.org">Tephra</a> - Globally
consistent transactions on top of HBase.</li>
<li><a href="https://github.com/XiaoMi/themis">Themis</a> -
Cross-row/cross-table transactions on HBase based on Google’s
Percolator.</li>
</ul>
<h3 id="integrations">Integrations</h3>
<ul>
<li><a
href="https://github.com/apache/apex-malhar/tree/master/contrib/src/test/java/org/apache/apex/malhar/contrib/hbase">Apex</a>
- Apex-HBase connector.</li>
<li><a
href="https://github.com/apache/beam/tree/master/sdks/java/io/hbase">Beam</a>
- Beam HBase integration.</li>
<li><a href="http://camel.apache.org/hbase.html">Camel</a> - Camel HBase
component.</li>
<li><a href="https://github.com/Cascading/cascading.hbase">Cascading</a>
- HBase adapters for Cascading.</li>
<li><a
href="https://github.com/sorenmacbeth/hbase-cascalog">Cascalog</a> -
Wrapper around Cascading.HBase for use in Cascalog.</li>
<li><a
href="https://github.com/apache/crunch/tree/master/crunch-hbase">Crunch</a>
- HBase adapters for Crunch.</li>
<li><a href="https://drill.apache.org/docs/querying-hbase/">Drill</a> -
HBase storage plugin for Drill.</li>
<li><a
href="https://github.com/mallocator/Elasticsearch-HBase-River">Elasticsearch</a>
- Elasticsearch import river for HBase.</li>
<li><a
href="https://github.com/apache/flink/tree/master/flink-connectors/flink-connector-hbase-2.2">Flink</a>
- Flink-HBase connector.</li>
<li><a
href="https://github.com/apache/incubator-gearpump/tree/master/external/hbase">Gearpump</a>
- Gearpump integration for HBase.</li>
<li><a
href="https://github.com/apache/giraph/tree/trunk/giraph-hbase">Giraph</a>
- Giraph input and output formats for HBase.</li>
<li><a
href="https://hawq.apache.org/docs/userguide/2.3.0.0-incubating/pxf/HBasePXF.html">HAWQ</a>
- HAWQ PXF external tables on HBase.</li>
<li><a
href="https://cwiki.apache.org/confluence/display/Hive/HBaseIntegration">Hive</a>
- Hive HBase integration.</li>
<li><a
href="https://www.cloudera.com/documentation/enterprise/latest/topics/impala_hbase.html">Impala</a>
- Impala support for querying HBase tables.</li>
<li><a
href="https://github.com/apache/hbase-connectors/tree/master/kafka">Kafka</a>
- HBase Kafka proxy.</li>
<li><a
href="https://github.com/apache/pig/tree/trunk/src/org/apache/pig/backend/hadoop/hbase">Pig</a>
- Pig HBase integration.</li>
<li><a
href="https://github.com/analysys/presto-hbase-connector">Presto</a> -
Presto-HBase connector.</li>
<li><a href="http://pulsar.apache.org/docs/en/io-hbase/">Pulsar</a> -
HBase connector for Pulsar.</li>
<li><a
href="https://cwiki.apache.org/confluence/display/RANGER/HBase+Plugin">Ranger</a>
- HBase plugin for Apache Ranger.</li>
<li><a href="https://github.com/hortonworks-spark/shc">Spark</a> -
Spark-HBase connector.</li>
<li><a href="https://projects.spring.io/spring-hadoop/">Spring for
Apache Hadoop</a> - Spring-Hadoop integration, including HBase
support.</li>
<li><a
href="https://github.com/apache/storm/tree/master/external/storm-hbase">Storm</a>
- Storm/Trident integration for HBase.</li>
<li><a
href="https://tajo.apache.org/docs/current/hbase_integration.html">Tajo</a>
- Tajo integration with HBase.</li>
<li><a
href="https://zeppelin.apache.org/docs/0.6.2/interpreter/hbase.html">Zeppelin</a>
- HBase shell interpreter for Apache Zeppelin.</li>
</ul>
<h3 id="tools">Tools</h3>
<ul>
<li><a href="https://ambari.apache.org">Ambari</a> - Software for
provisioning, managing, and monitoring Hadoop/HBase clusters.</li>
<li><a
href="https://www.cloudera.com/products/product-components/cloudera-manager.html">Cloudera
Manager</a> - Tool for managing Hadoop/HBase in production.</li>
<li><a href="http://www.dbschema.com/index.html">DbSchema</a> -
Diagram-oriented database designer with support for HBase.</li>
<li><a href="https://github.com/sentric/hannibal">Hannibal</a> - Tool to
monitor and maintain HBase clusters.</li>
<li><a href="https://github.com/NiceSystems/hrider">h-rider</a> - GUI
for viewing and manipulating data in HBase.</li>
<li><a href="http://gethue.com">Hue</a> - Smart analytics workbench that
includes an HBase browser.</li>
<li><a href="http://sematext.com/spm">Sematext SPM</a> - Tool for <a
href="http://sematext.com/spm/integrations/hbase-monitoring">monitoring
HBase</a>, HDFS, etc.</li>
</ul>
<h3 id="miscellaneous">Miscellaneous</h3>
<ul>
<li><a href="https://github.com/HubSpot/hbase-support">HubSpot HBase
support</a> - Configs and tools for HBase at HubSpot, including Hystrix
integration and coprocessors.</li>
</ul>
<h2 id="resources">Resources</h2>
<h3 id="books">Books</h3>
<ul>
<li><a href="https://www.manning.com/books/hbase-in-action">HBase in
Action</a> - Experience-driven guide that shows you how to use
HBase.</li>
<li><a href="http://shop.oreilly.com/product/0636920014348.do">HBase:
The Definitive Guide</a> - Comprehensive guide to HBase.</li>
<li><a
href="http://shop.oreilly.com/product/0636920035688.do">Architecting
HBase Applications</a> - Includes HBase principles, cluster guidelines,
and in-depth case studies.</li>
<li><a
href="https://www.packtpub.com/big-data-and-business-intelligence/hbase-administration-cookbook">HBase
Administration Cookbook</a> - How to master HBase configuration and
administration.</li>
<li><a
href="https://www.packtpub.com/big-data-and-business-intelligence/hbase-essentials">HBase
Essentials</a> - A practical guide to using HBase.</li>
<li><a
href="https://www.packtpub.com/big-data-and-business-intelligence/hbase-design-patterns">HBase
Design Patterns</a> - Successful patterns to develop scalable
applications with HBase.</li>
<li><a
href="https://www.packtpub.com/big-data-and-business-intelligence/learning-hbase">Learning
HBase</a> - Learn the fundamentals of HBase administration and
development.</li>
<li><a
href="https://www.packtpub.com/big-data-and-business-intelligence/hbase-high-performance-cookbook">HBase
High Performance Cookbook</a> - Exciting projects that teach you how to
use HBase.</li>
<li><a href="http://www.apress.com/us/book/9781484224236">Apache HBase
Primer</a> - A compact guide to HBase essentials.</li>
<li><a href="http://www.apress.com/us/book/9781484223697">Pro Apache
Phoenix</a> - Basics and best practices for using Phoenix.</li>
</ul>
<h3 id="papers">Papers</h3>
<ul>
<li><a
href="https://static.googleusercontent.com/media/research.google.com/en//archive/bigtable-osdi06.pdf">Bigtable:
A Distributed Storage System for Structured Data</a> - The inspiration
for HBase.</li>
<li><a
href="https://pdfs.semanticscholar.org/865a/215390cd49af9e4941e03107120e631dcaa0.pdf">Apache
Hadoop Goes Realtime at Facebook</a> - How Facebook deployed HBase to
production.</li>
</ul>
<h3 id="community">Community</h3>
<ul>
<li><a href="https://blogs.apache.org/hbase/">Blog</a></li>
<li><a href="http://hbase.apache.org/mail-lists.html">Mailing
Lists</a></li>
<li><a href="https://www.reddit.com/r/hbase/">Reddit</a></li>
<li><a href="https://stackoverflow.com/questions/tagged/hbase">Stack
Overflow</a></li>
<li><a href="https://twitter.com/HBase">Twitter</a></li>
</ul>
<h2 id="license">License</h2>
<p xmlns:dct="http://purl.org/dc/terms/"><a rel="license" href="http://creativecommons.org/publicdomain/mark/1.0/">
<img src="https://mirrors.creativecommons.org/presskit/buttons/88x31/svg/publicdomain.svg"
style="border-style: none;" alt="Public Domain Mark" /> </a></p>