Awesome HBase !Awesome (https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg) (https://github.com/sindresorhus/awesome)  (http://hbase.apache.org/) A curated list of awesome HBase projects and resources. HBase (http://hbase.apache.org) is a distributed, scalable, big data store. Contents - Projects (#projects) - **Clients** (#clients)  - **Cloud** (#cloud)  - **Frameworks** (#frameworks)   - **Datasets** (#datasets)   - **Document** (#document)   - **Entity/JPA** (#entityjpa)   - **Geospatial** (#geospatial)   - **Graph** (#graph)   - **SQL/OLAP** (#sqlolap)   - **Time Series** (#time-series)  - **Infrastructure** (#infrastructure)   - **Secondary Indices** (#secondary-indices)  - **Transactions** (#transactions)  - **Integrations** (#integrations)  - **Tools** (#tools)  - **Miscellaneous** (#miscellaneous)  - Resources (#resources) - **Books** (#books)  - **Papers** (#papers)  - **Community** (#community)  Projects Clients ⟡ asynchbase (https://github.com/OpenTSDB/asynchbase) - Fully asynchronous, non-blocking HBase client. ⟡ gohbase (https://github.com/tsuna/gohbase) - Pure Go client for HBase. ⟡ happybase (https://github.com/wbolster/happybase) - Python client for HBase. Cloud ⟡ Amazon EMR (https://aws.amazon.com/emr/) - Amazon's Hadoop/HBase offering on AWS. ⟡ Azure HDInsight (https://azure.microsoft.com/en-us/services/hdinsight/) - Microsoft's Hadoop/HBase offering on Azure. ⟡ Cloudera Director (https://www.cloudera.com/products/product-components/cloudera-director.html) - Run Hadoop/HBase clusters on AWS, Azure or Google Cloud. ⟡ Google Cloud Bigtable (https://cloud.google.com/bigtable/) - High-performance NoSQL database service accessible via HBase client API. ⟡ Hortonworks Cloudbreak (https://hortonworks.com/open-source/cloudbreak/) - Provision Hadoop/HBase clusters on AWS, Azure, Google Cloud, or OpenStack. Frameworks Datasets ⟡ Kite (http://kitesdk.org) - High-level data layer for Hadoop/HBase. Document ⟡ HDocDB (https://github.com/rayokota/hdocdb) - HBase as a JSON document database. Entity/JPA ⟡ DataNucleus (http://www.datanucleus.org) - JPA persistence layer with support for HBase. ⟡ Gora (http://gora.apache.org) - Persistence library for big data with support for HBase. ⟡ HBase ORM (https://github.com/flipkart-incubator/hbase-orm) - A production-grade HBase ORM library. ⟡ HEntityDB (https://github.com/rayokota/hentitydb) - HBase as an entity database. ⟡ Kundera (https://github.com/impetus-opensource/Kundera) - JPA client with support for HBase. Geospatial ⟡ GeoMesa (http://www.geomesa.org/) - Spatial-temporal database with support for Accumulo, HBase, Cassandra, and Kafka. Graph ⟡ Gradoop (https://github.com/dbs-leipzig/gradoop) - Research framework for scalable graph analytics built on Flink and HBase. ⟡ HGraphDB (https://github.com/rayokota/hgraphdb) - HBase as a TinkerPop graph database. ⟡ HugeGraph (https://github.com/apache/incubator-hugegraph) - A graph database that supports more than 10+ billion data, high performance and scalability. ⟡ JanusGraph (http://janusgraph.org/) - Scalable graph database with support for Cassandra, HBase, Google Cloud Bigtable, and BerkeleyDB. ⟡ NebulaGraph (https://github.com/vesoft-inc/nebula) - A high performance distributed Graph database. ⟡ S2Graph (http://s2graph.incubator.apache.org) - High-performance distributed graph database built on HBase. SQL/OLAP ⟡ AntsDB (http://antsdb.com/) - AntsDB is a low latency, high concurrency, MySQL compliant SQL layer for HBase. ⟡ EsgynDB (https://esgyn.com/) - Commercial SQL engine providing ACID transactions and BI analytics on top of Hadoop, based on Trafodian. ⟡ Kylin (http://kylin.apache.org) - Extreme OLAP engine for big data that stores data in HBase. ⟡ LeanXScale (http://www.leanxcale.com) - Commercial full ACID full SQL product built on Hadoop/HBase. ⟡ Phoenix (https://phoenix.apache.org) - SQL layer on top of HBase. ⟡ Splice Machine (https://www.splicemachine.com) - Commercial RDBMS built on top of HBase. ⟡ Trafodian (http://trafodion.apache.org) - Transactional SQL-on-Hadoop/HBase. Time Series ⟡ Axibase (http://axibase.com/products/axibase-time-series-database/) - Distributed time series database built on HBase. ⟡ OpenTSDB (http://opentsdb.net) - Scalable time series database built on HBase. ⟡ Warp 10 (http://www.warp10.io) - Time series database for sensor data. Infrastructure Secondary Indices ⟡ hindex (https://github.com/Huawei-Hadoop/hindex) - Secondary index for HBase. ⟡ Lily HBase Indexer (http://ngdata.github.io/hbase-indexer/) - Quickly and easily search for content stored in HBase. Transactions ⟡ Haeinsa (https://github.com/VCNC/haeinsa) - Multi-row/multi-table transaction library for HBase. ⟡ HBase-QoD (https://github.com/algarecu/hbase-0.94.8-qod) - Vector-field consistency for HBase fine-grained transactional inter-DC replication. ⟡ Omid (https://github.com/apache/incubator-omid) - Transactional support for HBase. ⟡ Tephra (http://tephra.incubator.apache.org) - Globally consistent transactions on top of HBase. ⟡ Themis (https://github.com/XiaoMi/themis) - Cross-row/cross-table transactions on HBase based on Google's Percolator. Integrations ⟡ Apex (https://github.com/apache/apex-malhar/tree/master/contrib/src/test/java/org/apache/apex/malhar/contrib/hbase) - Apex-HBase connector. ⟡ Beam (https://github.com/apache/beam/tree/master/sdks/java/io/hbase) - Beam HBase integration. ⟡ Camel (http://camel.apache.org/hbase.html) - Camel HBase component. ⟡ Cascading (https://github.com/Cascading/cascading.hbase) - HBase adapters for Cascading. ⟡ Cascalog (https://github.com/sorenmacbeth/hbase-cascalog) - Wrapper around Cascading.HBase for use in Cascalog. ⟡ Crunch (https://github.com/apache/crunch/tree/master/crunch-hbase) - HBase adapters for Crunch. ⟡ Drill (https://drill.apache.org/docs/querying-hbase/) - HBase storage plugin for Drill. ⟡ Elasticsearch (https://github.com/mallocator/Elasticsearch-HBase-River) - Elasticsearch import river for HBase. ⟡ Flink (https://github.com/apache/flink/tree/master/flink-connectors/flink-connector-hbase-2.2) - Flink-HBase connector. ⟡ Gearpump (https://github.com/apache/incubator-gearpump/tree/master/external/hbase) - Gearpump integration for HBase. ⟡ Giraph (https://github.com/apache/giraph/tree/trunk/giraph-hbase) - Giraph input and output formats for HBase. ⟡ HAWQ (https://hawq.apache.org/docs/userguide/2.3.0.0-incubating/pxf/HBasePXF.html) - HAWQ PXF external tables on HBase. ⟡ Hive (https://cwiki.apache.org/confluence/display/Hive/HBaseIntegration) - Hive HBase integration. ⟡ Impala (https://www.cloudera.com/documentation/enterprise/latest/topics/impala_hbase.html) - Impala support for querying HBase tables. ⟡ Kafka (https://github.com/apache/hbase-connectors/tree/master/kafka) - HBase Kafka proxy. ⟡ Pig (https://github.com/apache/pig/tree/trunk/src/org/apache/pig/backend/hadoop/hbase) - Pig HBase integration. ⟡ Presto (https://github.com/analysys/presto-hbase-connector) - Presto-HBase connector. ⟡ Pulsar (http://pulsar.apache.org/docs/en/io-hbase/) - HBase connector for Pulsar. ⟡ Ranger (https://cwiki.apache.org/confluence/display/RANGER/HBase+Plugin) - HBase plugin for Apache Ranger. ⟡ Spark (https://github.com/hortonworks-spark/shc) - Spark-HBase connector. ⟡ Spring for Apache Hadoop (https://projects.spring.io/spring-hadoop/) - Spring-Hadoop integration, including HBase support. ⟡ Storm (https://github.com/apache/storm/tree/master/external/storm-hbase) - Storm/Trident integration for HBase. ⟡ Tajo (https://tajo.apache.org/docs/current/hbase_integration.html) - Tajo integration with HBase. ⟡ Zeppelin (https://zeppelin.apache.org/docs/0.6.2/interpreter/hbase.html) - HBase shell interpreter for Apache Zeppelin. Tools ⟡ Ambari (https://ambari.apache.org) - Software for provisioning, managing, and monitor Hadoop/HBase clusters. ⟡ Cloudera Manager (https://www.cloudera.com/products/product-components/cloudera-manager.html) - Tool for managing Hadoop/HBase in production. ⟡ DbSchema (http://www.dbschema.com/index.html) - Diagram-oriented database designer with support for HBase. ⟡ Hannibal (https://github.com/sentric/hannibal) - Tool to monitor and maintain HBase clusters. ⟡ h-rider (https://github.com/NiceSystems/hrider) - GUI for viewing and manipulating data in HBase. ⟡ Hue (http://gethue.com) - Smart analytics workbench that includes an HBase browser. ⟡ Sematext SPM (http://sematext.com/spm) - Tool for monitoring HBase (http://sematext.com/spm/integrations/hbase-monitoring), HDFS, etc. Miscellaneous ⟡ HubSpot HBase support (https://github.com/HubSpot/hbase-support) - Configs and tools for HBase at HubSpot, including Hystrix integration and coprocessors. Resources Books ⟡ HBase in Action (https://www.manning.com/books/hbase-in-action) - Experience-driven guide that shows you how to use HBase. ⟡ HBase: The Definitive Guide (http://shop.oreilly.com/product/0636920014348.do) - Comprehensive guide to HBase. ⟡ Architecting HBase Applications (http://shop.oreilly.com/product/0636920035688.do) - Includes HBase principles, cluster guidelines, and in-depth case studies. ⟡ HBase Administration Cookbook (https://www.packtpub.com/big-data-and-business-intelligence/hbase-administration-cookbook) - How to master HBase configuration and administration. ⟡ HBase Essentials (https://www.packtpub.com/big-data-and-business-intelligence/hbase-essentials) - A practical guide to using HBase. ⟡ HBase Design Patterns (https://www.packtpub.com/big-data-and-business-intelligence/hbase-design-patterns) - Successful patterns to develop scalable applications with HBase. ⟡ Learning HBase (https://www.packtpub.com/big-data-and-business-intelligence/learning-hbase) - Learn the fundamentals of HBase administration and development. ⟡ HBase High Performance Cookbook (https://www.packtpub.com/big-data-and-business-intelligence/hbase-high-performance-cookbook) - Exciting projects that teach you how to use HBase. ⟡ Apache HBase Primer (http://www.apress.com/us/book/9781484224236) - A compact guide to HBase essentials. ⟡ Pro Apache Phoenix (http://www.apress.com/us/book/9781484223697) - Basic and best practices for using Phoenix. Papers ⟡ Bigtable: A Distributed Storage System for Structured Data (https://static.googleusercontent.com/media/research.google.com/en//archive/bigtable-osdi06.pdf) - The inspiration for HBase. ⟡ Apache Hadoop Goes Realtime at Facebook (https://pdfs.semanticscholar.org/865a/215390cd49af9e4941e03107120e631dcaa0.pdf) - How Facebook deployed HBase to production. Community ⟡ Blog (https://blogs.apache.org/hbase/) ⟡ Mailing Lists (http://hbase.apache.org/mail-lists.html) ⟡ Reddit (https://www.reddit.com/r/hbase/) ⟡ Stack Overflow (https://stackoverflow.com/questions/tagged/hbase) ⟡ Twitter (https://twitter.com/HBase) License Public Domain Mark