Files
2025-07-18 22:22:32 +02:00

20 KiB

Awesome HBase !Awesome (https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg) (https://github.com/sindresorhus/awesome)
 
(http://hbase.apache.org/)
 
A curated list of awesome HBase projects and resources.
 
HBase (http://hbase.apache.org) is a distributed, scalable, big data store.
 
Contents
 
- Projects (#projects)
- **Clients** (#clients)
- **Cloud** (#cloud)
- **Frameworks** (#frameworks)
- **Datasets** (#datasets)
- **Document** (#document)
- **Entity/JPA** (#entityjpa)
- **Geospatial** (#geospatial)
- **Graph** (#graph)
- **SQL/OLAP** (#sqlolap)
- **Time Series** (#time-series)
- **Infrastructure** (#infrastructure)
- **Secondary Indices** (#secondary-indices)
- **Transactions** (#transactions)
- **Integrations** (#integrations)
- **Tools** (#tools)
- **Miscellaneous** (#miscellaneous)
 
- Resources (#resources)
- **Books** (#books)
- **Papers** (#papers)
- **Community** (#community)
 
 
Projects
 
Clients
 
asynchbase (https://github.com/OpenTSDB/asynchbase) - Fully asynchronous, non-blocking HBase client.
gohbase (https://github.com/tsuna/gohbase) - Pure Go client for HBase.
happybase (https://github.com/wbolster/happybase) - Python client for HBase.
 
 
Cloud
 
Amazon EMR (https://aws.amazon.com/emr/) - Amazon's Hadoop/HBase offering on AWS.
Azure HDInsight (https://azure.microsoft.com/en-us/services/hdinsight/) - Microsoft's Hadoop/HBase offering on Azure.
Cloudera Director (https://www.cloudera.com/products/product-components/cloudera-director.html) - Run Hadoop/HBase clusters on AWS, Azure or Google Cloud.
Google Cloud Bigtable (https://cloud.google.com/bigtable/) - High-performance NoSQL database service accessible via HBase client API.
Hortonworks Cloudbreak (https://hortonworks.com/open-source/cloudbreak/) - Provision Hadoop/HBase clusters on AWS, Azure, Google Cloud, or OpenStack.
 
Frameworks
 
Datasets
 
Kite (http://kitesdk.org) - High-level data layer for Hadoop/HBase.
 
Document
 
HDocDB (https://github.com/rayokota/hdocdb) - HBase as a JSON document database.
 
Entity/JPA
 
DataNucleus (http://www.datanucleus.org) - JPA persistence layer with support for HBase.
Gora (http://gora.apache.org) - Persistence library for big data with support for HBase.
HBase ORM (https://github.com/flipkart-incubator/hbase-orm) - A production-grade HBase ORM library.
HEntityDB (https://github.com/rayokota/hentitydb) - HBase as an entity database.
Kundera (https://github.com/impetus-opensource/Kundera) - JPA client with support for HBase.
 
Geospatial
 
GeoMesa (http://www.geomesa.org/) - Spatial-temporal database with support for Accumulo, HBase, Cassandra, and Kafka.
 
Graph
Gradoop (https://github.com/dbs-leipzig/gradoop) - Research framework for scalable graph analytics built on Flink and HBase.
HGraphDB (https://github.com/rayokota/hgraphdb) - HBase as a TinkerPop graph database.
HugeGraph (https://github.com/apache/incubator-hugegraph) - A graph database that supports more than 10+ billion data, high performance and scalability.
JanusGraph (http://janusgraph.org/) - Scalable graph database with support for Cassandra, HBase, Google Cloud Bigtable, and BerkeleyDB.
NebulaGraph (https://github.com/vesoft-inc/nebula) - A high performance distributed Graph database.
S2Graph (http://s2graph.incubator.apache.org) - High-performance distributed graph database built on HBase.
 
SQL/OLAP
 
AntsDB (http://antsdb.com/) - AntsDB is a low latency, high concurrency, MySQL compliant SQL layer for HBase.
EsgynDB (https://esgyn.com/) - Commercial SQL engine providing ACID transactions and BI analytics on top of Hadoop, based on Trafodian.
Kylin (http://kylin.apache.org) - Extreme OLAP engine for big data that stores data in HBase.
LeanXScale (http://www.leanxcale.com) - Commercial full ACID full SQL product built on Hadoop/HBase.
Phoenix (https://phoenix.apache.org) - SQL layer on top of HBase.
Splice Machine (https://www.splicemachine.com) - Commercial RDBMS built on top of HBase.
Trafodian (http://trafodion.apache.org) - Transactional SQL-on-Hadoop/HBase.
 
Time Series
 
Axibase (http://axibase.com/products/axibase-time-series-database/) - Distributed time series database built on HBase.
OpenTSDB (http://opentsdb.net) - Scalable time series database built on HBase.
Warp 10 (http://www.warp10.io) - Time series database for sensor data.
 
Infrastructure
Secondary Indices
 
hindex (https://github.com/Huawei-Hadoop/hindex) - Secondary index for HBase.
Lily HBase Indexer (http://ngdata.github.io/hbase-indexer/) - Quickly and easily search for content stored in HBase.
 
Transactions
 
Haeinsa (https://github.com/VCNC/haeinsa) - Multi-row/multi-table transaction library for HBase.
HBase-QoD (https://github.com/algarecu/hbase-0.94.8-qod) - Vector-field consistency for HBase fine-grained transactional inter-DC replication.
Omid (https://github.com/apache/incubator-omid) - Transactional support for HBase.
Tephra (http://tephra.incubator.apache.org) - Globally consistent transactions on top of HBase.
Themis (https://github.com/XiaoMi/themis) - Cross-row/cross-table transactions on HBase based on Google's Percolator.
 
Integrations
 
Apex (https://github.com/apache/apex-malhar/tree/master/contrib/src/test/java/org/apache/apex/malhar/contrib/hbase) - Apex-HBase connector.
Beam (https://github.com/apache/beam/tree/master/sdks/java/io/hbase) - Beam HBase integration.
Camel (http://camel.apache.org/hbase.html) - Camel HBase component.
Cascading (https://github.com/Cascading/cascading.hbase) - HBase adapters for Cascading.
Cascalog (https://github.com/sorenmacbeth/hbase-cascalog) - Wrapper around Cascading.HBase for use in Cascalog.
Crunch (https://github.com/apache/crunch/tree/master/crunch-hbase) - HBase adapters for Crunch.
Drill (https://drill.apache.org/docs/querying-hbase/) - HBase storage plugin for Drill.
Elasticsearch (https://github.com/mallocator/Elasticsearch-HBase-River) - Elasticsearch import river for HBase.
Flink (https://github.com/apache/flink/tree/master/flink-connectors/flink-connector-hbase-2.2) - Flink-HBase connector.
Gearpump (https://github.com/apache/incubator-gearpump/tree/master/external/hbase) - Gearpump integration for HBase.
Giraph (https://github.com/apache/giraph/tree/trunk/giraph-hbase) - Giraph input and output formats for HBase.
HAWQ (https://hawq.apache.org/docs/userguide/2.3.0.0-incubating/pxf/HBasePXF.html) - HAWQ PXF external tables on HBase.
Hive (https://cwiki.apache.org/confluence/display/Hive/HBaseIntegration) - Hive HBase integration.
Impala (https://www.cloudera.com/documentation/enterprise/latest/topics/impala_hbase.html) - Impala support for querying HBase tables.
Kafka (https://github.com/apache/hbase-connectors/tree/master/kafka) - HBase Kafka proxy.
Pig (https://github.com/apache/pig/tree/trunk/src/org/apache/pig/backend/hadoop/hbase) - Pig HBase integration.
Presto (https://github.com/analysys/presto-hbase-connector) - Presto-HBase connector.
Pulsar (http://pulsar.apache.org/docs/en/io-hbase/) - HBase connector for Pulsar.
Ranger (https://cwiki.apache.org/confluence/display/RANGER/HBase+Plugin) - HBase plugin for Apache Ranger.
Spark (https://github.com/hortonworks-spark/shc) - Spark-HBase connector.
Spring for Apache Hadoop (https://projects.spring.io/spring-hadoop/) - Spring-Hadoop integration, including HBase support.
Storm (https://github.com/apache/storm/tree/master/external/storm-hbase) - Storm/Trident integration for HBase.
Tajo (https://tajo.apache.org/docs/current/hbase_integration.html) - Tajo integration with HBase.
Zeppelin (https://zeppelin.apache.org/docs/0.6.2/interpreter/hbase.html) - HBase shell interpreter for Apache Zeppelin.
 
Tools
 
Ambari (https://ambari.apache.org) - Software for provisioning, managing, and monitor Hadoop/HBase clusters.
Cloudera Manager (https://www.cloudera.com/products/product-components/cloudera-manager.html) - Tool for managing Hadoop/HBase in production.
DbSchema (http://www.dbschema.com/index.html) - Diagram-oriented database designer with support for HBase.
Hannibal (https://github.com/sentric/hannibal) - Tool to monitor and maintain HBase clusters.
h-rider (https://github.com/NiceSystems/hrider) - GUI for viewing and manipulating data in HBase.
Hue (http://gethue.com) - Smart analytics workbench that includes an HBase browser.
Sematext SPM (http://sematext.com/spm) - Tool for monitoring HBase (http://sematext.com/spm/integrations/hbase-monitoring), HDFS, etc.
 
Miscellaneous
 
HubSpot HBase support (https://github.com/HubSpot/hbase-support) - Configs and tools for HBase at HubSpot, including Hystrix integration and coprocessors.
 
Resources
 
Books
 
HBase in Action (https://www.manning.com/books/hbase-in-action) - Experience-driven guide that shows you how to use HBase.
HBase: The Definitive Guide (http://shop.oreilly.com/product/0636920014348.do) - Comprehensive guide to HBase.
Architecting HBase Applications (http://shop.oreilly.com/product/0636920035688.do) - Includes HBase principles, cluster guidelines, and in-depth case studies.
HBase Administration Cookbook (https://www.packtpub.com/big-data-and-business-intelligence/hbase-administration-cookbook) - How to master HBase configuration and administration.
HBase Essentials (https://www.packtpub.com/big-data-and-business-intelligence/hbase-essentials) - A practical guide to using HBase.
HBase Design Patterns (https://www.packtpub.com/big-data-and-business-intelligence/hbase-design-patterns) - Successful patterns to develop scalable applications with HBase.
Learning HBase (https://www.packtpub.com/big-data-and-business-intelligence/learning-hbase) - Learn the fundamentals of HBase administration and development.
HBase High Performance Cookbook (https://www.packtpub.com/big-data-and-business-intelligence/hbase-high-performance-cookbook) - Exciting projects that teach you how to use HBase.
Apache HBase Primer (http://www.apress.com/us/book/9781484224236) - A compact guide to HBase essentials.
Pro Apache Phoenix (http://www.apress.com/us/book/9781484223697) - Basic and best practices for using Phoenix.
 
Papers
 
Bigtable: A Distributed Storage System for Structured Data (https://static.googleusercontent.com/media/research.google.com/en//archive/bigtable-osdi06.pdf) - The inspiration for HBase.
Apache Hadoop Goes Realtime at Facebook (https://pdfs.semanticscholar.org/865a/215390cd49af9e4941e03107120e631dcaa0.pdf) - How Facebook deployed HBase to production.
 
Community
 
Blog (https://blogs.apache.org/hbase/)
Mailing Lists (http://hbase.apache.org/mail-lists.html)
Reddit (https://www.reddit.com/r/hbase/)
Stack Overflow (https://stackoverflow.com/questions/tagged/hbase)
Twitter (https://twitter.com/HBase)
 
License
 
 
 
<img src="https://mirrors.creativecommons.org/presskit/buttons/88x31/svg/publicdomain.svg"
style="border-style: none;" alt="Public Domain Mark" />
 
 
hbase Github: https://github.com/rayokota/awesome-hbase