Update render script and Makefile
288
terminal/hadoop2
@@ -1,288 +0,0 @@
Awesome Hadoop !Awesome (https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg) (https://github.com/sindresorhus/awesome)

A curated list of amazingly awesome Hadoop and Hadoop ecosystem resources. Inspired by Awesome PHP (https://github.com/ziadoz/awesome-php), Awesome Python (https://github.com/vinta/awesome-python) and Awesome Sysadmin (https://github.com/kahun/awesome-sysadmin)

- Awesome Hadoop (#awesome-hadoop)
    - **Hadoop** (#hadoop)
    - **YARN** (#yarn)
    - **NoSQL** (#nosql)
    - **SQL on Hadoop** (#sql-on-hadoop)
    - **Data Management** (#data-management)
    - **Workflow, Lifecycle and Governance** (#workflow-lifecycle-and-governance)
    - **Data Ingestion and Integration** (#data-ingestion-and-integration)
    - **DSL** (#dsl)
    - **Libraries and Tools** (#libraries-and-tools)
    - **Realtime Data Processing** (#realtime-data-processing)
    - **Distributed Computing and Programming** (#distributed-computing-and-programming)
    - **Packaging, Provisioning and Monitoring** (#packaging-provisioning-and-monitoring)
    - **Monitoring** (#monitoring)
    - **Search** (#search)
    - **Security** (#security)
    - **Benchmark** (#benchmark)
    - **Machine learning and Big Data analytics** (#machine-learning-and-big-data-analytics)
    - **Misc.** (#misc)
- Resources (#resources)
    - **Websites** (#websites)
    - **Presentations** (#presentations)
    - **Books** (#books)
    - **Hadoop and Big Data Events** (#hadoop-and-big-data-events)
- Other Awesome Lists (#other-awesome-lists)

Hadoop

⟡ Apache Hadoop (http://hadoop.apache.org/) - Apache Hadoop
⟡ Apache Hadoop Ozone (http://hadoop.apache.org/ozone/) - An Object Store for Apache Hadoop
⟡ Apache Tez (http://tez.apache.org/) - A Framework for YARN-based, Data Processing Applications In Hadoop
⟡ SpatialHadoop (http://spatialhadoop.cs.umn.edu/) - SpatialHadoop is a MapReduce extension to Apache Hadoop designed specially to work with spatial data.
⟡ GIS Tools for Hadoop (http://esri.github.io/gis-tools-for-hadoop/) - Big Data Spatial Analytics for the Hadoop Framework
⟡ Elasticsearch Hadoop (https://github.com/elastic/elasticsearch-hadoop) - Elasticsearch real-time search and analytics natively integrated with Hadoop. Supports Map/Reduce, Cascading, Apache Hive and Apache Pig.
⟡ hadoopy (https://github.com/bwhite/hadoopy) - Python MapReduce library written in Cython.
⟡ mrjob (https://github.com/Yelp/mrjob/) - mrjob is a Python 2.5+ package that helps you write and run Hadoop Streaming jobs.
⟡ pydoop (http://pydoop.sourceforge.net/) - Pydoop is a package that provides a Python API for Hadoop.
⟡ hdfs-du (https://github.com/twitter/hdfs-du) - HDFS-DU is an interactive visualization of the Hadoop distributed file system.
⟡ White Elephant (https://github.com/linkedin/white-elephant) - Hadoop log aggregator and dashboard
⟡ Genie (https://github.com/Netflix/genie) - Genie provides REST-ful APIs to run Hadoop, Hive and Pig jobs, and to manage multiple Hadoop resources and perform job submissions across them.
⟡ Apache Kylin (http://kylin.incubator.apache.org/) - Apache Kylin is an open source Distributed Analytics Engine from eBay Inc. that provides SQL interface and multi-dimensional analysis (OLAP) on Hadoop supporting extremely large datasets
⟡ Crunch (https://github.com/jondot/crunch) - Go-based toolkit for ETL and feature extraction on Hadoop
⟡ Apache Ignite (http://ignite.apache.org/) - Distributed in-memory platform

YARN

⟡ Apache Slider (http://slider.incubator.apache.org/) - Apache Slider is a project in incubation at the Apache Software Foundation with the goal of making it possible and easy to deploy existing applications onto a YARN cluster.
⟡ Apache Twill (http://twill.incubator.apache.org/) - Apache Twill is an abstraction over Apache Hadoop® YARN that reduces the complexity of developing distributed applications, allowing developers to focus more on their application logic.
⟡ mpich2-yarn (https://github.com/alibaba/mpich2-yarn) - Running MPICH2 on Yarn

NoSQL
Next Generation Databases mostly addressing some of the points: being non-relational, distributed, open-source and horizontally scalable.

⟡ Apache HBase (http://hbase.apache.org) - Apache HBase
⟡ Apache Phoenix (http://phoenix.apache.org/) - A SQL skin over HBase supporting secondary indices
⟡ happybase (https://github.com/wbolster/happybase) - A developer-friendly Python library to interact with Apache HBase.
⟡ Hannibal (https://github.com/sentric/hannibal) - Hannibal is a tool to help monitor and maintain HBase-Clusters that are configured for manual splitting.
⟡ Haeinsa (https://github.com/VCNC/haeinsa) - Haeinsa is a linearly scalable multi-row, multi-table transaction library for HBase
⟡ hindex (https://github.com/Huawei-Hadoop/hindex) - Secondary Index for HBase
⟡ Apache Accumulo (https://accumulo.apache.org/) - The Apache Accumulo™ sorted, distributed key/value store is a robust, scalable, high performance data storage and retrieval system.
⟡ OpenTSDB (http://opentsdb.net/) - The Scalable Time Series Database
⟡ Apache Cassandra (http://cassandra.apache.org/)

SQL on Hadoop
SQL on Hadoop

⟡ Apache Hive (http://hive.apache.org) - The Apache Hive data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL
⟡ Apache Phoenix (http://phoenix.apache.org) - A SQL skin over HBase supporting secondary indices
⟡ Apache HAWQ (incubating) (http://hawq.incubator.apache.org/) - Apache HAWQ is a Hadoop native SQL query engine that combines the key technological advantages of MPP database with the scalability and convenience of Hadoop
⟡ Lingual (http://www.cascading.org/projects/lingual/) - SQL interface for Cascading (MR/Tez job generator)
⟡ Apache Impala (https://impala.apache.org/) - Apache Impala is an open source massively parallel processing (MPP) SQL query engine for data stored in a computer cluster running Apache Hadoop. Impala has been described as the open-source equivalent of Google F1, which inspired its development in 2012.
⟡ Presto (https://prestodb.io/) - Distributed SQL Query Engine for Big Data. Open sourced by Facebook.
⟡ Apache Tajo (http://tajo.apache.org/) - Data warehouse system for Apache Hadoop
⟡ Apache Drill (https://drill.apache.org/) - Schema-free SQL Query Engine
⟡ Apache Trafodion (http://trafodion.apache.org/)

Data Management

⟡ Apache Calcite (http://calcite.apache.org/) - A Dynamic Data Management Framework
⟡ Apache Atlas (http://atlas.incubator.apache.org/) - Metadata tagging & lineage capture supporting complex business data taxonomies
⟡ Apache Kudu (https://kudu.apache.org/) - Kudu provides a combination of fast inserts/updates and efficient columnar scans to enable multiple real-time analytic workloads across a single storage layer, complementing HDFS and Apache HBase.
⟡ Confluent Schema registry for Kafka (https://github.com/confluentinc/schema-registry) - Schema Registry provides a serving layer for your metadata. It provides a RESTful interface for storing and retrieving Avro schemas.
⟡ Hortonworks Schema Registry (https://github.com/hortonworks/registry) - Schema Registry is a framework to build metadata repositories.

Workflow, Lifecycle and Governance

⟡ Apache Oozie (http://oozie.apache.org) - Apache Oozie
⟡ Azkaban (http://azkaban.github.io/)
⟡ Apache Falcon (http://falcon.apache.org/) - Data management and processing platform
⟡ Apache NiFi (http://nifi.apache.org/) - A dataflow system
⟡ Apache Airflow (https://github.com/apache/incubator-airflow) - Airflow is a workflow automation and scheduling system that can be used to author and manage data pipelines
⟡ Luigi (http://luigi.readthedocs.org/en/latest/) - Python package that helps you build complex pipelines of batch jobs

Data Ingestion and Integration

⟡ Apache Flume (http://flume.apache.org) - Apache Flume
⟡ Suro (https://github.com/Netflix/suro) - Netflix's distributed Data Pipeline
⟡ Apache Sqoop (http://sqoop.apache.org) - Apache Sqoop
⟡ Apache Kafka (http://kafka.apache.org/) - Apache Kafka
⟡ Gobblin from LinkedIn (https://github.com/linkedin/gobblin) - Universal data ingestion framework for Hadoop

DSL

⟡ Apache Pig (http://pig.apache.org) - Apache Pig
⟡ Apache DataFu (http://datafu.incubator.apache.org/) - A collection of libraries for working with large-scale data in Hadoop
⟡ varaha (https://github.com/thedatachef/varaha) - Machine learning and natural language processing with Apache Pig
⟡ packetpig (https://github.com/packetloop/packetpig) - Open Source Big Data Security Analytics
⟡ akela (https://github.com/mozilla-metrics/akela) - Mozilla's utility library for Hadoop, HBase, Pig, etc.
⟡ seqpig (http://seqpig.sourceforge.net/) - Simple and scalable scripting for large sequencing data sets (e.g. bioinformatics) in Hadoop
⟡ Lipstick (https://github.com/Netflix/Lipstick) - Pig workflow visualization tool. Introducing Lipstick on A(pache) Pig (http://techblog.netflix.com/2013/06/introducing-lipstick-on-apache-pig.html)
⟡ PigPen (https://github.com/Netflix/PigPen) - PigPen is map-reduce for Clojure, or distributed Clojure. It compiles to Apache Pig, but you don't need to know much about Pig to use it.

Libraries and Tools

⟡ Kite Software Development Kit (http://kitesdk.org/) - A set of libraries, tools, examples, and documentation
⟡ gohadoop (https://github.com/hortonworks/gohadoop) - Native go clients for Apache Hadoop YARN.
⟡ Hue (http://gethue.com/) - A Web interface for analyzing data with Apache Hadoop.
⟡ Apache Zeppelin (https://zeppelin.incubator.apache.org/) - A web-based notebook that enables interactive data analytics
⟡ Apache Thrift (http://thrift.apache.org/)
⟡ Apache Avro (http://avro.apache.org/) - Apache Avro is a data serialization system.
⟡ Elephant Bird (https://github.com/twitter/elephant-bird) - Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
⟡ Spring for Apache Hadoop (http://projects.spring.io/spring-hadoop/)
⟡ hdfs - A native go client for HDFS (https://github.com/colinmarc/hdfs)
⟡ Oozie Eclipse Plugin (https://marketplace.eclipse.org/content/oozie-eclipse-plugin) - A graphical editor for editing Apache Oozie workflows inside Eclipse.
⟡ snakebite (https://pypi.python.org/pypi/snakebite/) - A pure python HDFS client
⟡ Apache Parquet (https://parquet.apache.org/) - Apache Parquet is a columnar storage format available to any project in the Hadoop ecosystem, regardless of the choice of data processing framework, data model or programming language.
⟡ Apache Superset (incubating) (https://superset.incubator.apache.org/) - Apache Superset (incubating) is a modern, enterprise-ready business intelligence web application
⟡ Schema Registry UI (https://github.com/Landoop/schema-registry-ui) - Web tool for the Confluent Schema Registry in order to create / view / search / evolve / view history & configure Avro schemas of your Kafka cluster.

Realtime Data Processing

⟡ Apache Storm (http://storm.apache.org/)
⟡ Apache Samza (http://samza.apache.org/)
⟡ Apache Spark (http://spark.apache.org/streaming/)
⟡ Apache Flink (https://flink.apache.org) - Apache Flink is a platform for efficient, distributed, general-purpose data processing. It supports exactly once stream processing.
⟡ Apache Pulsar (incubating) (http://pulsar.incubator.apache.org/) - Apache Pulsar (incubating) is a highly scalable, low latency messaging platform running on commodity hardware. It provides simple pub-sub semantics over topics, guaranteed at-least-once delivery of messages, automatic cursor management for subscribers, and cross-datacenter replication.
⟡ Apache Druid (incubating) (http://druid.incubator.apache.org/) - A high-performance, column-oriented, distributed data store.

Distributed Computing and Programming

⟡ Apache Spark (http://spark.apache.org/)
  ⟡ Spark Packages (http://spark-packages.org/) - A community index of packages for Apache Spark
  ⟡ SparkHub (https://sparkhub.databricks.com/) - A community site for Apache Spark
⟡ Apache Crunch (http://crunch.apache.org)
⟡ Cascading (http://www.cascading.org/) - Cascading is the proven application development platform for building data applications on Hadoop.
⟡ Apache Flink (http://flink.apache.org/) - Apache Flink is a platform for efficient, distributed, general-purpose data processing.
⟡ Apache Apex (incubating) (http://apex.incubator.apache.org/) - Enterprise-grade unified stream and batch processing engine.
⟡ Apache Livy (incubating) (https://livy.incubator.apache.org/) - Apache Livy (incubating) is a web service that exposes a REST interface for managing long running Apache Spark contexts in your cluster. With Livy, new applications can be built on top of Apache Spark that require fine grained interaction with many Spark contexts.

Packaging, Provisioning and Monitoring

⟡ Apache Bigtop (http://bigtop.apache.org/) - Apache Bigtop: Packaging and tests of the Apache Hadoop ecosystem
⟡ Apache Ambari (http://ambari.apache.org/) - Apache Ambari
⟡ Ganglia Monitoring System (http://ganglia.sourceforge.net/)
⟡ ankush (https://github.com/impetus-opensource/ankush) - A big data cluster management tool that creates and manages clusters of different technologies.
⟡ Apache Zookeeper (http://zookeeper.apache.org/) - Apache Zookeeper
⟡ Apache Curator (http://curator.apache.org/) - ZooKeeper client wrapper and rich ZooKeeper framework
⟡ inviso (https://github.com/Netflix/inviso) - Inviso is a lightweight tool that provides the ability to search for Hadoop jobs, visualize the performance, and view cluster utilization.
⟡ Logit.io (https://logit.io/) - Send logs from Hadoop to Elasticsearch for monitoring and alerting.

Search

⟡ ElasticSearch (https://www.elastic.co/)
⟡ Apache Solr (http://lucene.apache.org/solr/) - Apache Solr is an open source search platform built upon a Java library called Lucene.
⟡ Banana (https://github.com/LucidWorks/banana) - Kibana port for Apache Solr

Search Engine Framework

⟡ Apache Nutch (http://nutch.apache.org/) - Apache Nutch is a highly extensible and scalable open source web crawler software project.

Security

⟡ Apache Ranger (http://ranger.incubator.apache.org/) - Ranger is a framework to enable, monitor and manage comprehensive data security across the Hadoop platform.
⟡ Apache Sentry (https://sentry.incubator.apache.org/) - An authorization module for Hadoop
⟡ Apache Knox Gateway (https://knox.apache.org/) - A REST API Gateway for interacting with Hadoop clusters.
|
||||
[38;2;255;187;0m[4mBenchmark[0m
|
||||
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mBig Data Benchmark[0m[38;5;12m (https://amplab.cs.berkeley.edu/benchmark/)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mHiBench[0m[38;5;12m (https://github.com/intel-hadoop/HiBench)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mYCSB[0m[38;5;12m [39m[38;5;12m(https://github.com/brianfrankcooper/YCSB)[39m[38;5;12m [39m[38;5;12m-[39m[38;5;12m [39m[38;5;12mThe[39m[38;5;12m [39m[38;5;12mYahoo![39m[38;5;12m [39m[38;5;12mCloud[39m[38;5;12m [39m[38;5;12mServing[39m[38;5;12m [39m[38;5;12mBenchmark[39m[38;5;12m [39m[38;5;12m(YCSB)[39m[38;5;12m [39m[38;5;12mis[39m[38;5;12m [39m[38;5;12man[39m[38;5;12m [39m[38;5;12mopen-source[39m[38;5;12m [39m[38;5;12mspecification[39m[38;5;12m [39m[38;5;12mand[39m[38;5;12m [39m[38;5;12mprogram[39m[38;5;12m [39m[38;5;12msuite[39m[38;5;12m [39m[38;5;12mfor[39m[38;5;12m [39m[38;5;12mevaluating[39m[38;5;12m [39m[38;5;12mretrieval[39m[38;5;12m [39m[38;5;12mand[39m[38;5;12m [39m[38;5;12mmaintenance[39m[38;5;12m [39m[38;5;12mcapabilities[39m[38;5;12m [39m[38;5;12mof[39m[38;5;12m [39m[38;5;12mcomputer[39m[38;5;12m [39m
|
||||
[38;5;12mprograms.[39m[38;5;12m [39m[38;5;12mIt[39m[38;5;12m [39m[38;5;12mis[39m[38;5;12m [39m[38;5;12moften[39m[38;5;12m [39m[38;5;12mused[39m[38;5;12m [39m[38;5;12mto[39m[38;5;12m [39m[38;5;12mcompare[39m[38;5;12m [39m[38;5;12mrelative[39m[38;5;12m [39m[38;5;12mperformance[39m[38;5;12m [39m[38;5;12mof[39m[38;5;12m [39m[38;5;12mNoSQL[39m[38;5;12m [39m[38;5;12mdatabase[39m[38;5;12m [39m[38;5;12mmanagement[39m[38;5;12m [39m[38;5;12msystems.[39m
|
||||
|
||||
[38;2;255;187;0m[4mMachine learning and Big Data analytics[0m
|
||||
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mApache Mahout[0m[38;5;12m (http://mahout.apache.org)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mOryx 2[0m[38;5;12m (https://github.com/OryxProject/oryx) - Lambda architecture on Spark, Kafka for real-time large scale machine learning[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mMLlib[0m[38;5;12m (https://spark.apache.org/mllib/) - MLlib is Apache Spark's scalable machine learning library.[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mR[0m[38;5;12m (http://www.r-project.org/) - R is a free software environment for statistical computing and graphics.[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mRHadoop[0m[38;5;12m (https://github.com/RevolutionAnalytics/RHadoop/wiki) including RHDFS, RHBase, RMR2, plyrmr[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mApache Lens[0m[38;5;12m (http://lens.apache.org/)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mApache SINGA (incubating)[0m[38;5;12m (https://singa.incubator.apache.org/) - SINGA is a general distributed deep learning platform for training big deep learning models over large datasets[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mBigDL[0m[38;5;12m [39m[38;5;12m(https://bigdl-project.github.io/)[39m[38;5;12m [39m[38;5;12m-[39m[38;5;12m [39m[38;5;12mBigDL[39m[38;5;12m [39m[38;5;12mis[39m[38;5;12m [39m[38;5;12ma[39m[38;5;12m [39m[38;5;12mdistributed[39m[38;5;12m [39m[38;5;12mdeep[39m[38;5;12m [39m[38;5;12mlearning[39m[38;5;12m [39m[38;5;12mlibrary[39m[38;5;12m [39m[38;5;12mfor[39m[38;5;12m [39m[38;5;12mApache[39m[38;5;12m [39m[38;5;12mSpark;[39m[38;5;12m [39m[38;5;12mwith[39m[38;5;12m [39m[38;5;12mBigDL,[39m[38;5;12m [39m[38;5;12musers[39m[38;5;12m [39m[38;5;12mcan[39m[38;5;12m [39m[38;5;12mwrite[39m[38;5;12m [39m[38;5;12mtheir[39m[38;5;12m [39m[38;5;12mdeep[39m[38;5;12m [39m[38;5;12mlearning[39m[38;5;12m [39m[38;5;12mapplications[39m[38;5;12m [39m[38;5;12mas[39m[38;5;12m [39m[38;5;12mstandard[39m[38;5;12m [39m[38;5;12mSpark[39m[38;5;12m [39m[38;5;12mprograms,[39m[38;5;12m [39m[38;5;12mwhich[39m[38;5;12m [39m[38;5;12mcan[39m[38;5;12m [39m
|
||||
[38;5;12mdirectly[39m[38;5;12m [39m[38;5;12mrun[39m[38;5;12m [39m[38;5;12mon[39m[38;5;12m [39m[38;5;12mtop[39m[38;5;12m [39m[38;5;12mof[39m[38;5;12m [39m[38;5;12mexisting[39m[38;5;12m [39m[38;5;12mSpark[39m[38;5;12m [39m[38;5;12mor[39m[38;5;12m [39m[38;5;12mHadoop[39m[38;5;12m [39m[38;5;12mclusters.[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mApache Hivemall (incubating)[0m[38;5;12m (http://hivemall.incubator.apache.org/) - Apache Hivemall is a scalable machine learning library that runs on Apache Hive, Spark and Pig.[39m
|
||||
|
||||
[38;2;255;187;0m[4mMisc.[0m
|
||||
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;12mHive Plugins[39m
|
||||
[38;5;12m [39m[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;12mUDF[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/edwardcapriolo/hive_cassandra_udfs[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/livingsocial/HiveSwarm[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/ThinkBigAnalytics/Hive-Extensions-from-Think-Big-Analytics[49m[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/twitter/elephant-bird - Twitter[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/lovelysystems/ls-hive[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/klout/brickhouse[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[38;5;12m [39m[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;12mStorage Handler[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/dvasilen/Hive-Cassandra[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/yc-huang/Hive-mongo[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/balshor/gdata-storagehandler[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/chimpler/hive-solr[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/bfemiano/accumulo-hive-storage-manager[49m[39m
|
||||
[38;5;12m [39m[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;12mLibraries and tools[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/forward3d/rbhive[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/synctree/activerecord-hive-adapter[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/hrp/sequel-hive-adapter[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/forward/node-hive[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/recruitcojp/WebHive[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * **shib** (https://github.com/tagomoris/shib) - WebUI for query engines: Hive and Presto[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/dmorel/Thrift-API-HiveClient2 (Perl - HiveServer2)[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * **PyHive** (https://github.com/dropbox/PyHive) - Python interface to Hive and Presto[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * https://github.com/recruitcojp/OdbcHive[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * **HiveRunner** (https://github.com/klarna/HiveRunner) - An Open Source unit test framework for hadoop hive queries based on JUnit4[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;235m[38;5;249m * **Beetest** (https://github.com/kawaa/Beetest) - A super simple utility for testing Apache Hive scripts locally for non-Java developers.[49m[39m
|
||||
[48;5;235m[38;5;249m * **Hive_test** (https://github.com/edwardcapriolo/hive_test)- Unit test framework for hive and hive-service[49m[39m[48;5;235m[38;5;249m [49m[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;12mFlume Plugins[39m
|
||||
[38;5;12m [39m[38;5;12m [39m[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mFlume MongoDB Sink[0m[38;5;12m (https://github.com/leonlee/flume-ng-mongodb-sink)[39m
|
||||
[38;5;12m [39m[38;5;12m [39m[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mFlume RabbitMQ source and sink[0m[38;5;12m (https://github.com/jcustenborder/flume-ng-rabbitmq)[39m
|
||||
[38;5;12m [39m[38;5;12m [39m[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mFlume UDP Source[0m[38;5;12m (https://github.com/whitepages/flume-udp-source)[39m
|
||||
[38;5;12m [39m[38;5;12m [39m[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1m.Net FlumeNG Clients[0m[38;5;12m (https://github.com/marksl/DotNetFlumeNG.Clients)[39m
|
||||
|
||||
[38;5;12m [39m[38;2;255;187;0m[1m[4mResources[0m
|
||||
[38;5;12mVarious resources, such as books, websites and articles.[39m
|
||||
|
||||
[38;2;255;187;0m[4mWebsites[0m
|
||||
[48;2;30;30;40m[38;5;13m[3mUseful websites and articles[0m
|
||||
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mHadoop Weekly[0m[38;5;12m (http://www.hadoopweekly.com/)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mThe Hadoop Ecosystem Table[0m[38;5;12m (http://hadoopecosystemtable.github.io/)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mHadoop illuminated[0m[38;5;12m (http://hadoopilluminated.com/) - Open Source Hadoop Book[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mAWS BigData Blog[0m[38;5;12m (http://blogs.aws.amazon.com/bigdata/)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mHadoop360[0m[38;5;12m (http://www.hadoop360.com/)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mHow to monitor Hadoop metrics[0m[38;5;12m (https://www.datadoghq.com/blog/monitor-hadoop-metrics/)[39m
|
||||
|
||||
[38;2;255;187;0m[4mPresentations[0m
|
||||
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mApache Hadoop In Theory And Practice[0m[38;5;12m (http://www.slideshare.net/AdamKawa/hadoop-intheoryandpractice)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mHadoop Operations at LinkedIn[0m[38;5;12m (http://www.slideshare.net/allenwittenauer/2013-hadoopsummitemea)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mHadoop Performance at LinkedIn[0m[38;5;12m (http://www.slideshare.net/allenwittenauer/2012-lihadoopperf)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mDocker based Hadoop provisioning[0m[38;5;12m (http://www.slideshare.net/JanosMatyas/docker-based-hadoop-provisioning)[39m
|
||||
|
||||
[38;2;255;187;0m[4mBooks[0m
|
||||
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mHadoop: The Definitive Guide[0m[38;5;12m (http://www.amazon.com/gp/product/1449311520/ref=as_li_ss_tl?ie=UTF8&camp=1789&creative=390957&creativeASIN=1449311520&linkCode=as2&tag=matratsblo-20)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mHadoop Operations[0m[38;5;12m (http://www.amazon.com/gp/product/1449327052/ref=as_li_ss_tl?ie=UTF8&camp=1789&creative=390957&creativeASIN=1449327052&linkCode=as2&tag=matratsblo-20)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mApache Hadoop Yarn[0m[38;5;12m (http://www.amazon.com/dp/0321934504?tag=matratsblo-20)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mHBase: The Definitive Guide[0m[38;5;12m (http://shop.oreilly.com/product/0636920014348.do)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mProgramming Pig[0m[38;5;12m (http://shop.oreilly.com/product/0636920018087.do)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mProgramming Hive[0m[38;5;12m (http://shop.oreilly.com/product/0636920023555.do)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mHadoop in Practice, Second Edition[0m[38;5;12m (http://www.manning.com/holmes2/)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mHadoop in Action, Second Edition[0m[38;5;12m (http://www.manning.com/lam2/)[39m
|
||||
|
||||
[38;2;255;187;0m[4mHadoop and Big Data Events[0m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mApacheCon[0m[38;5;12m (http://www.apachecon.com/)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mStrata + Hadoop World[0m[38;5;12m (http://conferences.oreilly.com/strata)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mDataWorks Summit[0m[38;5;12m (https://dataworkssummit.com/)[39m
|
||||
[48;5;12m[38;5;11m⟡[49m[39m[38;5;12m [39m[38;5;14m[1mSpark Summit[0m[38;5;12m (https://databricks.com/sparkaisummit)[39m
|
||||
|
||||
[38;5;12m [39m[38;2;255;187;0m[1m[4mOther Awesome Lists[0m
|
||||
[38;5;12mOther amazingly awesome lists can be found in the [39m[38;5;14m[1mawesome-awesomeness[0m[38;5;12m (https://github.com/bayandin/awesome-awesomeness) and [39m[38;5;14m[1mawesome[0m[38;5;12m (https://github.com/sindresorhus/awesome) list.[39m
|
||||