apache kudu

APACHE KUDUASIM JALIS

GALVANIZE

ASIM JALISGalvanize/Zipfian, DataEngineeringCloudera, Microso!,SalesforceMS in Computer Sciencefrom University ofVirginia

WHAT IS GALVANIZE’S DATAENGINEERING IMMERSIVE?

Immersive Peer LearningEnvironmentMaster High-DemandSkills and TechnologiesHeart of San Francisco inSOMA

YOU GET TO . . .Play with Terabytes ofDataSpark, Hadoop, Hive,Kafka, Storm, HBaseData Science at ScaleLevel UP your Career

FOR MORE INFORMATIONCheck outhttp://[email protected]

http://galvanize.com/

TALK OVERVIEW

WHAT IS THIS TALK ABOUT?What is Kudu?How can I use it tosimplify my Big Dataarchitecture?How can I use it as analternative to HBase?How does it work?Demo

HOW MANY PEOPLE HERE AREFAMILIAR WITH HDFS?

HOW MANY PEOPLE HERE AREFAMILIAR WITH HBASE?

HOW MANY PEOPLE HERE AREFAMILIAR WITH KUDU?

WHY KUDU

WHAT IS A KUDU?Kudus are a kind ofantelopeFound in eastern andsouthern Africa

WHAT PROBLEM DOES KUDUSOLVE?

WHAT IS LATENCY?How long it takes to startHow long it takes to setup

WHICH HAS HIGHER LATENCY:AIRPLANES OR CARS?

WHICH HAS HIGHER LATENCY:AIRPLANES OR CARS?

Airplanes have highlatencyCars have lower latencyWinner: Cars

WHAT IS THROUGHPUT?Operations per second/minute/hourHow much you get done in unit time

WHICH HAS HIGHERTHROUGHPUT: AIRPLANES OR

CARS?

WHICH HAS HIGHERTHROUGHPUT: AIRPLANES OR

CARS?Airplanes have highthroughputCars have lowerthroughputWinner: Airplanes

WHICH IS BETTER? Low throughput High throughput

Low latency Cars Jet-Packs

High latency Horses Planes

It depends, usually there is a tradeoffGoing to Oakland: DriveGoing to Florida: Fly

WHY DOES HBASE EXIST?HDFS immutable,append-onlyHBase mutable, randomaccess read/writeFast random accessLow latency (good)

WHY DOES PARQUET EXIST?Main idea: columnarstorageFast scan, fast batchprocessingHigh throughput (good)High latency (bad)

WHY DOES KUDU EXIST? Low throughput High throughput

Low latency HBase Kudu

High latency JSON on HDFS Parquet on HDFS

WHY KUDU?Kudu is the child ofParquet and HBaseHBase with columnarstorageMutable Parquet withfast random access forread/writeParquet-like HBaseHBase-like Parquet

WHAT IS THE USE CASE FORKUDU?

Use HBase-like features to store data as it arrives in real-timeUse Parquet-like features to run analytic workloads onreal-time dataIn queries easily combine long-term historical data andshort-term real-time data

WHAT IS THE BIG IDEA OF KUDU?HBase uses in-memory store for low-latency reads/writesParquet on HDFS uses columnar layout for high-throughput scansKudu idea: in-memory store + columnar layout

KUDU MOTIVATIONGoal Competing With

Fast columnar scans Parquet/HDFS

Low-latency random updates HBase

Consistent performance -

KUDU, HBASE, HDFS

WHAT LANGUAGE IS KUDUWRITTEN IN?

C++ for performanceJava and Python wrappers

HOW CAN I INTERACT WITHKUDU?

Impala is Kudu’s defacto shellC++, Java, or Python clientMapReduceSpark (beta)MapReduce and Spark read-only access (currently)

BENCHMARKS

KUDU VS PARQUET ON HDFSTPC-H: Business-oriented queries/updatesLatency in ms: lower is better

KUDU VS HBASEYahoo! Cloud SystemBenchmark (YCSB)Evaluates key-value andcloud serving storesRandom acccessworkloadThroughput: higher isbetter

KUDU VS PHOENIX VS PARQUET

SQL analytic workloadTPC-H LINEITEM table onlyPhoenix best-of-breed SQL on HBase

LAMBDA ARCHITECTURE

KUDU USE CASE: LAMBDAARCHITECTURE

LAMBDA ARCHITECTUREReal-time data processby Speed LayerHistorical dataprocessed by BatchLayerSpeed Layer resultssaved in HBaseNew and old data joinedin Spark Streaming

ONLINE RETAILER USE CASEList popular items top offront pageWhat is best selling item:

this hourtodaythis week

Keep inventory numbersupdated

KUDUUsing KuduSpeed layer queries caninclude analytics

PRE-KUDU ARCH

POST-KUDU ARCH

KUDU INTERNALS

KUDU ARCHITECTURE

KUDU MASTERFault toleranceFailover to backupmastersRa! used for electingnew leadersOnly leader serves clientrequests

KUDU MASTER ROLESCatalog Manager

Metadata: schema,replicationMaster Tablet storesmetadata

Cluster CoordinatorWho is alive:redistribute data ondeath

Tablet directoryWhich tablet is where

TABLETSTablets are similar toHBase RegionsTables staticallypartitioned into tabletsTablets can bereplicated to 3 or 5Replication uses Ra!’sLeader/Follower patternfor consensusData stored on local filesystem, not HDFS

TABLET REPLICATIONWrites must be done onleader tabletReads can be on leaderor followersWrite log is replicatedLog replication driven byleaderUses Ra! consensus

THICK CLIENTClient caches tablet metadataUses metadata to figure out leader to write toFailure on write to deposed leaderWrite failure forces metadata refresh

TABLET INTERNALSComponent Description

MemRowSet In-memory writes, stored row-wise

DiskRowSet Disk-based data store, columnar, 32 KB

DeltaMemStores In-memory update store

DeltaFile Disk-based update store

MEM ROW SETWrites for new keys arewritten in MemRowSets.Then flushed out toDiskRowSets.Kudu uses Bloom filtersto determine if a key is inDiskRowSet.

DELTA MEM STORESUpdates are written toDeltaMemStores.The flushed toDeltaFiles.

TABLET COUNTTablet count based on partitionsPartitioning function maps row to tabletPartitioning defined at table creation timePartitions cannot be redefined dynamically

PARTITIONINGHash Partitioning

Subset of the primary key columns and number of bucketsFor exampleDISTRIBUTE BY HASH(col1,col2) INTO 16 BUCKETS

Range PartitioningOrdered subset of primary key columnsMaps tuples into binary strings by concatenating values ofspecified columns using order-preserving encoding.

DOES KUDU REQUIRE HADOOP?It does not depend on HDFS.Has no required dependency on Hadoop, HDFS,MapReduce, Spark.However, in practice accessed most easily through Impala.

DO KUDU TABLETSERVERSSHARE DISK SPACE WITH HDFS?Kudu TabletServers and HDFS DataNodes can run on themachines.Kudu’s InputFormat enables data locality.Data locality: MapReduce and Spark tasks likely to run onmachines containing data.

KUDU SCHEMA

WHAT DATA TYPES DOES KUDUSUPPORT?

Boolean8-bit signed integer16-bit signed integer32-bit signed integer64-bit signed integer

Timestamp32-bit floating-point64-bit floating-pointStringBinary

HOW LARGE CAN VALUES BE INKUDU?

Values in the 10s of KB and above are not recommendedPoor performanceStability issues in current releaseNot intended for big blobs or images

ENCODING TYPESColumn Type Encoding

integer, timestamp plain, bitshuffle, run length

float plain, bitshuffle

bool plain, dictionary, run length

string, binary plain, prefix, dictionary

WHAT IS PLAIN ENCODING?Data in its natural format.E.g. int32 stored as 32-bit little-endian integers.

WHAT IS BITSHUFFLE ENCODING?Bitwise columnarencoding.MSB stored first, thensecond-MSB, etc.Result LZ4 compressed.Works well when valuesrepeat or change bysmall amounts.

11010010 1111 111111010011 --> 0000 111111010100 0000 001111010101 1100 1010

WHAT IS RUN LENGTHENCODING?

Repeated values (runs) are stored as value and count.Works well for denormalized tables with consecutiverepeated values when sorted by primary key.54.231.184.754.231.184.754.231.184.7 --> 54.231.184.7 | 554.231.184.754.231.184.7

WHAT IS DICTIONARY ENCODING?Dictionary of unique values.Value encoded as index in dictionary.Works well if column has small set of unique values.If there are too many values Kudu falls back to plainencoding.

WHAT IS PREFIX ENCODING?Common prefixes compressed in consecutive columnvalues.Works well when values share common prefixes.

COLUMN COMPRESSIONPer-column compression using LZ4, Snappy, or ZLib.By default, columns stored uncompressed.

WHAT IMPACT WILLCOMPRESSION HAVE ON SCAN

PERFORMANCE AND ON SPACE?It will reduce storage space.It will reduce scan performance.

ADMIN UI VM PORT SETUPVirtual Box > Settings > Network > Adapter 2 > Port Forwarding > +

Enter: Name: Rule 1 Protocol: TCP Host IP: Host Port: 8051 Guest IP: Guest Port: 8051

Open browser:http://quickstart.cloudera:8051

CONNECT TO VMssh [email protected]

START IMPALA SHELLimpala-shell

CREATE A TABLECREATE TABLE sales ( state STRING, id INTEGER, sale_date STRING, store INTEGER, product INTEGER, amount DOUBLE) DISTRIBUTE BY RANGE(state) SPLIT ROWS(('CA'),('WA'),('OR'))TBLPROPERTIES( 'storage_handler' = 'com.cloudera.kudu.hive.KuduStorageHandler', 'kudu.table_name' = 'sales', 'kudu.master_addresses' = 'quickstart.cloudera:7051', 'kudu.key_columns' = 'state,id');

INSERT SOME DATA INTO ITINSERT INTO sales (state, id, sale_date, store, product, amount)VALUES('WA',101,'2014-11-13',100,331,300.00),('OR',104,'2014-11-18',700,329,450.00),('CA',102,'2014-11-15',203,321,200.00),('CA',106,'2014-11-19',202,331,330.00),('WA',103,'2014-11-17',101,373,750.00),('CA',105,'2014-11-19',202,321,200.00);

TRY SQLSELECT COUNT(*) FROM sales;SELECT STATE,COUNT(*) FROM sales GROUP BY STATE;

VIEW TABLE IN ADMIN UIGo to http://quickstart.cloudera:8051/

http://quickstart.cloudera:8051/

REFERENCES

READINGSKudu Whitepaperhttp://getkudu.io/kudu.pdf

Quickstart VMhttp://getkudu.io/docs/quickstart.html

Schemahttp://getkudu.io/docs/schema_design.html

Kudu Impala Guidehttp://www.cloudera.com/documentation/betas/kudu/0-5-0/topics/kudu_impala.html

http://getkudu.io/kudu.pdf

http://getkudu.io/docs/quickstart.html

http://getkudu.io/docs/schema_design.html

http://www.cloudera.com/documentation/betas/kudu/0-5-0/topics/kudu_impala.html

GALVANIZE DATA ENGINEERING

QUESTIONS

apache kudu

Data & Analytics