Kudu

Uploaded by

Aman Raturi

0% found this document useful (0 votes)

63 views9 pages

Description about apache kudu

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Description about apache kudu

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

63 views9 pages

Kudu

Uploaded by

Aman Raturi

Description about apache kudu

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 9

Search inside document

*8

Apache Kudu
Introduction

❑ Introduction
❑ Architecture
❑ History
❑ Why kudu
❑ Use Case
❑ Kudu vs HBase
Apache Kudu
Introduction

❑ Apache Kudu is a open source column-oriented data store of the Apache Hadoop
ecosystem.
❑ Kudu is storage for fast analytics on fast data.
❑ Kudu providing a combination of fast inserts and updates alongside efficient columnar
scans to enable multiple real-time analytic workloads across a single storage layer.
❑ Kudu fills the gap between HDFS and Apache HBase formerly solved with complex hybrid
architecture.
Apache Kudu
Architecture

The diagram shows a Kudu cluster with three masters and multiple tablet servers, each
serving multiple tablets. It illustrates how Raft consensus is used to allow for both leaders
and followers for both the masters and tablet servers. In addition, a tablet server can be a
leader for some tablets and a follower for others. Leaders are shown in gold, while
followers are shown in grey.
Apache Kudu
Architecture

Master tablet Tablet 1. Tablet 2. Tablet n

Master tablet Tablet 1

Tablet n
LEADER LEADER
FOLLOWER

Tablet 2
Master tablet Tablet 1
FOLLOWER
FOLLOWER FOLLOWER

Master tablet Tablet 1 Taet 2 Tablet n

FOLLOWER FOLLOWER FOLLOWER LEADER
Apache Kudu
History

❑ Apache Kudu is a free and open source column-oriented data store of the Apache Hadoop
ecosystem. It is compatible with most of the data processing frameworks in the Hadoop
environment.
❑ The open source project to build Apache Kudu began as internal project at Cloudera. The
first version Apache Kudu 1.0 was released 19 September 2016.
Apache Kudu
Why Kudu

❑ Apache kudu is the disruptive technology to enable Real-Time analytics on fast data that
we have all been waiting for.
❑ Kudu is completely different than other Big data analytics solution.
❑ Kudu take advantage of Next Generation Hardware.
❑ Kudu supports SQL with Spark or Impala.
❑ Kudu enables killer “Big Data” Apps.
❑ Kudu should be part of your Big Data strategy.
Apache Kudu
Use case

The big data landscape was until 1-3 years ago dominated by several storage systems, the
first was Hadoop HDFS and later followed by Apache HBase, a NoSQL database. HDFS is
great for high-speed writes and scans while the latter is well suited for random-access
queries. A new storage engine, Apache Kudu tries to bridge the gap between those two
uses cases. Apache Kudu is a distributed, columnar database for structured, real-time data.
Because Kudu has a schema, it is only suited for structured data, contrary to HBase which is
schemaless.
Apache Kudu
Kudu vs HBase

❑ Apache HBase is an open-source, distributed, versioned, column-oriented store modeled

after Google Bigtable: A Distributed Storage System for Structured Data. Just as Bigtable
leverages the distributed data storage provided by the Google File System, HBase provides
Bigtable-like capabilities on top of Apache Hadoop.
❑ Performance
● OLTP
● Fast Point Queries
❑ HBase is fast for updates and inserts but for analytics
❑ A new addition to the open source Apache Hadoop ecosystem, Kudu completes Hadoop's
storage layer to enable fast analytics on fast data.
❑ Real time analytics
❑ Kudu is meant to do both well

L02 - Spark SQL For Data Processing: CBG1C04 Big Data Programming
Document23 pages
L02 - Spark SQL For Data Processing: CBG1C04 Big Data Programming
Satya Narayana
No ratings yet
Introduction To Oracle Sharding
Document13 pages
Introduction To Oracle Sharding
scolomaay
100% (1)
Data Migration Deloitte Solution-Siemens
Document2 pages
Data Migration Deloitte Solution-Siemens
Vineet Kumar
No ratings yet
Oracle Data Integrator 11G & 12c Tutorials, - ODI 12c R2 (12.2.1.2
Document10 pages
Oracle Data Integrator 11G & 12c Tutorials, - ODI 12c R2 (12.2.1.2
Reddy seelam manohar
No ratings yet
PLSQL Manual
Document150 pages
PLSQL Manual
apru18
No ratings yet
Data Engineering Roadmap 2023
Document1 page
Data Engineering Roadmap 2023
Diego Petitto
No ratings yet
LINQ to SQL vs LINQ to Entities: Key Differences
Document58 pages
LINQ to SQL vs LINQ to Entities: Key Differences
Deepesh Sharma
0% (1)
Create Database ABC
Document2 pages
Create Database ABC
mekuriaw
No ratings yet
MCredit's Customer Management IS
Document31 pages
MCredit's Customer Management IS
Nam Nguyễn
No ratings yet
Hands-On Lab Manual: Introduction To Oracle Data Integrator 12c
Document47 pages
Hands-On Lab Manual: Introduction To Oracle Data Integrator 12c
Lizbeth Sarahi Soto Arzate
No ratings yet
Migrating Your SQL Server Workloads To PostgreSQL - Part 3 - CodeProject
Document6 pages
Migrating Your SQL Server Workloads To PostgreSQL - Part 3 - CodeProject
gfgomes
No ratings yet
Step Install Cloudera Manager & Setup Cloudera Cluster
Document23 pages
Step Install Cloudera Manager & Setup Cloudera Cluster
Onne RS-Empire
No ratings yet
Cloudera Apache Impala Guide
Document691 pages
Cloudera Apache Impala Guide
pooh06
No ratings yet
Mum PSP SQL
Document24 pages
Mum PSP SQL
afonsobds
No ratings yet
Cloudera Quickstart PDF
Document28 pages
Cloudera Quickstart PDF
Adarsh Bhardwaj
No ratings yet
Oracle Database 12c R2: Administration Workshop Ed 3: Duration
Document6 pages
Oracle Database 12c R2: Administration Workshop Ed 3: Duration
jackomito
100% (1)
Study Guide For Oracle Certified Master
Document132 pages
Study Guide For Oracle Certified Master
pablovivasve
No ratings yet
Cloudera Kudu
Document102 pages
Cloudera Kudu
Giuseppe Pucci
100% (1)
Oracle OBIEE 12c Tuning Guide - v2 PDF
Document60 pages
Oracle OBIEE 12c Tuning Guide - v2 PDF
nareshreddyguntaka
No ratings yet
Professional Hadoop Solutions
From Everand
Professional Hadoop Solutions
Boris Lublinsky
Rating: 4 out of 5 stars
4/5 (2)
Oracle AVDF Alert Event Parser
Document13 pages
Oracle AVDF Alert Event Parser
Arnav Vaid
No ratings yet
Using Ola Hallengrens SQL Maintenance Scripts PDF
Document28 pages
Using Ola Hallengrens SQL Maintenance Scripts PDF
Hana Ibisevic
No ratings yet
Exadata Migration
Document13 pages
Exadata Migration
KanjaFit
No ratings yet
AWS Oracle DB Migration Questionnaire
Document2 pages
AWS Oracle DB Migration Questionnaire
Avineet Agarwalla
No ratings yet
Dsi404 Query Optimizer
Document504 pages
Dsi404 Query Optimizer
kolleru
No ratings yet
Apache Oozie Essentials
From Everand
Apache Oozie Essentials
Singh Jagat Jasjit
No ratings yet
Installing and Using Impala
Document248 pages
Installing and Using Impala
Sumit Kumar Awkash
No ratings yet
Migrating and Upgrading To Oracle Database 12c Quickly With Near-Zero Downtime
Document31 pages
Migrating and Upgrading To Oracle Database 12c Quickly With Near-Zero Downtime
sellendu
No ratings yet
Oracle Qs
Document6 pages
Oracle Qs
hellopavani
No ratings yet
Apex Institute of Technology: Big Data Security
Document30 pages
Apex Institute of Technology: Big Data Security
So do so
No ratings yet
Goldengate Openworld PDF
Document36 pages
Goldengate Openworld PDF
rishimahajan
No ratings yet
Revolutionizing Data Warehousing in Telecom With The Vertica Analytic Database
Document11 pages
Revolutionizing Data Warehousing in Telecom With The Vertica Analytic Database
grandelindo
No ratings yet
All About TransactionScope - CodeProject
Document16 pages
All About TransactionScope - CodeProject
Grace Patiño Perez
No ratings yet
TeradataStudioUserGuide 2041
Document350 pages
TeradataStudioUserGuide 2041
Manikanteswara Patro
No ratings yet
Multitenant Administrators Guide
Document627 pages
Multitenant Administrators Guide
Paohua Chuang
No ratings yet
Oracle NoSQLDB Admin PDF
Document83 pages
Oracle NoSQLDB Admin PDF
xmanash
No ratings yet
Oracle Stream
Document804 pages
Oracle Stream
nassr50
No ratings yet
Dataguard Questions
Document5 pages
Dataguard Questions
suhaas
No ratings yet
Cloudera Introduction PDF
Document97 pages
Cloudera Introduction PDF
Santhosh Kumar
No ratings yet
Hortonworks Hadoop System Admin Guide 20130819
Document68 pages
Hortonworks Hadoop System Admin Guide 20130819
murthynsmpranu
No ratings yet
SAS Hadoop Kerberos
Document27 pages
SAS Hadoop Kerberos
shilpa
No ratings yet
Performance Evaluation of SQL and Nosql Database Management Systems in A Cluster
Document24 pages
Performance Evaluation of SQL and Nosql Database Management Systems in A Cluster
Maurice Lee
No ratings yet
Cloudera Data Analyst Training PDF
Document2 pages
Cloudera Data Analyst Training PDF
jimmy
No ratings yet
Database Appliance X4-2 Customer Presentation
Document30 pages
Database Appliance X4-2 Customer Presentation
mrugank21
No ratings yet
Data Security Powerpoint
Document25 pages
Data Security Powerpoint
leanna hoyte
No ratings yet
A00-221 Certification Guide and How To Clear Exam On SAS Big Data Programming and Loading
Document15 pages
A00-221 Certification Guide and How To Clear Exam On SAS Big Data Programming and Loading
Palak Mazumdar
0% (1)
Cloudera Manager Administration Guide
Document78 pages
Cloudera Manager Administration Guide
arun_sakre
No ratings yet
Oracle Database 12c R2: Administration Workshop Ed 3: Duration
Document6 pages
Oracle Database 12c R2: Administration Workshop Ed 3: Duration
Bugz Binny
100% (1)
Setting Up Database Oracle FLEXCUBE Universal Banking
Document106 pages
Setting Up Database Oracle FLEXCUBE Universal Banking
fptnam
No ratings yet
Performance Tuning With InfoSphere CDC
Document37 pages
Performance Tuning With InfoSphere CDC
karthikt27
100% (1)
Attunity Streaming Change Data Capture Ebook
Document54 pages
Attunity Streaming Change Data Capture Ebook
Carmelo Escribano Sen
0% (1)
Hadoop ECO System
Document1 page
Hadoop ECO System
fjaimesilva
No ratings yet
Exadata Smart Scan Exadata Smart Scan What Is So Smart About It?
Document22 pages
Exadata Smart Scan Exadata Smart Scan What Is So Smart About It?
piciul2010
No ratings yet
GG Flat File v3
Document37 pages
GG Flat File v3
Hasan Bilal
No ratings yet
Oracle Advanced Security Transparent Data Encryption Best Practices
Document36 pages
Oracle Advanced Security Transparent Data Encryption Best Practices
Dinesh Kumar
No ratings yet
CB Queryoptimization 01
Document78 pages
CB Queryoptimization 01
Jean-Marc Boivin
No ratings yet
D50311GC40 Les06
Document37 pages
D50311GC40 Les06
MuhammadSaad
No ratings yet
Talend ESB Container AG 50b en
Document63 pages
Talend ESB Container AG 50b en
slimanihaythem
No ratings yet
Database 12c Update
Document9 pages
Database 12c Update
abidou
No ratings yet
Updating Exadata Database Server Software
Document15 pages
Updating Exadata Database Server Software
Βαγγέλης Οικονομοπουλος
No ratings yet
Getting Started with Big Data Query using Apache Impala
From Everand
Getting Started with Big Data Query using Apache Impala
Agus Kurniawan
No ratings yet
Hadoop Cluster Deployment
From Everand
Hadoop Cluster Deployment
Danil Zburivsky
No ratings yet
Relational Databases: State of the Art Report 14:5
From Everand
Relational Databases: State of the Art Report 14:5
D A Bell
No ratings yet
PowerBI 1 To 151 05 01 2021
Document43 pages
PowerBI 1 To 151 05 01 2021
Vikas
No ratings yet
DDL Data Definition Exercises
Document10 pages
DDL Data Definition Exercises
Ritesh Bagwale
No ratings yet
Keys in Database: Super Key, Candidate Key, Primary Key and Foreign Key
Document71 pages
Keys in Database: Super Key, Candidate Key, Primary Key and Foreign Key
home123457
No ratings yet
3-1 Review of SQL DML
Document19 pages
3-1 Review of SQL DML
Ziki
No ratings yet
SQL11G Vol2
Document390 pages
SQL11G Vol2
Yogita Sarang
No ratings yet
SQL Command
Document6 pages
SQL Command
Yona
No ratings yet
DBMS Using MS Access
Document37 pages
DBMS Using MS Access
kapilharit20056130
No ratings yet
MySQL 8 For Developers
Document113 pages
MySQL 8 For Developers
jd
No ratings yet
SQL Transaction Processing Guide
Document17 pages
SQL Transaction Processing Guide
santoshsutar1983
No ratings yet
Advanced Java (Module 5)
Document11 pages
Advanced Java (Module 5)
Sushma Sumant
No ratings yet
Dbms Syllabus
Document1 page
Dbms Syllabus
Gaurav Rathi
No ratings yet
Sub Queries
Document4 pages
Sub Queries
Florin Nedelcu
No ratings yet
The SQL Create Database Statement: Syntax
Document6 pages
The SQL Create Database Statement: Syntax
Hemant Kumar
No ratings yet
Activity 6.1 Module9 Union Intersect Minus
Document2 pages
Activity 6.1 Module9 Union Intersect Minus
Akshay Mehta
No ratings yet
DBMS Architecture: Kocbk Database Management System
Document23 pages
DBMS Architecture: Kocbk Database Management System
Abhishek Kumar
No ratings yet
70 461 Exam Guide Querying Microsoft SQL Server 2012 PDF
Document10 pages
70 461 Exam Guide Querying Microsoft SQL Server 2012 PDF
Araz2008
No ratings yet
Advanced SQL Main Module TVET
Document119 pages
Advanced SQL Main Module TVET
mohammed ahmed
No ratings yet
Manually Creating An Oracle 11g Database
Document4 pages
Manually Creating An Oracle 11g Database
Arsam
No ratings yet
Practical File Term 2 Computer
Document31 pages
Practical File Term 2 Computer
Lakshya Sahni
No ratings yet
DBMS Lab Manual
Document91 pages
DBMS Lab Manual
VETRI
No ratings yet
Unit 4
Document28 pages
Unit 4
Nikhil Ahuja
No ratings yet
SQL LAB SET I
Document8 pages
SQL LAB SET I
anand k kumar
No ratings yet
Database Management System
Document9 pages
Database Management System
Rajesh kuamr
No ratings yet
Davao Let - Mapeh False PDF
Document22 pages
Davao Let - Mapeh False PDF
PhilBoardResults
No ratings yet
Gym Managment Project Report
Document36 pages
Gym Managment Project Report
harsh
No ratings yet
SQL Server Versions in Distribution, Parallelism and Big Data - Paper - 2016
Document9 pages
SQL Server Versions in Distribution, Parallelism and Big Data - Paper - 2016
ngo thanh hung
No ratings yet
8th STD Chap 3 Ms Access Ex
Document18 pages
8th STD Chap 3 Ms Access Ex
Raj
No ratings yet