Introducing 

Prezi AI.

Your new presentation assistant.

Refine, enhance, and tailor your content, source relevant images, and edit visuals quicker than ever before.

Loading…
Transcript

BIG DATA TECHNOLOGIES

LEEN HASHEM

20200948

APACHE HBASE

INTRO

PREPARED BY:

LEEN HASHEM

COURSE:

DATABASE SYSTEMS

INSTRUCTOR:

DR.RAGHDA HRIEZ

WHAT IS APACHE HBASE?

APACHE HBASE

  • Open source project
  • Column oriented
  • Non-relational
  • On top of HDFS

HISTORY

HISTORY & ORIGIN

Google's Bigtable paper

Apache Hadoop project

2006 - development

2007- Apache incubator project

2010 - Top level

2010 - first release of HBase 0.1.0

HBase Architecture

consists of two key components: the HMaster and the RegionServers

ARCHITECTURE

HMaster

Table Management

Region Assignment

Failover and Recovery

Load Balancing

Region Servers

REGIONSERVER

Data Storage

Data Operations

Data Replication

Splitting & Compaction

INTEGRATION

HDFS

YARN

INTEGRATION

Data Persistence

Scalable Storage

High Throughput

Resource Isolation

Dynamic Allocation

Fair Sharing

Cluster Utilization

GENERAL

CASES

CASES

  • Real-time analytics
  • Time series data storage
  • Social media platforms
  • IoT environments
  • Fraud detection & Risk analysis

REAL LIFE

CASES

REAL LIFE CASES

DIFFERENCES

HBase

RDBMS

Schema

DIFFERENCES

Data

  • Fixed schema
  • Structured
  • Need
  • Hard
  • No Fixed schema
  • Structured & semi-structured
  • No need
  • Easy

Normalization

Scalability

Characteristics

APPROACH

  • Written in Java
  • Consistency and Durability
  • Bloom Filters
  • Security
  • Backup and Restore
Learn more about creating dynamic, engaging presentations with Prezi