Toll Free 1800-123-321-5

Cassandra

Enterprises tend to leverage Apache Cassandra for their highly reliable and scalable databases, especially while dealing with large volumes of data. Cassandra has been a popular choice for various IT organizations worldwide as it is designed to handle huge workloads across multiple data centres and can be modified according to different organizational needs and demands. The value of Cassandra has significantly increased over the last few years and is quickly becoming a necessity for a multitude of job roles in IT. IIHT’s Cassandra training program is designed by experts to meet the industry’s requirements and demands. The course structure will cover various topics like NoSQL, relational data models, clusters, keyspaces, among many other to ensure that course participants develop the skills required to install and setup Cassandra, and understand its integration with Hadoop and Solr.

Implement Backup and Recovery Strategies for Cassandra

Monitor Cassandra along with its integration with frameworks like Hadoop

Perform Cassandra Admin Operations for Managing a Cluster

Print Friendly, PDF & Email

Prerequisites

  • Fundamental knowledge of any programming language
  • Basic understanding of any database, SQL and query language for databases
  • Working knowledge of Linux or Unix based systems (not mandatory)
TOP

Course Content

What is Big Data

  • Technology Landscape
  • Big Data Relevance
  • Distributed Systems and Challenges

 

Why NoSQL Databases

  • Relational DB vs. NoSQL
  • Type of NoSQL Databases
  • NoSQL Landscape
  • CAP Theorem and Eventual Consistency
  • Key Characteristics of NoSQL Database systems
  • ACID vs BASE

 

Cassandra Fundamentals

  • Distributed and Decentralized
  • Elastic Scalability
  • High Availability and Fault Tolerance
  • Tuneable Consistency
  • Row-Oriented
  • Schema-Free
  • High Performance

 

The Cassandra Data Model

  • The Relational Data Model
  • A Simple Introduction
  • Clusters
  • Keyspaces
  • Hands-on Session

 

Installation and Setup of Cassandra

  • Single Node Setup
  • Multi-Node Cluster Setup
  • Key Configurations for Cassandra
  • CLI and Hands-On with Cassandra

 

Cassandra Modeling

  • Cassandra (Column Family NoSQL DB)
  • Key Concepts – Key Space – Column Family – Column Family Options – Wide Rows, Skinny Row – Column Sorting – Super Columns – Counter Column Family – Composite Keys and Columns – Time To Live –
  • Secondary Indexes in Cassandra
  • Difference between Custom Indexes and Secondary Indexes
  • Difference between Relational Modeling and Cassandra Modeling
  • Key Points to note while modeling a Cassandra Database
  • Patterns and Anit-Patterns in Cassandra Modeling

Cassandra Architecture & Intro to CQL

  • Anatomy of Reading operation in Cassandra
  • Anatomy of the Write operation in Cassandra
  • How is Deletes handled in Cassandra
  • System Keyspace
  • Peer to Peer Model Logical Data Model: Keyspace, Column Family/Table, Rows, Columns
  • Traditional Ring design vs. VNodes
  • Partitioners: Murmer3, Random (md5) and ByteOrdered
  • Gossip and Failure Detection
  • Anti-Entropy and Read Repair
  • Memtables, SSTables and Commit Log
  • Compaction fundamentals to reduce SSTable data files
  • Hinted Handoff
  • Compaction
  • Bloom Filters, Tombstones
  • Managers and Services
  • VNodes
  • Indexes and Caches
  • Coordinator node
  • Seed nodes
  • Write/Read consistency levels: Any, One, Two, Three, Quorum
  • Snitches: Dynamic snitching, Simple Snitch, Rack Inferring Snitch, Property File Snitch, Gossiping Property File Snitch
  • Routing Client requests
  • Nodetool commands: gossipinfo, cfstats, describing
  • YAML file fundamentals
  • Operations management web GUI
  • Stress testing Cassandra
  • CQL command fundamentals

 

Cassandra API

  • Key concepts for Reading and Write in Cassandra
  • Tunable Consistency
  • Simple Get, Multi-get Slice
  • Range and Slice
  • Slice Predicate
  • Delete
  • Hands-on CLI commands

 

Cassandra Clients

  • How to establish Client Connections
  • Thrift Client
  • Connection Pooling
  • Auto-discovery and Failover in Hector
  • Client with CQL

 

Cassandra Monitoring and Administration

  • Tuning Cassandra
  • Backup and Recovery methods
  • Balancing
  • Bootstrapping
  • Node Tools Commands
  • Upgrades
  • Monitoring critical metrics
  • Bulk Loading Data to Cassandra
  • Bulk Export of Data from Cassandra
  • Hands-on Examples for each of them

 

Cassandra Analytics Cluster

  • Cassandra Hadoop Integration

 

Cassandra Search Cluster

  • Integration of Solr with Cassandra
  • Search Query on Cassandra
TOP