Cassandra Developer

Duration: 32 Hours

Overview

Businesses today need to process very large volumes and varieties of data, which creates a need for a highly available, low-latency database. Cassandra addresses this need by supporting high-speed reads and writes across a large distributed system.

What You'll Learn

The Cassandra Developer training course teaches learners to:
  • Architect and engineer NoSQL Cassandra databases
  • Model data in Cassandra
  • Access Cassandra using CQL and Java
  • Ensure high read/write speeds while maintaining data consistency
  • Integrate Cassandra with Hadoop, Pig and Hive
  • Implement commonly used Cassandra design patterns

Curriculum

  • Distributed and Decentralized
  • Elastic Scalability
  • High Availability and Fault Tolerance
  • Tunable Consistency
  • Row-Oriented
  • Schema-Free
  • High Performance

  • The Relational Data Model
  • A Simple Introduction
  • Clusters
  • Keyspaces
  • Hands-on Session

  • Single Node Setup
  • Multi-Node Cluster Setup
  • Key Configurations for Cassandra
  • CLI and Hands-On with Cassandra

  • Cassandra (Column Family NoSQL DB)
  • Key Concepts
    • Key Space
    • Column Family
    • Column Family Options
    • Wide Rows
    • Skinny Rows
    • Column Sorting
    • Super Columns
    • Counter Column Family
    • Composite Keys and Columns
    • Time To Live
  • Secondary Indexes in Cassandra
  • A comparison between Custom Indexes and Secondary Indexes
  • A comparison between Relational Modeling and Cassandra Modeling
  • Key Points while modeling a Cassandra Database
  • Patterns and Anti-Patterns in Cassandra Modeling

  • Anatomy of the Read operation in Cassandra
  • Anatomy of the Write operation in Cassandra
  • How Deletes are handled in Cassandra
  • System Keyspace
  • Peer-to-Peer Model
  • Logical Data Model: Keyspace, Column Family/Table, Rows, Columns
  • Traditional Ring design vs. VNodes
  • Partitioners: Murmur3, Random (MD5) and ByteOrdered
  • Gossip and Failure Detection
  • Anti-Entropy and Read Repair
  • Memtables, SSTables and Commit Log
  • Compaction fundamentals to reduce SSTable data files
  • Hinted Handoff
  • Compaction
  • Bloom Filters, Tombstones
  • Managers and Services
  • VNodes
  • Indexes and Caches
  • Coordinator node
  • Seed nodes
  • Write/Read consistency levels: Any, One, Two, Three, Quorum
  • Snitches: Dynamic snitching, Simple Snitch, Rack Inferring Snitch, Property File Snitch, Gossiping Property File Snitch
  • Routing Client requests
  • Nodetool commands: gossipinfo, cfstats, describing
  • YAML file fundamentals
  • Operations management web GUI
  • Stress testing Cassandra
  • CQL command fundamentals

  • Key concepts for Reads and Writes in Cassandra
  • Tunable Consistency
  • Simple Get, Multi-get Slice
  • Range and Slice
  • Slice Predicate
  • Delete
  • Hands-on CLI commands

  • SQL over Cassandra
  • Composite Keys
  • Hands-on examples on CQL 3.0

  • How to establish Client Connections
  • Thrift Client
  • Connection Pooling
  • Auto-discovery and Failover in Hector
  • Client with CQL

  • Backup and Recovery methods
  • Tuning Cassandra
  • Balancing
  • Bootstrapping
  • Nodetool Commands
  • Upgrades
  • Monitoring critical metrics
  • Bulk Loading Data to Cassandra
  • Bulk Export of Data from Cassandra
  • Hands-on Examples for each of them

Cassandra Hadoop Integration

  • Integration of Solr with Cassandra
  • Search Query on Cassandra
Prerequisites

Basic knowledge of Linux
