Overview
In today’s times, businesses need to process a very large volume and variety of data. This generates a requirement for a highly available, low latency database. Cassandra solves this challenge by allowing high-speed reads and writes across a large and distributed system.
What You'll Learn
The Cassandra Developer Training courses educates the learner in –
- Architect and engineer NoSQL Cassandra databases
- Model data in Cassandra
- Accessing Cassandra using CQL and Java
- Ensuring high read/write speeds while maintaining data consistency
- Integrating Cassandra with Hadoop, Pig and Hive
- Implementing commonly used Cassandra design patterns
Curriculum
- Distributed and Decentralized
- Elastic Scalability
- High Availability and Fault Tolerance
- Tune-able Consistency
- Row-Oriented
- Schema-Free
- High Performance
- The Relational Data Model
- A Simple Introduction
- Clusters
- Keyspaces
- Hands-on Session
- Single Node Setup
- Multi-Node Cluster Setup
- Key Configurations for Cassandra
- CLI and Hands-On with Cassandra
- Cassandra (Column Family NoSQL DB)
- Key Concepts
- Key Space
- Column Family
- Column Family Options
- Wide Rows
- Skinny Row
- Column Sorting
- Super Columns
- Counter Column Family
- Composite Keys and Columns
- Time To Live
- Secondary Indexes in Cassandra
- A comparison between Custom Indexes and Secondary Indexes
- A comparison between Relational Modeling and Cassandra Modeling
- Key Points while modeling a Cassandra Database
- Patterns and Anti-Patterns in Cassandra Modeling
- Anatomy of Reading operation in Cassandra
- Anatomy of the Write operation in Cassandra
- How is Deletes handled in Cassandra
- System Keyspace
- Peer to Peer Model Logical Data Model: Keyspace, Column Family/Table, Rows, Columns
- Traditional Ring design vs. VNodes
- Partitioners: Murmer3, Random (md5) and ByteOrdered
- Gossip and Failure Detection
- Anti-Entropy and Read Repair
- Memtables, SSTables and Commit Log
- Compaction fundamentals to reduce SSTable data files
- Hinted Handoff
- Compaction
- Bloom Filters, Tombstones
- Managers and Services
- VNodes
- Indexes and Caches
- Coordinator node
- Seed nodes
- Write/Read consistency levels: Any, One, Two, Three, Quorum
- Snitches: Dynamic snitching, Simple Snitch, Rack Inferring Snitch, Property File Snitch, Gossiping Property File Snitch
- Routing Client requests
- Nodetool commands: gossipinfo, cfstats, describing
- YAML file fundamentals
- Operations management web GUI
- Stress testing Cassandra
- CQL command fundamentals
- Key concepts for Reading and Write in Cassandra
- Tunable Consistency
- Simple Get, Multi-get Slice
- Range and Slice
- Slice Predicate
- Delete
- Hands-on CLI commands
- SQL over Cassandra
- Composite Keys
- Hands-on examples on CQL 3.0
- How to establish Client Connections
- Thrift Client
- Connection Pooling
- Auto-discovery and Failover in Hector
- Client with CQL
- Backup and Recovery methods
- Tuning Cassandra
- Balancing
- Bootstrapping
- Node Tools Commands
- Upgrades
- Monitoring critical metrics
- Bulk Loading Data to Cassandra
- Bulk Export of Data from Cassandra
- Hands-on Examples for each of them
Cassandra Hadoop Integration
- Integration of Solr with Cassandra
- Search Query on Cassandra
Prerequisites
Basic knowledge of Linux