Apache Kafka

IIHT's Apache Kafka training is a one-stop shop for learning to implement, administer and develop with Apache Kafka to meet enterprise needs for real-time, scalable and fault-tolerant data processing. The 2-day training ranges from fundamental concepts such as the Kafka cluster and the Kafka APIs to advanced topics such as Kafka integration with Hadoop, Storm, Spark and Maven. The course also covers installation and configuration of Kafka and of supporting components such as Hadoop, Spark and Maven.

Duration

2 Days

Course Content

  • What is Big Data?
  • The need for Big Data
  • Why Big Data now?
  • Myths of Big Data
  • Table of data measurement units
  • Is one petabyte big data?
  • Types of Architectures in Big Data
  • Lambda Architecture
  • Kappa Architecture
  • Zeta Architecture
  • Seda Architecture
  • NoSQL Store and high throughput messaging system
  • Illustration about CAP theorem
  • Problems with large-scale systems
  • Why HDFS?
  • HDFS Architecture
  • Using HDFS and hdfs commands
  • What a distributed service is, and why we need ZooKeeper
  • CAP – Brewer’s Theorem
  • Systems that use ZooKeeper as the underlying storage
  • System requirements, and installing and managing a ZooKeeper cluster
  • Quorum
  • Epoch
  • Znode
  • Session
  • Watcher
  • Persistent Znode
  • Ephemeral Znode
  • Sequential Znode
  • Create
  • Get
  • Set
  • Delete
  • Java code to create, get and set data on a znode
  • Configuring ZooKeeper
  • Managing ZooKeeper Storage
  • Remotely Connecting to ZooKeeper
  • Logging
  • Leader Election
  • Locks
  • Queues
  • Lambda Architecture
  • Kappa Architecture
  • Zeta Architecture
  • Seda Architecture
  • NoSQL Store and high throughput messaging system
  • Real time analytics
  • Data ingestion
  • Case studies
  • Core concepts
  • Kafka Design
  • Log Compaction
  • Message compaction
  • Replication
  • Message flow
  • High Availability and Consistency
  • Resource Management
  • Topics
  • Partitions
  • Replicas
  • Producers
  • Consumers
  • Brokers
  • Segment
  • Offset
  • Leader
  • Follower
  • Last Committed Offset
  • In-Sync Replicas
  • High Watermark
  • Log end offset
  • Single and Multiple Consumers with Multiple Consumer Groups
  • Consumer rebalancing
  • Group Coordinator and Group Leader Strategy
  • Architecture
  • Hardware specs
  • Deep dive into deploying a Kafka cluster
  • Understanding the components of Kafka cluster
  • Installation of Kafka Cluster
  • Configuring Kafka Cluster
  • Producer of Kafka
  • Consumer of Kafka
  • Producer and Consumer in Action
  • Hands on code with Java and Scala
  • Replication and Compression
  • Subscribing to topics
  • Assignment to topic partitions
  • Install Kafka
  • Set up a Kafka cluster:
  • Single node, single broker
  • Single node, multiple brokers
  • Multiple nodes, multiple brokers
  • Kafka Operations and Performance Tuning
  • KStreams
  • KTable
  • All transformations with KStreams and KTable: map(), mapValues(), filter(), flatMap(), groupBy(), groupByKey(), foreach(), peek(), writeAsText(), print()
  • Windowing
  • Data transfer between Kafka topics and HDFS through Kafka Connect
  • Working with Kafka Logs
  • Operationalizing and securing Kafka
  • Security Overview
  • Configuring Kafka Security
  • Authentication using SASL
  • Authorization and ACLs
  • Incorporating Security Features in a Running Cluster
  • ZooKeeper for HA (hands-on)
  • Using Kafka Connect to move data
  • Monitoring and Alerting using Kafka Tools
  • Set up authentication for Kafka
  • Authentication via SSL & Kerberos through SASL
  • Authorization, permissions and ACLs setup
  • Set up Encryption
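
Two of the topics listed above, log compaction and the high watermark, can be previewed without a running cluster. The following is a minimal in-memory sketch in Python, not Kafka code and not taken from the course material: the function names `compact` and `high_watermark`, the record layout and the broker names are all hypothetical. It assumes the usual definitions: compaction keeps the latest record for each key at its original offset, and the high watermark is the smallest log end offset among the in-sync replicas. Dropping delete markers (tombstones) immediately is a simplification.

```python
from typing import Dict, List, Optional, Set, Tuple

# Hypothetical record layout: (key, value); a value of None marks a delete (tombstone).
Record = Tuple[str, Optional[str]]


def compact(log: List[Record]) -> List[Tuple[int, str, str]]:
    """Keep only the latest record per key, at its original offset.

    Simplification: tombstoned keys are dropped right away.
    """
    last_offset: Dict[str, int] = {}
    for offset, (key, _) in enumerate(log):
        last_offset[key] = offset  # later records overwrite earlier ones
    return [
        (offset, key, value)
        for offset, (key, value) in enumerate(log)
        if last_offset[key] == offset and value is not None
    ]


def high_watermark(log_end_offsets: Dict[str, int], in_sync: Set[str]) -> int:
    """Smallest log end offset among the in-sync replicas."""
    return min(log_end_offsets[replica] for replica in in_sync)


if __name__ == "__main__":
    log = [("a", "1"), ("b", "2"), ("a", "3"), ("b", None), ("c", "4")]
    print(compact(log))  # [(2, 'a', '3'), (4, 'c', '4')]
    # broker3 has fallen out of the in-sync set, so it does not hold the watermark back.
    print(high_watermark({"broker1": 10, "broker2": 8, "broker3": 5},
                         {"broker1", "broker2"}))  # 8
```

In the sketch, surviving records keep their original offsets (2 and 4), which is why consumers of a compacted topic see gaps in the offset sequence.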