Working with Apache Kafka

Get hands-on tooling experience with architecting, programming, streaming, monitoring, and tuning your data using Apache Kafka.

Apache Kafka is the industry-leading tool for real-time data pipeline processing. Kafka serves as the key solution to addressing the challenges of successfully transporting big data. Its high-scalability, fault tolerance, execution speed, and fluid integrations are some of the key hallmarks that make it an integral part of many Enterprise Data architectures.

This hands-on Apache Kakfa training workshop gets you up and running so you can immediately take advantage of the low latency, massive parallelism, and exciting use cases Kafka makes possible. Led by an enterprise engineering expert, you’ll get live instruction and coaching on how to be effective when using Kafka in your work or project.

This “skills-centric” course is about 50% hands-on lab and 50% lecture, coupling the most current techniques with the soundest industry practices. Throughout the course, you will be led through a series of progressively advanced topics, where each topic consists of lectures, group discussion, comprehensive hands-on lab exercises, and lab review.

2 days/14 hours of instruction
Public Classroom Pricing

Starting at: $1895(USD)

GSA Price: $1420

Group Rate: $1795

Private Group Pricing

Have a group of 5 or more students? Request special pricing for private group training today.

Part 1:  Introduction to Streaming Systems

  1. Fast data
  2. Streaming architecture
  3. Lambda architecture
  4. Message queues
  5. Streaming processors

Part 2:  Introduction to Kafka

  1. Architecture
  2. Comparing Kafka with other queue systems (JMS / MQ)
  3. Kaka concepts: Messages, Topics, Partitions, Brokers, Producers, commit logs
  4. Kafka & Zookeeper
  5. Producing messages
  6. Consuming messages (Consumers, Consumer Groups)
  7. Message retention
  8. Scaling Kafka
  9. Labs: Getting Kafka up and running; Using Kafka utilities

Part 3:  Programming with Kafka

  1. Configuration parameters
  2. Producer API (Sending messages to Kafka)
  3. Consumer API (consuming messages from Kafka)
  4. Commits, Offsets, Seeking
  5. Schema with Avro
  6. Lab: Writing Kafka clients in Java; Benchmarking Producer APIs

Part 4:  Kafka Streams

  1. Streams overview and architecture
  2. Streams use cases and comparison with other platforms
  3. Learning Kafka Streaming concepts (KStream, KTable, KStore)
  4. KStreaming operations (transformations, filters, joins, aggregations)
  5. Labs: Kafka Streaming labs

Part 5:  Administering Kafka

  1. Hardware / Software requirements
  2. Deploying Kafka
  3. Configuration of brokers / topics / partitions / producers / consumers
  4. Security: How secure Kafka cluster, and secure client communications (SASL, Kerberos)
  5. Monitoring: monitoring tools
  6. Capacity Planning: estimating usage and demand
  7. Troubleshooting: failure scenarios and recovery

Part 6:  Monitoring and Instrumenting Kafka

  1. Monitoring Kafka
  2. Instrumenting with Metrics library
  3. Labs; Monitor Kafka cluster
  4. Instrument Kafka applications and monitor their performance

Part 7:  Case Study / Workshop (Time-Permitting)

  • Students will build an end-to-end application simulating web traffic and send metrics to Grafana.

Participants in this workshop should have a working knowledge of at least one programming language (preferably Python, Java, or Scala) and be able to work from the command line in a Linux VM or container.

Professionals who may benefit include:

  • Java developers seeking to be proficient in Apache Kafka.
  • Developers who are comfortable with Java, and have reasonable experience working with databases.
  • Students should also be able to navigate Linux command line and have basic knowledge of Linux editors (such as VI / nano) for editing code.
  • Data Scientists
  • Software Engineers 

  • Get Kafka up and running
  • Produce and Consume Messages
  • Write Kafka clients in Java
  • Program using Kafka API
  • Build a data streaming pipeline using Kafka Streams
  • Monitor Kafka Performance Metrics
  • Tune Kafka for Optimal Performance 
  • Troubleshoot Common Kafka Issues
  • Administer and Deploy Kafka

A full refund will be issued for class cancellations made at least 10 business days before the course begins. Payment is nonrefundable for cancellations or reschedules made within 10 business days from the course start date and for No Shows (students who do not attend class).  For reschedules made within 10 business days from the course start date, students must reschedule immediately for the same course, up to a maximum of six months from the original date.  A student may only reschedule one time.

Working with Apache Kafka Schedule

Filter by region
Filter by region
There are currently no scheduled classes for this course. Please contact us if you would like more information or to schedule this course for you or your company.

Request Private Group Training