We do not have any scheduled public training for this course at the moment. Please contact us or request a private training.

Take your knowledge to the next level with Cloudera’s NiFi Training

Students who successfully complete this course will be able to:

 • Describe Cloudera Flow Management in the context of the Cloudera Dataflow Data-in-Motion Platform

• Explain how NiFi and MiNiFi fit into the Cloudera Edge to AI paradigm

• Describe NiFi Architecture, including standalone and clustered configurations

• Describe the key features, concepts, and benefits of NiFi

 • Explain how FlowFiles, processors, process groups, controllers, and connections work together to define a NiFi dataflow

 • Navigate, configure dataflows, and use dataflow information with the NiFi User Interface

 • Trace the life of data, its origin, transformation and destination, using data provenance

• Organize and simplify dataflows

• Manage dataflow versions using the NiFi Registry

• Use the NiFi Expression Language to control dataflows

 • Describe dataflow optimization methods and available monitoring and reporting features

• Connect dataflows with other systems, such as Kafka, HDFS, and HBase

• Describe aspects of NiFi security

Prerequisites

This course is designed for Developers, Data Engineers, Data Scientists, and Data Stewards. It provides a no-code, graphical approach to configuring real-time data streaming, ingestion, and management solutions for a variety of use cases.

Though programming experience is not required, basic experience with Linux is presumed. Exposure to big data concepts and applications is helpful.

Outline

Introduction to Apache NiFi

  • Overview of Cloudera Flow Management and NiFi
  • The NiFi User Interface

Processors

  • Overview of Processors
  • Processor Surface Panel
  • Processor Configuration Panel

Connections

  • Anatomy of a Connection
  • Connection Configuration Panel
  • Connector Pop-Up Menu

Dataflows

  • Build a Dataflow
  • Command and Control of a
    Dataflow

Dataflows in Depth

  • Processor Relationships
  • Back Pressure
  • Prioritizers
  • Labels

Processor Groups

  • Anatomy of Process Group
  • Input and Output Ports

Data Governance

  • Data Provenance Events
  • FlowFile Lineage
  • Replaying a FlowFile









Dataflow Templates

  • Templates Overview
  • Managing Templates

Apache NiFi Registry

  • Apache NiFi Registry Overview
  • Using the Registry

FlowFile Attributes

  • FlowFile Attribute Overview
  • Routing on Attributes

NiFi Expression Language

  • NiFi Expression Language
    Overview
  • Syntax
  • Expression Language Editor
  • Setting Conditional Values

Dataflow Optimization

  • NiFi Dataflow Optimization
    Control Rate
  • Managing Compute
  • Managing Backlog

NiFi Architecture

  • NiFi Architecture Overview
    Cluster Architecture
  • Heartbeats
  • Managing Clusters

Site-to-Site Dataflows

  • Site-to-Site Theory
  • Site-to-Site Architecture
  • Anatomy of a Remote Process
    Group
  • Adding and Configuring Remote
  • Process Groups





Cloudera Edge Management and MiNiFi

  • Overview of MiNiFi
  • Example Walk-through

Monitoring and Reporting

  • Monitoring from NiFi
  • Overview of Reporting
  • Examples of Common Reporting
    Tasks

Controller Services

  • Controller Services Overview
  • Common Controller Services

Integrating NiFi with the Cloudera Ecosystem

  • NiFi Integration Architecture
  • NiFi Ecosystem Processors
  • A Closer Look at NiFi and Apache
    Hive
  • A Closer Look at NiFi and Apache
    Kafka

NiFi Security

  • NiFi Security Overview
  • Securing Access to the NiFi UI
  • Authentication
  • The Importance of Kerberos
  • NiFi Registry Security
  • NiFi Security Summary