0% found this document useful (0 votes)
113 views52 pages

KNIME Introduction 2023-07

The document introduces the KNIME Analytics Platform, describing how to download and install it along with an overview of the workbench interface which is used to build workflows by connecting nodes visually and process data through the workflows. Examples of common node types and how to access sample workflows from the KNIME Hub are also provided.

Uploaded by

John Kimutai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
113 views52 pages

KNIME Introduction 2023-07

The document introduces the KNIME Analytics Platform, describing how to download and install it along with an overview of the workbench interface which is used to build workflows by connecting nodes visually and process data through the workflows. Examples of common node types and how to access sample workflows from the KNIME Hub are also provided.

Uploaded by

John Kimutai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 52

KNIME Capacity Building Program for

Malawi Revenue Authority

Brief Introduction to
KNIME Analytics Platform

https://www.knime.com/downloads/full-license

These slides are a derivative of KNIME Course Material of


1
KNIME AG used under CC BY 4.0
Content of this lesson
 Download and Install
 The Workbench
 More on Nodes
 Metanodes and Components
 KNIME Community Hub
 Building your first „Hello World“ Workflow

These slides are a derivative of KNIME Course Material of


2
KNIME AG used under CC BY 4.0
Datasets
 Dataset used: adult dataset
 Example Workflows:
 „My First Workflow“ (https://hub.knime.com/-/spaces/-/latest/~8HwhURN_OH6yh37q/)
 Read from CSV file
 Filter rows and columns
 Write to CSV file

These slides are a derivative of KNIME Course Material of


3
KNIME AG used under CC BY 4.0
Download and Install

4
KNIME Analytics Platform
 Open and opensource modular Data Science platform
 Covers all the data science needs:
Data Data
Data Access
Preparation Visualization

Machine
Testing Deployment
Learning

 Based on the visual programming paradigm


 Provides a diverse array of extensions:
 Text Mining
 Network Mining
 Cheminformatics
 Deep Learning
 Many integrations, such as Java, R, Python, Weka, Keras, Plotly, H2O, etc
 ... And more

These slides are a derivative of KNIME Course Material of


5
KNIME AG used under CC BY 4.0
KNIME Hub
Blend & Transform Validate & Deploy

Model & Production Consume &


Creation Production
Visualize Process Interact

Optimize & Capture Monitor & Update

KNIME Analytics Platform KNIME Hub


Prepare & Build Collaborate & Deploy

These slides are a derivative of KNIME Course Material of


6
KNIME AG used under CC BY 4.0
Installation

https://www.knime.com/downloads

 Select the KNIME Analytics Platform version for your computer:


 Mac
 Windows – 32 or 64 bit
 Linux
 Download the archive and extract the file, or download the installer package and
run it

These slides are a derivative of KNIME Course Material of


7
KNIME AG used under CC BY 4.0
The Workbench

8
The KNIME Workspace
 The workspace is the folder/directory in which workflows (and potentially data
files) are stored for the current session.
 Workspaces are portable (just like KNIME Analytics Platform)

These slides are a derivative of KNIME Course Material of


9
KNIME AG used under CC BY 4.0
The KNIME Analytics Platform Workbench

KNIME Explorer

Node Description

Workflow Coach

Workflow Editor

KNIME Community Hub


Node Repository

Console & Node Monitor

Outline

These slides are a derivative of KNIME Course Material of


10
KNIME AG used under CC BY 4.0
Workflow

A workflow is a pipeline of nodes, each configurable to perform a specific task.


The data flow through nodes from left to right

These slides are a derivative of KNIME Course Material of


11
KNIME AG used under CC BY 4.0
KNIME Explorer

 In LOCAL you can access your own


workflow projects.
 Other mountpoints allow you to connect
to
 EXAMPLE Server
 KNIME Hub
 KNIME Server
 The Explorer toolbar on the top has a
search box and buttons to
 select the workflow displayed in the active editor
 refresh the view
 The KNIME Explorer can contain 4 types
of content:
 Workflows
 Workflow groups
 Data files
 Shared Components

These slides are a derivative of KNIME Course Material of


12
KNIME AG used under CC BY 4.0
Creating a new workflow
Click anywhere on the KNIME Explorer to create a new workflow or workflow group

These slides are a derivative of KNIME Course Material of


13
KNIME AG used under CC BY 4.0
Importing and Exporting Workflows

Right-click anywhere in KNIME


Explorer to import a workflow Right-click on a workflow or
workflow group to export the
selected workflow

These slides are a derivative of KNIME Course Material of


14
KNIME AG used under CC BY 4.0
Node Repository

 The Node Repository lists all


KNIME nodes

 The search box has 2 modes


 Standard Search – exact match of node
name
 Fuzzy Search – finds the most similar node
name

These slides are a derivative of KNIME Course Material of


15
KNIME AG used under CC BY 4.0
Description

 The Description window gives


information about:
 Node Functionality
 Input & Output
 Node Settings
 Ports
 References to literature

These slides are a derivative of KNIME Course Material of


16
KNIME AG used under CC BY 4.0
Workflow Description

 When selecting the workflow, the


Description window gives
information about the workflow’s:
 Title
 Description
 Associated Tags and Links
 Creation Date
 Author

These slides are a derivative of KNIME Course Material of


17
KNIME AG used under CC BY 4.0
Workflow Coach
 Node recommendation engine
 Gives hints about which node to use next in the workflow
 Based on KNIME communities' usage statistics
 Based on your own KNIME workflows

These slides are a derivative of KNIME Course Material of


18
KNIME AG used under CC BY 4.0
Console and Other Views

 Console view prints out error and


warning messages about what is
going on under the hood

 Click on View and select Other… to


add different views
 Node Monitor, Licenses, etc.

These slides are a derivative of KNIME Course Material of


19
KNIME AG used under CC BY 4.0
More on Nodes

20
More on Nodes…
 Nodes are the basic processing units of a workflow
 Each node has a number of input and/or output ports
 Data is transferred over a connection from an out-port to the in-port(s) of other
nodes
 Under each node, a traffic light shows its status

These slides are a derivative of KNIME Course Material of


21
KNIME AG used under CC BY 4.0
Data Port Types
 A pipeline of such nodes makes a workflow
 The result of the node’s operation on the data is provided at the out-port to
successor nodes
 Only port of the same type can be connected

Model
Flow Variable

Image

Data DB Connection DB Data

These slides are a derivative of KNIME Course Material of


22
KNIME AG used under CC BY 4.0
Node Configuration
 Most nodes require configuration
 To access a node configuration window:
 Double-click the node
OR
 Right-click > Configure

These slides are a derivative of KNIME Course Material of


23
KNIME AG used under CC BY 4.0
Node Execution
 Right-click node
 Select Execute in context menu

If execution is successful, status shows green


light

If execution produces warnings, status show


yellow triangle

If execution encounters errors, status shows a


red X

These slides are a derivative of KNIME Course Material of


24
KNIME AG used under CC BY 4.0
Node Views

Interactive View

Data View

These slides are a derivative of KNIME Course Material of


25
KNIME AG used under CC BY 4.0
Frequently Used Nodes

Readers GroupBy
Writers

Filters Partitioning

Learners/Predictors

These slides are a derivative of KNIME Course Material of


26
KNIME AG used under CC BY 4.0
Tidy up workflows
 Workflow can easily become complex and difficult to understand

These slides are a derivative of KNIME Course Material of


27
KNIME AG used under CC BY 4.0
Metanodes and Components

28
Tidy up workflows
 Metanodes and components can help tidying up, encapsulating nodes performing
common operations

These slides are a derivative of KNIME Course Material of


29
KNIME AG used under CC BY 4.0
Components

Steps to build a component or a metanode


 Select related nodes that you want to group
 Right click
 Select Create component… or Create Metanode…
 Give it a name

Components have more sophisticated features:


 Encapsulate flow variables, i.e. the parameters only live inside the
component
 Provide a configuration window: variables and parameters
within the component can be edited by Right Click -> Configure…
 Build a composite view: Visualization inside the component can
be grouped in a dashboard

These slides are a derivative of KNIME Course Material of


30
KNIME AG used under CC BY 4.0
Submenu Component
 Right click on the Component and select Setup… from the submenu Component
to access further customization settings, such as the component name and the
ports

These slides are a derivative of KNIME Course Material of


31
KNIME AG used under CC BY 4.0
Inside a component

Shortcut:
Ctrl + double click on
component to open its
content

These slides are a derivative of KNIME Course Material of


32
KNIME AG used under CC BY 4.0
Components Configuration Window

 Components can be configurable


 From the configuration window
(Right click -> Configure…) the
user can enter some parameters
 The entered parameters change
the behaviour of the nodes inside
the component

These slides are a derivative of KNIME Course Material of


33
KNIME AG used under CC BY 4.0
Components Composite View

The visualization nodes


within the component can be
organized to build an
interactive composite view

You can organize and


reshape the node
views from the Visual
Layout window (from
inside the component,
last icon on the toolbar)

These slides are a derivative of KNIME Course Material of


34
KNIME AG used under CC BY 4.0
Composite views interactivity

Enable publication and


subscription to selection events
to make the composite view
interactive: data selected in one
view are highlighted in the others

These slides are a derivative of KNIME Course Material of


35
KNIME AG used under CC BY 4.0
KNIME Community Hub

36
KNIME Community Hub

A place to share knowledge about Workflows and Nodes https://hub.knime.com

These slides are a derivative of KNIME Course Material of


37
KNIME AG used under CC BY 4.0
KNIME Community Hub

Workflows

Nodes, Shared Components and


Extensions

These slides are a derivative of KNIME Course Material of


38
KNIME AG used under CC BY 4.0
KNIME Community Hub Spaces

 Private Space
 Your personal space. Upload here your
workflows and components (max 1GB) to
have them always available in a central
place

 Public Space
 Shared with the KNIME community.
Everyone can find and download them
from the KNIME Hub

These slides are a derivative of KNIME Course Material of


39
KNIME AG used under CC BY 4.0
Downloading and importing from KNIME Hub

Searching for the Tag


“theguidebook” will show you all
the workflows related to this book

These slides are a derivative of KNIME Course Material of


40
KNIME AG used under CC BY 4.0
Downloading and importing from KNIME Hub
Method 1
Download the workflow, locate it
into your machine and import it
as seen before

These slides are a derivative of KNIME Course Material of


41
KNIME AG used under CC BY 4.0
Downloading and importing from KNIME Hub
Method 2
Drag and drop the icon
directly into the KNIME Explorer
at the desired location

These slides are a derivative of KNIME Course Material of


42
KNIME AG used under CC BY 4.0
KNIME Cheat Sheets

https://www.knime.com/cheat-sheets

These slides are a derivative of KNIME Course Material of


43
KNIME AG used under CC BY 4.0
KNIME Books

e-book downloads from KNIME Press


https://www.knime.com/knimepress

with code: <Promotion-Code>

These slides are a derivative of KNIME Course Material of


KNIME AG used under CC BY 4.0
Free Self-Paced Courses
 https://www.knime.com/knime-self-paced-courses

These slides are a derivative of KNIME Course Material of


45
KNIME AG used under CC BY 4.0
Build your first Hello Workflow

46
Create your first workflow

Right Click on the LOCAL folder in the KNIME


Explorer and select New KNIME Workflow

From the pop up


window, insert the name
of your first workflow

These slides are a derivative of KNIME Course Material of


47
KNIME AG used under CC BY 4.0
Read the dataset

Drag and drop the File


Reader node from the
Node Repository to add it
to the workflow

Open the configuration


window (double click) and
select the file on your
machine containing the adult
dataset

These slides are a derivative of KNIME Course Material of


48
KNIME AG used under CC BY 4.0
Remove columns

Some columns have


unnecessary
information. Remove
them with a Column
Filter node

These slides are a derivative of KNIME Course Material of


49
KNIME AG used under CC BY 4.0
Remove Rows

Add a Row Filter node and


configure it to only keep
entries whose “native-
country” value is not
“United-States”

These slides are a derivative of KNIME Course Material of


50
KNIME AG used under CC BY 4.0
Write to new file

Finally add a CSV Writer node to the


pipeline.
Configure and execute it to write the
transformed dataset to a new file

These slides are a derivative of KNIME Course Material of


51
KNIME AG used under CC BY 4.0
Annotations
 Annotations are coloured editable boxes that you can add to your workflow
 They help you making it more readable and visually pleasant

Click on the upper left


corner icon to
customize text and
appearance of an
annotation

Right click anywhere on your


workflow and add a New Workflow
Annotation from the context menu

These slides are a derivative of KNIME Course Material of


52
KNIME AG used under CC BY 4.0

You might also like