Tips Tricks 17
Tips Tricks 17
Tips Tricks 17
VERSION 11 | 2017
Table of Contents
Alteryx Resources . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
Sample Workflows, Alteryx Community, Gallery, Alteryx Blogs & Help
Documentation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
The not-so-new kits on the block: Alteryx Starter Kits . . . . . . . . . . . . . . . . . . 10
Product Training and it’s many flavors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
Alteryx Support team…at the ready !!. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
Licensing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
Workflow Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
Set yourself up for success . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
Personalize your Alteryx canvas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
Turn back time with Autosave Workflows. . . . . . . . . . . . . . . . . . . . . . . . . . . 16
Collect and display data profile Information NEW to 11. . . . . . . . . . . . . . . . 17
Bringing your data in. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
No need to Reconnect. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
Refresh with F5 and other shortcuts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
Just Browsing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .23
Mad about organization? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
The Go-To Time Savers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
Quickly Forget those missing fields. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .29
Close All But This NEW to 11. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
Open Containing Folder NEW to 11. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
Global Variables, the only Constant is Change . . . . . . . . . . . . . . . . . . . . . . . 30
Pinned Possibilities. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
Workflow Optimization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
Resource Optimization. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
Lean for Speed . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
Data Preparation & Blending . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
Sometimes You Feel Like a Join, Sometimes You Don’t:
Joining and Blending Data. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
Summarize . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
When You Got It, Select It: Tips and Tricks for the Select Tool. . . . . . . . . . . . 47
Plop, Plop, Fizz, Fizz, Oh, What a Formula It Is: Tips and Tricks
for the Formula Tool. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
Tile Tool . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
Filters, Filters, Everywhere: ACE MarqueeCrew’s Trick to
Optimize Filtering on Large Datasets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
Get More from DateTime: New to 11 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
Spatial Tips and Tricks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
Spatial Processing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
Map Input. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
Browsing Spatial Data. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
Tom Tom Spatial Data Extraction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
| 2
Database Processing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
Database Connections Best Practices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
Connecting to Databases. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
Workflow Efficiency. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
Writing Out . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
IN-DB. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
Connectors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73
Community Resources for Connectors. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
Create Your Own Connector. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
Analytic Apps and Macros . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
Analytic Apps Best Practices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
Apps and Macros Troubleshooting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
Debug. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
Keep Your Values . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
Manageable Macros . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
Macro Input Shortcut. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
Macro Building. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
Server and Gallery . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
Workflow and Macro Sharing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
Run As . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
Workflow Execution Permissions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
Schedule from the Gallery: NEW to 11! . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
Gallery Admin Help Page: NEW to 11! . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
Alteryx Server Usage Report: Monitoring and Reporting on
Alteryx Across the Organization: NEW to 11! . . . . . . . . . . . . . . . . . . . . . . . . . . 90
Tool Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
3 |
Getting Started –
Alteryx Resources
One-Tool Examples:
Within the Sample workflows menu, you’ll find a number of One-Tool Examples. These
are simple to follow, fully annotated, use case examples for specific tools within Alteryx.
There are a number of tools available across many different tool categories:
Originally debuted in version 11.0, find one tool samples when you search a tool by name
in the global search bar. New to Version 11, also find many updated and new samples.
Gallery:
Check out the Analytics Gallery at http://gallery.alteryx.com. Here you can browse
the Alteryx Public Gallery to find apps that solve business problems across a variety
of industries. Many of these apps can also be downloaded to your machine and
opened in your Designer! Use the gallery to learn how apps are constructed and
how data flows through properly configured tools. You can reverse engineer apps
you are interested in and incorporate those features into your own workflows. The
Macro District is also available and always growing with useful macros that simplify
reusable processes. You can download Macros from the Gallery and add them to
future workflows.
Note: Many of the tools make use of functionality recently added to the Alteryx
Predictive Plug-In. As a result, we highly encourage community members to upgrade to
the most recent Predictive Plug-in, available from the Alteryx Download site. In addition,
some tools require additional R packages. The tools attempt to install the needed R
packages, but are not always successful due to firewall and proxy server issues.
Alteryx Blogs:
Look to blogs written by Alteryx employees to learn more about new macros and
Alteryx concepts at Engine Works Blog (community.alteryx.com) and Chaos Reigns Within
(http://www.chaosreignswithin.com/). The Engine Works Blog features information
about new macros published in the Macro District, details about new releases, Inspire
related posts, and more. Chaos Reigns Within is the personal blog of a Software
Developer at Alteryx and he publishes a Blog Macro Pack each quarter. Some of the
macros are prototypes and most have not gone through the extensive testing process it
takes to make it into the product, but you just might find a solution to the problem you
have been struggling with and it is as easy as downloading a macro!
There are four ways to access the Help file from the Designer.
2. Open any tool’s configuration window and click on the Help icon:
3. Click on any tool on your canvas and click the help icon in the top right hand
corner of the Designer:
Also within the help documentation is useful information about building macros,
constructing formula expressions using the built in functions, data type explanations
and more!
Weekly Challenge
Visit https://community.alteryx.com/t5/Weekly-Challenge/bd-p/weeklychallenge
to participate in our weekly challenge. The challenges are a fun and interactive way
to engage with the Alteryx Community and sharpen your skills. Previous weekly
session exercises make great training materials to help ramp up new teammates.
Product Training
Alteryx offers an abundance of training online at http://www.alteryx.com/
product-training. The Getting Started section is great for beginners and includes
short videos, accompanying help files, and exercises to solidify the concepts. The
On Demand videos cover a variety of topics and can be used to learn about a
subject in general or you can watch a video pertaining to the tool or process you
are specifically interested in exploring. The Self-Paced Training section contains
various exercises at the Beginner, Intermediate, and Advanced skill levels for users
to download and complete at their own speed. Solution files are also available for
download here so you can compare your answer to a best practice solution.
Community (24/7 Self Service): Want answers now? The Alteryx Community has a
host of articles, documents, and answers to commonly asked questions.
Email (8-8 EST): Need to open a support ticket? Email [email protected] with
complex questions that may require screenshots, sample workflows, or data files.
Chat (8-8 EST): For quick and simple questions that can be solved in a brief text-
only exchange.
https://community.alteryx.com/t5/Alteryx-Knowledge-Base/Licensing-Series-
Part-1-Welcome-to-Alteryx/ta-p/37534
Installing a license:
https://community.alteryx.com/t5/Alteryx-Knowledge-Base/Licensing-Series-
Part-2-Installing-Licenses/ta-p/37626
Managing a license:
https://community.alteryx.com/t5/Alteryx-Knowledge-Base/Licensing-Series-
Part-3-Managing-Your-License/ta-p/37715
If your question isn’t answered in the community articles, you can get support for
licensing by emailing [email protected]
Workflow Layout
Build out your workflows vertically or horizontally. To change a single workflow, click
anywhere on your canvas, and in your Workflow – Configuration window, under
Canvas options, select your Layout Direction as horizontal or vertical.
Connection Progress:
The Canvas tab under Workflow –
Configuration provides three options for
the Connection Progress. If you prefer
an uncluttered look, you can “Hide” the
connection progress. If you’d like to see
it as your workflow is processing, you
can select “Show Only When Running”.
If you’d like to be able to take a closer
look at the number of records process, e.g. if you are troubleshooting a workflow,
you have the option to keep them visible, “Show”, even after the workflow has
finished processing.
Find the Autosave settings at Options > User Settings > Edit User
Settings > Advanced.
Quickly look at the Data Quality section in the Profile tab of the Browse tool when
selecting that field you want to check. Red is for Not OK (maybe white spaces), Yellow is
for Null because there are no values, Gray for Empty and Green for Good to go.
A fast way to open workflows when Alteryx is already open, drag the workflow file
(YXMD) from Windows Explorer and drop it on the canvas; the workflow will open in
the same window but on a new tab. This saves lots of time because you don’t have to
wait for a new instance of Alteryx to open each time.
Explorer Box
This handy dandy tool
allows you to point to
a directory to show
all the files contained.
From there (as explained
above), drag one or more
files from the list and
drop them to the canvas.
Copy and Paste from a Browse tool back to the canvas to create a Text Input containing
the data. Great way to fast “test” inputs or to create fast lookups / data cleans.
Limiting the number of records coming in while building your workflow will speed up
testing. This will return the top records.
Record Limit all the Input tools in the workflow at once: Workflow Configuration >
Runtime tab > Record Limit for All Inputs. When Record Limit is set on the file and to
All Inputs, the smaller number takes precedent.
This can be useful whenever running into delimiter issues such as a CSV file with extra
commas. Bring your data with \0 and it will ignore delimiters, placing all your data in one
field. Increase the Field Length and use the Text to Columns to split your data up.
No need to Reconnect
Connecting and Reconnecting tools on
our canvas is how we build our
masterpieces a.k.a. workflows.
When deleting a tool that is connected, Right Click on the tool and select Delete and
connect around. This won’t be enabled for tools that have multiple inputs and/or outputs.
Your incoming data source has been updated or is just different- Press F5 to refresh
all the tools with the new metadata.
By design, tool configurations are not being refreshed If the Disable Auto
Configure option has been selected in the User Settings—press F5 to manually
refresh tool configurations.
Editing your complex workflow, your big data source is located on the network drive or
Allocate tools are present, check this box (Options > User Settings > Edit User Settings)
and you may find resources are better allocated and optimized.
Cell Viewer
Available on the Browse tool and Results
window, Click on the Cell Viewer to view
metadata for each column (when clicking on
the column name) or the content of a selected
cell (when clicking on the data).
Easily compare two or more records in the Browse tool with two different
methods below:
Highlight the desired records with Ctrl + Click/Drag, then go to the ‘Open results
in a new window icon’ on the upper-right corner and select: New Window
(Selected Records)
Highlight the desired records with Ctrl + Click/Drag, go the up and down arrows
on the top bar of the results window and use them to toggle between all of your
selected records
Rapid Results
Accidentally closed the results window? No need to go back into the View menu
to enable it. Simply click on the anchor for any tool on your canvas and the Results
window will reappear…magic!
The Results window has your data’s metadata, no need to run the workflow to see it.
Designing on your canvas and need to quickly see the metadata, wanting to save precious
seconds? Click on the connection to see the metadata moving through that connection.
Anchor Limits
Adjust the memory limit per anchor in the User Settings (Options > User
Settings > Edit User Settings > Advanced tab > Tool Result Settings). It can also
be adjusted in the System Settings > Engine
Put long/short comment along the top and tall/skinny comment along the left, stretch
these to size the tool container to your desired fixed size.
If you’re a bit OCD about perfectly aligned tools and connections in your workflow, then
you are going to love these shortcuts! Use Ctrl+Shift+- or + to align vertical or horizontal.
Right Click & Paste to control where copied tools get placed on the canvas, rather than Ctrl-V.
Use the tools in the ‘Documentation’ toolset to keep things clear and clean for others
to understand (or your future self when you get asked to update something you haven’t
touched in 6 months)
Tools with an Expression panel support block comments (/*Comment*/) and single line
comments (//Comment) allowing more documentation capabilities over and above the
Comment tool and the tool annotations.
Nimble Navigation
When the time comes when your little workflow is not so little anymore, you’ll need
these navigation tips & tricks handy.
If scrollbars are not your thing, press on the mouse wheel and move up/down or
left/right, or the combination of space bar + Right Click.
The Overview view (in the View menu bar) allows you to navigate through your
workflow while it is running.
• Double click the scroll wheel on your mouse zooms to whole workflow.
(Ctrl + 0 for the keyboard shortcut lovers)
Pinned Possibilities
Is there one or more tool categories you use on a regular basis and you find yourself
scrolling through the tool palette? Good news, You can pin those categories to the
front of the tool palette. Right click on any tool category and select
Pin [Category Name].
The good news is that most of the resource utilization can be controlled. You can
limit the amount of memory that is used on a system, user, or workflow level.
The Engine Sort/Join memory setting defines the minimum amount of memory the
engine will consume while performing operations such as sorts and joins. Other tools
will use memory outside that sort/join block, some of which (e.g. drive times with a
long maximum time) can use a lot.
The global Default Dedicated Sort/Join Memory Usage at System level can be
found at Alteryx → Options → Advanced Options → System Settings → Engine →
Default sort/join memory usage (MB)
You can also edit sort/join memory under the user settings if the option “Allow users
to overwrite these settings” is checked with the engine settings.
Other considerations:
5. Run Alteryx at a lower priority: This will ensure that the Alteryx Engine runs at
a lower priority than all the other applications running on the same machine. By
doing so, even the Alteryx GUI will remain responsive when you are running a
large Workflow in the background. This is an especially good idea for a shared
server and recommend for a server hosting a Gallery. Alteryx → Options →
Advanced Options → System Settings → Engine
6. Shared Servers: For a shared server, the system owner/IT person should set the
memory to no more than (total memory-2GB)/(Number of Users). This way if
all the users are running workflows at the same time the system won’t go into
virtual memory, which significantly slows things down.
7. Web Servers: When running Alteryx on a web server, you want to set the
memory usage to the smallest possible without impacting the performance
too much. We recommend trying a system memory setting of 64MB and then
increasing the memory on a per workflow basis as needed. It is important to note
that the user setting for memory usually has no impact since the web service
typically runs as a separate system user. Make sure to use the system settings.
8. Background Processing: Any time you are planning to run a workflow in the
background while continuing to do other work, it is a good idea to run it with
less memory.
9. It is also a good idea to have the temporary directory point to a separate physical
hard drive from your boot drive. If your temp drive points to C:\temp and you run
a Workflow that consumes 100’s of GB of Temp space (it happens), your system
may become unstable.
If you are reading large DB inputs or bulk loading excel/csv files, it is in most cases
worth to write it to a .yxdb and continue to work with that .yxdb until the flow is
done. This is also the case when using the download tool. Save the data locally and
work on from there.
Below is an example of data types before and after the AutoField tool:
Another benefit of using the AutoField tool is that it will reduce the size of your output file.
You have the option to overwrite the existing fields or to create new fields. If you
decide to copy the fields, you will have the option to change the field names by
adding a prefix or suffix.
You also have the option to change the output type. This is convenient if you are
converting dates to strings, or numbers to strings to format them.
Selecting [_CurrentField_] under the variables will modify all of the selected fields.
You also have the option to use specific fields.
Join is AMAZING if Left and Right inputs have the same record
count and order. Suppose that you’ve used a select tool and do
some functions to create new columns of data that you want to join
back to the original data. You can use the join and use the “Join by Record Position”
configuration (radio button). You can’t get any faster than that. Your data is NOT sorted.
If you are joining a Large set of data to a small set of data (Large is large and small is <
32,000 rows) the Find Replace offers you the ability to put the small data into memory
and avoid the expensive sort. Imagine with 138 million records using a TILE tool and
then looking up eight (8) length descriptors from a text input tool. The JOIN version runs
in 3:40 minutes and the FIND REPLACE version runs in 3:09 minutes.
FIND REPLACE also allows for case insensitive and partial matching. If you’re ever
applying descriptions or labels to values, this is the recommended approach.
If you are appending fields using the Join tool, rename the fields with a prefix
(or suffix) before joining to another data set that contains the same field names.
Then you can union all 3 outputs to complete a full outer join without any
issues with misaligned fields.
Consider a use case where you have stores and customers each with addresses.
Without adding prefixes the join would produce Right_ prefixes on the data, making
it hard to tell which address belongs to the customer and which belongs to the store
After adding prefixes, the join fields are neat and clearly labeled with the source they
came from.
A common issue when viewing joined data is that both streams can share certain fields
(in particular, this can be an issue with the join field). Use Options > Deselect Duplicate
Fields in order to remove duplicates in one quick and easy step.
Did you know that you can use Summarize to create character-delimited fields? Use the
Text-to-columns to disassemble the string. Then use Summarize with a concatenate to
put it back together again.
To reorder multiple fields at once highlight them, then right-click and drag
Changed your mind? To revert to incoming field order choose Options > Sort
After changing your data set, choose options > forget all missing fields to remove the
missing fields from your select tool. This will make configuring the fields much easier and
give you a cleaner look.
When you have a stable workflow that you want to use in a production environment,
put a Select tool directly after the input and uncheck the row labeled *Unknown at the
bottom of the list of fields. That way if your input data changes, then the workflow won’t
be able to add any new fields that your destination file or database is not expecting.
When writing range expressions, a common method is to include a top and bottom
for each category.
Once the first test is passed, you need not ask the lower range question again.
MarqueeCrew ran a test on the logics above and found timings for “Before” on
135,708,968 records are: 3:24 minutes versus 3:06 minutes. In addition to spending
less time coding the formula, there are performance benefits to be realized by the
second formula. By knowing where a skew is in a particular category, the formula
could be coded to ask that question first.
If you have a numeric field that has nulls that you want to quickly change to zero, try
using the ToNumber function: ToNumber([Field1]). Also suggested by ACE danielbrun
Data Cleansing
Imputation Tool
Ever wanted to create a record ID with a group by? Consider using the Tile tool to create
a Record ID for each group in your data.
Simply use the Tile Method “Unique Value” and select the field(s) you would like to have
the record IDs “grouped” on. This will reset the record ID (renamed from the “Tile_
SequenceNum” field) counter for each unique value in your specified Unique Fields.
Doing this as one statement in one filter tool, you would be testing more conditions
(400+MM conditions in the US) than if you first tested for Postal Code then filtered for
Residency and/or Presence of Children separately. The Postal Code would net you a few
thousand records. The subsequent filter/tests would get you down to the few records faster.
When working with large sets of data this is more apparent and the benefits can add
up. There are 169 households out of 139,709,868 records. With one filter the job
runs in 3:09 minutes. With three filters the job runs in only 2:08 minutes.
Users may now type in custom formats and see an example of the output, providing
greater flexibility when working with DateTime data
1. The Spatial Match tool will put everything in the Universe (U) tab into a
temporary YXDB with a spatial index.
2. Then it has to look at every Target (T) object, but it can quickly ignore all
Universe-side objects whose bounding rectangles don’t intersect the bounding
rectangle of the Target object. [A Bounding Rectangle is the rectangle that
bounds the spatial object.]
3. As an example, for the common case of a smaller number of larger objects (e.g.
Store trade areas) being matched against a large, wide-spread set of smaller
objects (e.g. nationwide customer points); it’s better to put the large set of wide-
spread small objects on the universe side.
4. There are many multiple methods for spatially matching two sets of spatial
objects. A venn diagram of each of those methods can be found in the Spatial
Match Tool’s Help Menu.
5. The records that come from the Match (M) tab will be Target (T) records whose
object had a match from the Universe (U) stream. The Universe object and
selected fields are joined to the Target Record. The records from the Unmatched
(U) tab will be Target records whose object had no match from the Universe
stream.
6. The IMPORTANT Message is: The Spatial Match can ignore most Universe
records that won’t match the Target record without even looking at them.
When using spatial process with large datasets, consider using file input
within the tool for increased performance.
For large point in polygon spatial matches consider loading the point layer
to a Calgary layer, then using a Calgary join tool. Configure the join to
map spatial field to spatial field.
Map Input
Set default location
To make development faster, use a default location for the map input tool
so you don’t need to scope each time
Or, if you prefer, you can access the file for all geographies by locating the Data
folder in the install directory.
Documentation for the layers can be found in the documentation folder under ‘spatial’
Database Processing | 60
Database Processing
Database Connections Best Practices
Alteryx can access data that resides in a database and either bring that data into memory
in Alteryx for processing or conduct the processing within the database itself via the in-
database tool set. Here are a few tips to follow when connecting to databases.
https://help.alteryx.com/11.0/index.htm#DataSources/SupportedDataSources.
htm%3FTocPath%3D_____4
Connecting to Databases
Saved Data Connections
Make use of saved data connections (Options > Advanced Options > Manage Data
Connections) if you need to share and develop workflows among a team. Managing these
connections has been made easier with a new option to manage them through the gallery.
Database Processing | 62
MS SQL Server
The familiar ODBC and OleDB options can still be found under Other Databases.
63 | Database Processing
Oracle
Connection Options for Oracle – New for 11
For version 11, Alteryx has introduced an option to connect to Oracle without
having to set up a tnsnames.ora file or installing a driver first.
To use this connection, all you need from your Oracle DBA is the Oracle hostname,
Service Name, Username, and Password.
Please see the community for more detail on this option: https://community.
alteryx.com/t5/Alteryx-Knowledge-Base/Connecting-to-Oracle-Alteryx-11-0/
ta-p/46654
Other
If you want to give users the option to read in certain data sets but you don’t want to
give them access to the dataset or have access to the credentials, set up an input tool
that reads in the data inside an encrypted macro. To encrypt the macro, go to Options >
Advanced Options > Encrypt Workflow
Database Processing | 64
Use Native client
It is best to use the Native Client Driver provided by the database company. Make
sure you are selecting the version that matches your database. Native Driver names
would look like OraClient12c_home2 for Oracle and SQL Server Native Client 11 for
MS SQL Server.
65 | Database Processing
Workflow Efficiency
Limit data being brought into Alteryx
Use the Visual Query Builder and/or SQL Editor to limit the number of columns and
rows being brought in.
Use the dynamic input tool to dynamically update SQL queries, speeding up your
processing time as it filters the data and only returns the user specified criteria.
Database Processing | 66
Do not show % complete
When checked, Alteryx will not try to report the status of reading in the file, thus
speeding up the reading time.
Messages in the output window will indicate whether cached data or live data is used.
67 | Database Processing
Teradata: specify AMPS
Teradata allows the user to specify the Number of Sessions (AMPS) to
speed up processing.
Database Processing | 68
Test query
Once you have entered a query in the SQL Editor window, use the Test Query
button to verify its syntax.
Writing Out
Commit as you go
The transaction size option in the output tool determines how many records are
committed to the table at once. Be default, the option is set to 10000 records.
Increase or decrease the size to increase efficiency with your database.
69 | Database Processing
Faster Write to Hive
Hive ODBC can be slow when writing to
tables. If you are looking to create a new
table or overwrite an existing table, use
the IN-DB tools with the write option set
to HDFS(Avro) to improve speeds (NOTE:
IN-DB does not support updates). You can
use the Data Stream In tool instead of the
regular output tool:
IN-DB
The IN-DB tools help users build complex queries to be executed on the database
without needing to have a deep knowledge of the database query language. The
query components are put together using tools with a look and feel similar to the
regular tools used in Alteryx. Alteryx then compiles a query based on the tools on
the canvas and sends it to the database to be executed there instead of on your
local machine, thus taking advantage of the database server’s processing power.
Find more information about the IN-DB tools in our product documentation: https://
help.alteryx.com/11.0/index.htm#In-DatabaseOverview.htm%3FTocPath%3DIn-
Database%7C_____0
Database Processing | 70
Supported Databases
For IN-DB we currently support the following databases:
•• Hive •• Oracle
•• Impala •• Spark
71 | Database Processing
Do simple operations without knowledge of query language
While some SQL knowledge is required for more advanced operations (e.g.
when creating conditional statements), the IN-DB tools don’t need full SQL
statements to function.
For more details, check this Alteryx Community post at: http://community.alteryx.
com/t5/Alteryx-Knowledge-Base/Alteryx-In-DB-Connection-File/ta-p/17574
Database Processing | 72
Connectors
73 | Connectors
Connectors
Community Resources for Connectors
Did you know that in addition to Alteryx Help Documentation, there is a wealth of
information for connectors at community.alteryx.com? Many Connectors have full
knowledge base articles with step by step instructions.
Publish to Power BI
https://community.alteryx.com/t5/Alteryx-Knowledge-Base/The-
How-to-Guide-to-Power-BI/ta-p/18952
Adobe Analytics
https://community.alteryx.com/t5/Alteryx-Knowledge-Base/The-
How-to-Guide-to-Adobe-Analytics/ta-p/21304
Download Tool
https://community.alteryx.com/t5/Alteryx-Knowledge-Base/Tool-
Mastery-Download/ta-p/29583
Database Processing | 74
Google Analytics
https://community.alteryx.com/t5/Alteryx-Knowledge-Base/The-
How-to-Guide-to-Google-Analytics/ta-p/15137
Google Sheets
https://community.alteryx.com/t5/Alteryx-Knowledge-Base/The-
How-to-Guide-to-Google-Sheets/ta-p/20354
Salesforce
https://community.alteryx.com/t5/Alteryx-Knowledge-Base/The-
How-to-Guide-to-the-Salesforce-Connector/ta-p/18100
Tableau
https://community.alteryx.com/t5/Alteryx-Knowledge-Base/The-
How-to-Guide-to-Publishing-to-Tableau/tac-p/57951#M1704
75 | Connectors
Create Your Own Connector
Need a connector not included in Alteryx? Create your own! We’ve got a wealth of
resources posted online to help you build one.
Connectors | 76
Analytic Apps
and Macros
Interface Designer
2. Dragging any Interface Tool to the canvas automatically changes your Workflow
Type to “Analytic App”.
Connectors | 78
4. You can move Analytic App questions around the UI using the Interface Designer.
Connections
5. To make the app look less cluttered, it is recommended to use wireless
connections for the action tools. To make your connections wireless, click on the
connection and check the Wireless box, or right click on a tool and select Make
Incoming/Outcoming connection wireless.
Manageable Macros
Custom macros are a great way to make Alteryx more efficient for your use cases,
allowing for increased flexibility, customization, and shareability. To save yourself
some time, try creating a custom Macro tool category within the Designer. To do so,
follow the simple instructions below:
5. Enter a name for your custom category, and enter the path where you have your
macro(s) stored:
6. Scroll to the end of your Tool Categories and you’ll see your custom folder with
any macros stored in that directory as Tool Icons you can now quickly drag and
drop into your workflow.
To create or change the Connector Abbreviation, go to the Configuration tab for the
Macro Input and Output tools
When working with interface tools (drop down, list box, textbox etc.), you can use
the %Question.UniqueToolName% syntax to call the user input in any tool. During
debugging this allows you to run an app like a normal workflow by giving it test data
under the workflow tab of the designer configuration window, as well as cleaner look
due to no action tools.
Workflows and Macros are uploaded the same way Alteryx Apps are. Simply open the
Workflow in the Alteryx Desktop Designer and click File -> Save As. Select the Gallery
you wish to save them to. If you need to make changes to what will be included with
your Workflow or Macro, like including sample data, click the Manage workflow assets
link to make changes. When you are ready, click the Save button. Once the Workflow
or Macro has been published, you can find it in your Private Studio.
To set up a Tag, the Curator of your Gallery must log in and go into the Admin
section of the Gallery. From there, click on the Workflows option on the left, and at
the bottom you’ll see the Tags area. Admins can create custom tags for users to add,
and can also include the option for certain tags to be only usable by Curators, by
selecting the “Admin Only” option.
Once created, any Artisan who uploads a Workflow, App, or Macro to the Gallery can
go into their Private Studio, select a workflow, and click on the Workflow Properties
link. Here you’ll be able to select from the list of tags available within your Gallery.
To add a tag to a District, the Curator for your Gallery must first create a District,
and then in the setup for that District, can select tags to apply. Once the tags are
applied to the District, any Workflow that is added to your company’s public gallery
with the tags associated with that District will be available within the District itself.
Note that only one user account can be specified here and it applies to all jobs run
on the server either through the Scheduler or the Gallery. If you have multi-node
deployment with multiple workers, each Worker machine will need this setting
entered in order to ensure all workflows can run regardless of which Worker they
are sent to.
To enable this functionality, the Gallery Curator must log in and go into the Admin
section of the Gallery. From there, click on the System option on the left. You’ll see
the option for “Default behavior for workflow credentials”.
2. Require users to enter their own credentails when running any workflow
User is not required to specify credentials: This option will set the workflow to run
as the default credentials for the studio or server.
After a Gallery Curator (Admin) has enabled the ability to schedule processes from
the Gallery, users have the option to schedule from gallery.
Once enabled, users will see an additional option when they select a workflow:
To view the results of the Schedules, click over to the Workflow Results page.
Users will be able to see only the results from their schedules and ad-hoc runs, with
a designation as to which runs were Manual and which were Scheduled. Gallery
Curators (Admin) can see run results for all users.
Log into your Gallery, find your settings, and click on Admin (this will only appear if you
are your Gallery’s admin). Then, in the bottom left-hand corner, find the Help link.
Still craving more information? Perhaps some training options? Quick Community access?
Now, with Alteryx Analytics 11, we’ve expanded on these capabilities by enabling
administrators to understand not only how the Server is being utilized, but how
users of Alteryx Designer are leveraging their own desktop installations.
Administrators can now access usage insights such as:
·· What are the most popular tools and categories being used
The new Alteryx Designer Usage data is available for customers with Alteryx Server
with the Gallery component enabled. By default, this feature is disabled, and the
Alteryx Designers will need to be configured to begin sending data to your Alteryx
Server. Please read the documentation available here for instructions on how to
configure Alteryx Designers to send data to your Alteryx Server.
91 | Tool Overview
Tool Overview
In/Out
Input the current date and time
at module runtime, in a format
Review your data at any point
of the user’s choosing. (Useful
in the work flow.
for adding a date-time header
Date Time to a report.)
Browse
Now
Automatically set the field Split the data stream into two
type for each string field to the or three random samples with a
smallest possible size and type specified percentage of records
that will accommodate the data in the estimation and validation
in each column. Create samples.
Auto Field
Samples
93 | Tool Overview
Search for data in one field
An updated version of the
from one data stream and
Household File Matching Tool
replace it with a specified field
with functionality and new
from a different stream. Similar
Consumer View matching criteria
Find Replace to an Excel VLOOKUP.
Matching
Split the text from one field Read in XML snippets and parse
into separate rows or columns. them into individual fields.
Text to
XML Parse
Columns
Transform
Tool Overview | 94
Summarize data by grouping,
summing, counting,
Pivot the orientation of the
spatial processing, string
data stream so that horizontal
concatenation, and much more.
fields are on the vertical axis.
The output contains only the
Summarize results of the calculation(s). Transpose
95 | Tool Overview
Limit the In-DB data stream
Create an In-DB output
to a number or percentage of
connection on a macro.
records.
Macro Output
Sample In-DB
In-DB
Reporting
Tool Overview | 96
Create a map for output via the Add and customize text for
Render tool. output via the Render tool.
Table
Documentation
Add annotation or images to
Add a web page or Windows
the module canvas to capture
Explorer window to your
notes or explain processes for
canvas.
later reference.
Comment Explorer Box
97 | Tool Overview
Extract information about a
Round off sharp angles of a
spatial object, such as area,
polygon or polyline by adding
centroid, bounding rectangle,
nodes along its lines.
etc.
Smooth Spatial Info
Combine two data streams
based on the relationship Create a new spatial object
between two sets of spatial from the combination or
objects to determine if the intersection of two spatial
objects intersect, contain or Spatial objects.
Spatial Match touch one another. Process
Tool Overview | 98
Display an interactive map to
Allow users to choose a number
allow the user to draw or select
from a predefined range.
location objects
Numeric Up/
Map
Down
99 | Tool Overview
Produce enhanced scatterplots, Assesses how well an arbitrary
with options to include monotonic function could
boxplots in the margins, a linear describe the relationship
regression line, a smooth curve between two variables without
via non-parametric regression, making any other assumptions
a smoothed conditional spread, Spearman about the particular nature of
Scatterplot outlier identification, and a the relationship between the
Correlation
regression line. variables.
Shows the distribution of a
single numeric variable, and
conveys the density of the
distribution based on a kernel
smoother that indicates the
Violin Plot density of values (via width) of
the numeric field.
Predictive
Create generalized boosted
regression models based on the Estimate regression models for
gradient boosting methods of count data (e.g., the number
Friedman.* It works by serially of store visits a customer
adding simple decision tree makes in a year), using Poisson
models to a model ensemble so regression, quasi-Poisson
Boosted as to minimize an appropriate Count regression, or negative binomial
Model loss function. Regression regression.
Creates an interactive
visualization of a network along Create feedforward perceptron
with summary statistics and neural network model with a
distribution of node centrality single hidden layer.
Network Neural
measures.
Analysis Network
Predict a variable of interest
(target variable) based on one
or more predictor variables
Calculate a predicted value for
using the two-step approach
the target variable in the model.
of Friedman’s multivariate
Score Spline Model adaptive regression (MARS)
algorithm.
Support Vector Machines
(SVM), or Support Vector
Determine the “best” predictor Networks (SVN), are popular
variables to include in a model supervised learning algorithms
out of a larger set of potential used for classification
predictor variables for linear, problems, and are meant to
logistic, and other traditional Support accommodate instances where
Stepwise regression models. Vector the data (i.e., observations)
Machine are considered linearly non-
separable.
Compare the difference in
the mean values for a numeric
response field between a
control group and one or more
Test of Means treatment groups
AB Testing
Match one to ten control units
(e.g., stores, customers, etc.)
to each member of a set of
Compare the percentage
previously selected test units
change in a performance
on the criteria such as seasonal
measure to the same measure
patterns and growth trends for
one year prior.
AB Analysis AB Controls a key performance indicator,
along with other user provided
criteria.
Read CSV, DBF and YXDB files Write CSV, DBF and YXDB files
from Amazon S3. to Amazon S3.
Amazon S3 Amazon S3
Download Upload
Social Media
Bring in data from Datasift-
twitter, Facebook, Tumblr, Search Foursquare Venues by
YouTube, Wikipedia, and much a location with an option of
more- http://datasift.com/ filtering by a search term.
DataSift platform/datasources/ Foursquare