Unit2 Notes
Unit2 Notes
from large amounts of data. The data sources can include databases, data
warehouses, the web, and other information repositories or data that are
streamed into the system dynamically.
Gather your high-quality data and store it in a format that you can easily access.
If you’re just getting started with data mining, you might use something as simple as
Google Sheets. If your business is growing, consider HubSpot’s data sync tool. If
you’re experienced, you might opt for a tool such as Tableau.
Stage 3: Data Preparation
Clean up the data, remove duplicates, and ensure it represents your business
accurately. To avoid errors, you might employ the help of a tool such as Operations
Hub and appoint this task to one person. Allowing multiple people to collaborate on
one dataset at the same time may lead to duplicates and redundancies.
Check out our guides on data quality and data lifecycle management to ensure
you do everything you need to do in this stage.
Stage 4: Modeling
In the modeling stage, you use algorithms, artificial intelligence, and machine
learning to associate, categorize, regress, and cluster your data. If you have a data
analyst on staff, they might use the R and Python programming languages to carry
out these data mining techniques. They might also use data mining software.
If you’re just getting started, you might use the pivot table, filtering, and data
visualization tools in your spreadsheet software.
Stage 5: Evaluation
Next, it’s time to look at the results. Do your findings help you answer the business
question you established in stage one? If not, then it’s time to try stage four again —
it’s totally normal to have to model the data various times before gleaning the right
insights.
Stage 6: Deployment
Last, you compile all of your results in a presentation or dashboard and present it to
key stakeholders. You’ll all convene and figure out what to do based on what you
found in your data.
Data mining has its benefits, but it can sound like a lot to tackle for a beginner in the
subject. One common point of confusion is in regards to the differences between
data mining and data harvesting.
The two primary methods for data analysis are qualitative data analysis
techniques and quantitative data analysis techniques. These data analysis
techniques can be used independently or in combination with the other to
help business leaders and decision-makers acquire business insights from
different data types.
Data analysts work with both quantitative and qualitative data, so it’s
important to be familiar with a variety of analysis methods. Let’s take a look
at some of the most useful techniques now.