0% found this document useful (0 votes)
78 views3 pages

CRISP

The document provides information about various topics related to data analysis including CRISP-ML (Q), SQL, and Python. It discusses the 4 stages of analytics, the 6 phases of CRISP-ML (Q), data types in SQL, operators and functions in Python, and more. The document is comprised of session summaries on these topics from multiple days of training.

Uploaded by

Akmal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
78 views3 pages

CRISP

The document provides information about various topics related to data analysis including CRISP-ML (Q), SQL, and Python. It discusses the 4 stages of analytics, the 6 phases of CRISP-ML (Q), data types in SQL, operators and functions in Python, and more. The document is comprised of session summaries on these topics from multiple days of training.

Uploaded by

Akmal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

CRISP-ML (Q) Session-1

1) 4 Stages of analytic:
i. Descriptive= describing trends/what happen in the past
ii. Diagnostic= figuring the reason/cause of the trends or what happen in the past
iii. Predictive= the use of the trend, past or current data to make prediction/forecasting
for future event
iv. Prescriptive= decision making/recommendation based on all the first 3 analysis
process.
2) CRISP-ML (Q) stands for “Cross Industry Standard Process for Machine Learning with Quality
Assurance”. The 6 phrases are:

i. Business and data understanding


ii. data preparation
iii. model building and tuning
iv. evaluation
v. model deployment
vi. monitoring and maintenance

SQL for Data Analytic & Data Science Day 1

1) The role of database is to store the data/information to be accessed by server as per


the “request” of the User and provide the retrieve information to the User in the form
of “response”
2) Create table employee (emp_name varchar (28), emp_id int)

Tame the Phyton Day 1

1) A=1 and a=2


2) Rules:
a. Variable start with letter and only alphanumeric
b. Avoid existing keyword used by phyton
c. No “SPACE” (empty character) or symbol but can be substitute underscore
“_”

CRISP-ML (Q) Session-2

1) Business understanding referring to understanding the business problems to better


understand the problem, the objective as well as its constraint in order to solve it. The two
key point in business understanding is the business problem and business constraint
2) Data understanding refer to the process of populating data, analysing the data as well
understanding it.

SQL for Data Analytic & Data Science Day 2

1) There are 3 main types of data type in SQL


2) Using DDL command, example:
create table employee(
first_name varchar(20),
mid_name varchar (20),
last_name varchar (20),
age int,
salary int,
location varchar (20)
);

and run the command.

Tame the Phyton Day 2

1) Keyword or reserved word in variables refer to the pre-defined variables.


2) There are 7 operators in Phyton.

CRISP-ML (Q) Session-3

1) Continuous data is consisting of numerical value which can take any value that can be
measured and infinite. While discrete data is a solid which can be counted and exact value.
2) Data types in the data understanding is (1) continuous, (2) discrete, (3) Qualitative vs
Quantitative, (4) Structured vs semi structured vs unstructured, (5) big data and non-big
data, (6) Cross sectional vs time series vs longitude/panel data, (7) balanced vs imbalanced
data, (8) offline data vs live streaming data.

SQL for Data Analytic & Data Science Day 3

1) Composite primary key is a combination of more than one attributes to make a primary key
which then known as composite primary key.
2) Primary key identifies each record as distinct while unique key only allows distinct value to
be entered. While primary key can only have one in a table but unique key can have many
and accept null.

Tame the Phyton Day 3

1) Example of operation in phyton is:

i. Arithmetic operators: print (2*3)


ii. Assignment operator: a = 343
iii. Comparison operator: 2=1
iv. Identity operators: 1 is 1
v. Logical operators: x>12
vi. Bitwise operator: a & b
2) List is an ordered sequence of item which represented with “{“ bracket. Indexing in
phyton list refer to the retrieving the info by specifying its position. Slicing in phyton
referring to extracting portion of info within the info specifically.

CRISP-ML (Q) Session-4

1) Time series data compost of single variable with pattern of acceding and descending across a
period of time. Cross sectional data is use multiple variables which usually to represent
relation between both.
2) Live streaming differ from offline processing as it happen in real time with shorter period by
using data which currently through system. While offline using pre existing data to draw
conclusion.

SQL for Data Analytic & Data Science Day 4

1) Constrain in SQL referring to command used in SQL to set rules. Constraint can be checked.
Check constraints used to specify specific range of data or limit type of data as in this
example to tell command to only permit aga more or equal than 18 to be stored:
a. CREATE TABLE Persons (
    ID int NOT NULL,
    LastName varchar(255) NOT NULL,
    FirstName varchar(255),
    Age int,
    CHECK (Age>=18)
);
2) In a simple word if the object does not exist already it will be created and if the object does
already exist it will be altered/updated

Tame the Phyton Day 4

1) In-built fucntuon referring to several functions that are readily available for use.
2) Set in phyton referring an unordered collection of data types. These are mutable,
iterable, and do not consist of any duplicate elements. Set is mutable while tuple is
not. Set is unordered while tuples ordered.

You might also like