0% found this document useful (0 votes)

14 views32 pages

305_BA_MachineLearning_And_Cognitive_Intellegence_using_Python_1

Uploaded by

harshavalwani24

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views32 pages

305_BA_MachineLearning_And_Cognitive_Intellegence_using_Python_1

Uploaded by

harshavalwani24

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

305 BA: MACHINE LEARNING & COGNITIVE INTELLIGENCE USING PYTHON 5860

Q.1. a) State how to define variable in python? b) Identify any two features of machine learning. c) List various
loops in python. d) List any two differences between lists and sets. e) What do you mean by operator overloading
in python? f) Define the term cognitive intelligence. g) Identify the steps of CRISP - DM Methodology. h) What
do you mean by data visualisation?
a) In Python, you can define a variable by simply assigning a value to a name. For example:

```python
variable_name = 42
```

b) Two features of machine learning are:

1. **Supervised Learning:** It involves training a model on a labeled dataset, where the algorithm
learns from the input-output pairs.

2. **Unsupervised Learning:** In this type of learning, the algorithm is given data without explicit
instructions on what to do with it. The system tries to learn the patterns and the structure from the
data.

c) Various loops in Python include:

1. **for loop:**
```python
for item in iterable:
# code to be executed
```

2. **while loop:**
```python
while condition:
# code to be executed
```

d) Two differences between lists and sets in Python:

- Lists allow duplicate elements, while sets do not. Each element in a set must be unique.
- Lists are ordered, meaning the elements have a specific order, and you can access them using
indices. Sets, on the other hand, are unordered.

e) Operator overloading in Python refers to the ability to define multiple behaviors for a single
operator depending on the operands. It allows you to define how operators should behave for
objects of your own classes.

f) **Cognitive Intelligence:** This term refers to the capability of a computer program or system to
simulate human-like thought processes. It involves understanding, reasoning, learning, problem-
solving, and adapting to new situations.

g) Steps of CRISP-DM Methodology (Cross-Industry Standard Process for Data Mining):

1. **Business Understanding:** Define the business problem, the goals of the data mining project,
and the criteria for success.

2. **Data Understanding:** Collect and explore the data needed for the project, identify data
quality issues, and understand the structure of the data.

3. **Data Preparation:** Cleanse, preprocess, and format the data for analysis. This step also
involves feature engineering and transformation.

4. **Modeling:** Select and apply various modeling techniques, tune parameters, and validate the
models to find the best one for the given problem.

5. **Evaluation:** Evaluate the models' performance against the business objectives and criteria
defined in the first step.
6. **Deployment:** Deploy the model into the production environment and monitor its
performance.

h) **Data Visualization:** It is the representation of data in a graphical or pictorial format. The goal
is to provide insights into complex datasets by presenting them in a more understandable and
interpretable form, making it easier to identify patterns, trends, and outliers. Data visualization can
include charts, graphs, maps, and other visual elements to convey information effectively.

Q.2. a) Describe Numpy Arrays. Explain with example.

b) Distinguish between clustering and classification in machine learning.

c) Discuss the Reinforcement learning with example.

a) **Numpy Arrays:**
NumPy is a powerful Python library for numerical and mathematical operations. One of its key
features is the array object, which is a multidimensional array providing efficient storage and
manipulation of large datasets. Here's an example of how to create and work with NumPy arrays:

```python
import numpy as np

# Creating a 1-dimensional array

arr_1d = np.array([1, 2, 3, 4, 5])
print("1D Array:")
print(arr_1d)

# Creating a 2-dimensional array

arr_2d = np.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
print("\n2D Array:")
print(arr_2d)
# Accessing elements
print("\nElement at row 1, column 2:", arr_2d[0, 1])

# Basic operations
print("\nSum along columns:", np.sum(arr_2d, axis=0))
```

NumPy arrays are essential for numerical operations in Python, providing a foundation for many
other libraries and tools in the data science and machine learning ecosystem.

b) Clustering vs. Classification:

- Clustering: Clustering is an unsupervised learning technique where the algorithm tries to

group similar data points based on some inherent patterns or similarities without any predefined
labels. The goal is to discover the inherent structure present in the data.

- Classification: Classification is a supervised learning technique where the algorithm learns

from labeled data to predict the labels of new, unseen data. The model is trained on a dataset with
input-output pairs, and the goal is to map input features to predefined output classes.

In summary, clustering involves finding natural groupings in the data without prior knowledge of the
groups, while classification involves learning from labeled examples to predict the class labels of
new instances.

c) Reinforcement Learning with Example:

Reinforcement Learning (RL) is a type of machine learning where an agent learns how to behave in
an environment by performing actions and receiving rewards. The goal is for the agent to learn the
optimal policy (sequence of actions) to maximize cumulative reward. Here's a simple example:

Imagine training an agent to play a game. The agent takes actions (e.g., moving left or right) in an
environment (the game), and after each action, it receives a reward or penalty based on its
performance. The agent's objective is to learn the best sequence of actions to maximize its
cumulative score.

```python
# Example of a basic reinforcement learning scenario
# (Note: This is a conceptual example, not actual code)

# Environment
game_environment = Game()

# Agent
class Agent:
def __init__(self):
self.q_values = {} # Q-values represent the expected cumulative reward for each action-state
pair

def choose_action(self, state):

# Exploration-exploitation trade-off
if np.random.rand() < exploration_rate:
return random_action()
else:
return self.get_best_action(state)

def update_q_values(self, state, action, reward, next_state):

# Update Q-values based on the reward and the next state
q_value = calculate_new_q_value(reward, next_state)
self.q_values[(state, action)] = q_value

# Training loop
agent = Agent()
for episode in range(num_episodes):
state = game_environment.reset()
total_reward = 0

for step in range(max_steps_per_episode):

action = agent.choose_action(state)
next_state, reward, done = game_environment.take_action(action)
agent.update_q_values(state, action, reward, next_state)

total_reward += reward
state = next_state

if done:
break

print("Episode {}: Total Reward: {}".format(episode, total_reward))

```

In this example, the agent learns the optimal actions to take in different states of the game by
updating its Q-values based on the received rewards. Over time, the agent refines its strategy to
maximize cumulative reward.

Q.3. a) Explain the decision tree algorithm in machine learning with example.

b) Explain the concept of simple and multiple regression.

a) **Decision Tree Algorithm in Machine Learning:**

A decision tree is a supervised machine learning algorithm used for both classification and
regression tasks. It works by recursively partitioning the data into subsets based on the most
significant attribute at each level of the tree. The process continues until a stopping criterion is met,
such as a specific depth or purity threshold.
Here's a simplified example of a decision tree for a binary classification problem (e.g., predicting
whether a passenger survives on the Titanic):

```plaintext
Decision Tree for Survival Prediction:
-----------------------------------------
- If gender is male:
- If age <= 10:
- Predict: Survived
- If age > 10:
- If class is 1st or 2nd:
- Predict: Survived
- If class is 3rd:
- Predict: Not Survived
- If gender is female:
- Predict: Survived
```

In this example, the decision tree makes decisions based on the features (gender, age, and class) to
predict whether a passenger survived or not. Each node represents a decision based on a feature,
and each branch represents the outcome of that decision. The leaves of the tree contain the final
predictions.

The decision tree algorithm recursively selects the best feature to split the data based on criteria like
Gini impurity or information gain, optimizing for the most significant reduction in uncertainty or
impurity.

b) Simple and Multiple Regression:

**Simple Regression:**
Simple linear regression is a statistical method to model the relationship between a single
independent variable (feature) and a dependent variable (target) by fitting a linear equation to the
observed data. The equation takes the form:

\[ Y = mX + b \]

where:
- $ Y $ is the dependent variable.
- $ X $ is the independent variable.
- $ m $ is the slope of the line.
- $ b $ is the y-intercept.

For example, consider predicting the price ($ Y $) of a house based on its square footage ($ X $).
The simple linear regression model would find the best-fitting line to represent this relationship.

**Multiple Regression:**
Multiple linear regression extends simple regression to model the relationship between two or more
independent variables and a dependent variable. The equation takes the form:

\[ Y = b_0 + b_1X_1 + b_2X_2 + \ldots + b_nX_n \]

where:
- $ Y $ is the dependent variable.
- $ X_1, X_2, \ldots, X_n $ are the independent variables.
- $ b_0 $ is the y-intercept.
- $ b_1, b_2, \ldots, b_n $ are the coefficients for the respective independent variables.

For example, predicting the price ($ Y $) of a house based on square footage ($ X_1 $), number of
bedrooms ($ X_2 $), and distance to the city center ($ X_3 $) would involve multiple linear
regression. The model estimates the coefficients ($ b_0, b_1, b_2, b_3 $) that best fit the observed
data.

Q.4. a) Discuss how the clustering is useful in marketing domain?

b) Analyse K - Nearest Neighbour algorithm for machine learning.

a) **Clustering in the Marketing Domain:**

Clustering is highly useful in the marketing domain for various purposes. Here are some ways in
which clustering can be applied:

1. **Customer Segmentation:**
- Identify distinct groups of customers based on their purchasing behavior, demographics, or
preferences.
- Tailor marketing strategies for each segment to increase the effectiveness of targeted campaigns.
- For example, a retail business might discover segments like "frequent shoppers," "budget-
conscious buyers," or "occasional buyers."

2. **Product Recommendations:**
- Analyze customer purchase histories and preferences to recommend products or services based
on similar customer behavior.
- Improve cross-selling and upselling by understanding which products are commonly bought
together.
- For instance, an e-commerce platform might recommend products to users based on the
preferences of others in the same cluster.

3. Market Basket Analysis:

- Identify associations and patterns in customer purchases to optimize product placement and
promotion strategies.
- Understand which products are frequently purchased together and use this information for
strategic product bundling.
- Supermarkets, for example, can optimize shelf layouts based on the relationships between
products.

4. Targeted Marketing Campaigns:

- Customize marketing messages for specific customer segments to enhance engagement.
- Clustering helps in identifying the right audience for a particular promotion, ensuring that
marketing efforts are more personalized and relevant.
- For example, a company might run different advertising campaigns for different clusters of
customers.

5. **Churn Analysis:**
- Predict and identify customers at risk of churning by analyzing their behavior and characteristics.
- Develop retention strategies tailored to different customer segments to reduce churn rates.
- Telecommunication companies, for instance, can identify clusters of customers with higher
likelihoods of churning.

b) K-Nearest Neighbors (KNN) Algorithm:

The K-Nearest Neighbors algorithm is a supervised machine learning algorithm used for classification
and regression tasks. Here's an overview of how it works:

- **Basic Idea:**
- Given a new data point, KNN classifies or predicts its label based on the majority class or average
of the K nearest data points in the feature space.
- The "nearest" data points are determined by a distance metric, often Euclidean distance.

- **Steps:**
1. **Choose K:** Select the number of neighbors, K.
2. **Calculate Distances:** Compute the distance between the new data point and all other data
points in the training set.
3. **Identify Neighbors:** Identify the K nearest neighbors based on the calculated distances.
4. **Majority Vote (Classification) or Average (Regression):**
- For classification, assign the class label that is most common among the K neighbors.
- For regression, predict the average of the target values of the K neighbors.

- **Parameters:**
- The choice of K and the distance metric are critical parameters in KNN.

- **Example:**
- For a simple classification example, consider predicting whether a point belongs to class A or B on
a 2D plane. If K = 3, the algorithm would classify the point based on the majority class of its three
nearest neighbors.

```python
from sklearn.neighbors import KNeighborsClassifier

# Example usage of KNN for classification

X_train = [[1, 2], [2, 3], [3, 1]]
y_train = [0, 0, 1] # Class labels

knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(X_train, y_train)

# Predicting the class of a new point

X_new = [[2.5, 2]]
prediction = knn.predict(X_new)
print("Predicted class:", prediction)
```

In this example, the KNN algorithm is trained on a small dataset, and then it predicts the class of a
new point based on the classes of its three nearest neighbours.

Q.5. a) Design a code in python to print the following pattern.

***

****

*****

b) "Machine learning will make companies more efficient and allow them to streamline business processes of an
organisation". Justify the statement.
a) **Python Code for the Pattern:**

Here's a simple Python code to print the given pattern:

```python
def print_pattern(rows):
for i in range(1, rows + 1):
for j in range(1, i + 1):
print("*", end=" ")
print()

for i in range(rows - 1, 0, -1):

for j in range(1, i + 1):
print("*", end=" ")
print()

# Set the number of rows for the pattern

num_rows = 5
print_pattern(num_rows)
```

This code defines a function `print_pattern` that takes the number of rows as an argument and
prints the pattern accordingly. The pattern consists of two parts: the increasing part and the
decreasing part.

b) Justification for the Statement:

"Machine learning will make companies more efficient and allow them to streamline business
processes of an organisation."
Justification:

1. **Automated Decision-Making:**
- Machine learning enables companies to automate decision-making processes by analyzing large
datasets and learning patterns.
- Automation reduces the time and effort required for routine decision-making tasks, making
processes more efficient.

2. **Predictive Analytics:**
- Machine learning models can predict future trends and outcomes based on historical data.
- Companies can use these predictions to anticipate demand, optimize inventory, and make
strategic decisions, leading to better efficiency.

3. Personalization and Customer Engagement:

- Machine learning algorithms analyze customer behavior to provide personalized
recommendations and experiences.
- This personalized approach enhances customer engagement, leading to increased satisfaction
and loyalty.

4. **Process Optimization:**
- ML algorithms can optimize complex business processes by identifying bottlenecks and
inefficiencies.
- Streamlining these processes improves overall efficiency and resource utilization.

5. **Cost Reduction:**
- Automation through machine learning can significantly reduce operational costs by replacing
manual and repetitive tasks.
- Companies can allocate resources more effectively and focus on high-value tasks.

6. Fraud Detection and Security:

- Machine learning algorithms can detect anomalous patterns in data, enhancing fraud detection
capabilities.
- Improved security measures contribute to the overall efficiency of business operations.

7. Supply Chain Management:

- ML aids in optimizing supply chain processes by predicting demand, managing inventory levels,
and improving logistics.
- Companies can avoid overstocking or stockouts, leading to cost savings and increased efficiency.

8. **Data-Driven Decision-Making:**
- Machine learning facilitates data-driven decision-making by extracting insights from vast
datasets.
- Informed decisions based on data contribute to more efficient and effective business operations.

In summary, machine learning empowers companies to leverage data for smarter decision-making,
automate processes, and enhance various aspects of business operations, ultimately leading to
increased efficiency and competitiveness.
305BA Machine Learning and Cognitive Intelligence using Python 5946

Q.1. a) Write a code in Python to display message “Hello World” b) Why there is need of machine
learning? c) List basic operators used in Python. d) State any 2 differences between Lists and Tuples. e)
What do you understand by function overloading in python? f) Define the term ‘Cognitive Intelligence’.
g) Idenfity the steps of KDD framework of machine Learning. h) Explain the term ‘Data Cleaning &
a)Preparation’
**Pythonwhile Codeworking
to Display "Hello World":**
with Data in Python. [

```python
print("Hello World")
```

This simple Python code uses the `print` function to display the message "Hello World" on
the console.

b) Need for Machine Learning:

Machine learning is needed for several reasons:

- **Complexity of Data:**
- In today's world, data is generated at an unprecedented rate, and it is often complex and
unstructured. Machine learning algorithms can extract meaningful patterns and insights from
large datasets.

- **Automation:**
- Machine learning enables automation of tasks that would be difficult or impractical to
program explicitly. This includes tasks like image recognition, natural language processing,
and decision-making.

- **Predictive Analysis:**
- Businesses benefit from machine learning for predictive analytics, allowing them to
forecast trends, make informed decisions, and gain a competitive advantage.
- **Personalization:**
- Machine learning is used to create personalized experiences for users, whether in
recommendations on e-commerce platforms, content recommendations on streaming
services, or targeted advertising.

- **Optimization:**
- ML algorithms optimize processes in various industries, such as supply chain management,
resource allocation, and logistics, leading to increased efficiency.

- **Fraud Detection:**
- Machine learning plays a crucial role in detecting fraudulent activities by identifying
patterns that may indicate fraudulent behavior in financial transactions, online activities, etc.

- Healthcare and Medicine:

- In healthcare, machine learning is used for disease prediction, diagnosis, personalized
treatment plans, and drug discovery.

- **Improved Decision-Making:**
- ML provides tools for analyzing data and making predictions, supporting better decision-
making across various domains.

c) Basic Operators in Python:

- **Arithmetic Operators:**
- `+` (addition)
- `-` (subtraction)
- `*` (multiplication)
- `/` (division)
- `%` (modulus)
- `**` (exponentiation)
- **Comparison Operators:**
- `==` (equal to)
- `!=` (not equal to)
- `<` (less than)
- `>` (greater than)
- `<=` (less than or equal to)
- `>=` (greater than or equal to)

- **Logical Operators:**
- `and` (logical AND)
- `or` (logical OR)
- `not` (logical NOT)

- **Assignment Operators:**
- `=` (assignment)
- `+=` (addition assignment)
- `-=` (subtraction assignment)
- `*=` (multiplication assignment)
- `/=` (division assignment)

- **Bitwise Operators:**
- `&` (bitwise AND)
- `|` (bitwise OR)
- `^` (bitwise XOR)
- `~` (bitwise NOT)
- `<<` (left shift)
- `>>` (right shift)
- **Membership Operators:**
- `in` (True if value is found in the sequence)
- `not in` (True if value is not found in the sequence)

- **Identity Operators:**
- `is` (True if both variables refer to the same object)
- `is not` (True if variables do not refer to the same object)

d) Differences between Lists and Tuples:

1. **Mutability:**
- Lists are mutable, meaning you can modify their elements (add, remove, or change) after
creation.
- Tuples are immutable; once created, you cannot change, add, or remove elements.

2. **Syntax:**
- Lists are defined using square brackets `[]`.
```python
my_list = [1, 2, 3]
```
- Tuples are defined using parentheses `()`.
```python
my_tuple = (1, 2, 3)
```

e) Function Overloading in Python:

Python does not support traditional function overloading like some other languages (e.g.,
C++). However, Python allows a single function to have different implementations based on
the number or types of its parameters. This is known as "polymorphism" and is achieved
through default values and variable-length argument lists.

For example:

```python
def add_numbers(a, b=0, c=0):
return a + b + c

result1 = add_numbers(1)
result2 = add_numbers(1, 2)
result3 = add_numbers(1, 2, 3)

print(result1, result2, result3)

```

In this example, the `add_numbers` function can take one, two, or three arguments, and it
returns the sum of the provided values. If not provided, the default values are used.

f) **Cognitive Intelligence:**

Cognitive intelligence refers to the ability of a system or entity to simulate and replicate
human-like thought processes, including perception, reasoning, learning, problem-solving,
and understanding natural language. It involves the use of advanced technologies like
artificial intelligence, machine learning, and deep learning to mimic human cognitive
functions.

g) Steps of KDD Framework in Machine Learning:

Knowledge Discovery in Databases (KDD) is a process that involves extracting useful patterns
and knowledge from large datasets. The steps of the KDD framework include:

1. **Selection:**
- Define the target dataset and select relevant data to be analyzed.

2. **Preprocessing:**
- Clean the data by handling missing values, outliers, and noise.

3. **Transformation:**
- Convert raw data into a suitable format for analysis, which may involve feature
engineering or scaling.

4. **Data Mining:**
- Apply machine learning algorithms to discover patterns, trends, or relationships in the
data.

5. **Interpretation/Evaluation:**
- Interpret the results of data mining and evaluate the discovered patterns for their
significance and reliability.

6. **Utilization:**
- Apply the knowledge and insights gained from the data to make informed decisions or
take appropriate actions.

h) **Data Visualization:**

Data visualization is the representation of data in graphical or visual formats, such as charts,
graphs, and maps, to facilitate the understanding of patterns, trends, and insights in the data.
It involves the use of visual elements to communicate information effectively and is an
essential part of the data analysis process. Data visualization helps in presenting complex
information in a more understandable and interpretable form, making it easier for decision-
makers to grasp and analyze the data.

Q.2. a) How to read and write files with open statement? Explain with example.

b) Explain anyone Supervised Learning algorithm.

c) Describe SEMMA process model of machine learning.

a) Reading and Writing Files with the `open` Statement:

The `open` statement in Python is used to open and manipulate files. It has the following
syntax:

```python
with open('filename.txt', 'mode') as file:
# Perform operations on the file
```

Here, 'filename.txt' is the name of the file you want to open, and 'mode' is the mode in which
you want to open the file (`'r'` for reading, `'w'` for writing, `'a'` for appending, etc.).

Reading from a File:

```python
# Example of reading from a file
with open('example.txt', 'r') as file:
content = file.read()
print(content)
```
In this example, the content of the file 'example.txt' is read and printed.

**Writing to a File:**

```python
# Example of writing to a file
with open('output.txt', 'w') as file:
file.write('Hello, this is a sample text.\n')
file.write('Writing to a file is easy with Python.')
```

In this example, two lines of text are written to the file 'output.txt'.

b) Supervised Learning Algorithm: Linear Regression

**Linear Regression:**

Linear Regression is a supervised learning algorithm used for predicting the value of a
continuous variable based on one or more predictor features. It assumes a linear relationship
between the input features and the output variable.

**Example:**

Let's say we want to predict the price of houses based on their size. The linear regression
model would try to find the best-fitting line (linear equation) that minimizes the difference
between the predicted prices and the actual prices in the training data.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
import matplotlib.pyplot as plt

# Example data
sizes = np.array([1400, 1600, 1700, 1875, 1100, 1550, 2350, 2450, 1425, 1700])
prices = np.array([245000, 312000, 279000, 308000, 199000, 219000, 405000, 324000,
319000, 255000])

# Reshape the data to fit the model

sizes = sizes.reshape(-1, 1)

# Create and fit the model

model = LinearRegression()
model.fit(sizes, prices)

# Make predictions for new data

new_sizes = np.array([2000, 1500]).reshape(-1, 1)
predictions = model.predict(new_sizes)

# Plot the data and the regression line

plt.scatter(sizes, prices, color='blue')
plt.plot(sizes, model.predict(sizes), color='red', linewidth=2)
plt.scatter(new_sizes, predictions, color='green', marker='x')
plt.xlabel('House Size (sq ft)')
plt.ylabel('House Price ($)')
plt.title('Linear Regression Example')
plt.show()
```
In this example, the linear regression model is trained on the sizes and prices of houses. It
then predicts the prices for new house sizes. The red line represents the best-fitting line, and
the green 'x' markers represent the predicted prices for new sizes.

c) SEMMA Process Model of Machine Learning:

SEMMA stands for Sample, Explore, Modify, Model, and Assess. It is a process model used in
data mining and machine learning for developing predictive models. Here's a brief overview:

1. **Sample:**
- Obtain a representative sample of the data to analyze. This involves selecting a subset of
data from the entire dataset.

2. **Explore:**
- Explore and visualize the data to understand its characteristics, identify patterns, and gain
insights. This step involves descriptive statistics, data visualization, and data profiling.

3. **Modify:**
- Preprocess the data by cleaning, transforming, and handling missing values. This step also
involves feature engineering, where new features are created or existing ones are modified
to improve model performance.

4. **Model:**
- Build and train predictive models using the prepared dataset. This step includes selecting
appropriate algorithms, training the models, and tuning parameters to optimize
performance.

5. **Assess:**
- Evaluate the performance of the models using metrics such as accuracy, precision, recall,
or mean squared error. Assess the models against the business objectives to ensure they
meet the desired criteria.

The SEMMA process is iterative, and analysts may revisit earlier stages based on insights
gained during the later stages. It provides a structured framework for guiding the data mining
and machine learning process from data exploration to model assessment.

Q.3. a) Explain Supervised Learning technique using K-Nearest Neighbour method.

b) State and explain applications of supervised learning in any one domain which you know.

a) Supervised Learning using K-Nearest Neighbors (KNN):

**Supervised Learning:**
Supervised learning is a type of machine learning where the algorithm is trained on a labeled
dataset, which means the dataset includes both input features and corresponding target
labels. The goal is for the algorithm to learn a mapping from inputs to outputs, allowing it to
make predictions on new, unseen data.

K-Nearest Neighbors (KNN):

K-Nearest Neighbors is a supervised learning algorithm used for classification and regression
tasks. In the context of classification, given a new data point, the algorithm identifies the K
training data points closest to it in the feature space. The majority class among these K
neighbors is assigned to the new data point.

Example of KNN for Classification:

Let's consider a simple example where we want to classify whether a fruit is an apple or a
banana based on two features: sweetness and color. We have a labeled dataset with the
sweetness, color, and corresponding labels.

```python
from sklearn.neighbors import KNeighborsClassifier
# Sample dataset
X_train = [[8, 'red'], [6, 'yellow'], [7, 'red'], [4, 'yellow']]
y_train = ['apple', 'banana', 'apple', 'banana']

# Create a KNN classifier

knn_classifier = KNeighborsClassifier(n_neighbors=3)
knn_classifier.fit(X_train, y_train)

# Predict the class of a new fruit

new_fruit = [[7, 'red']]
predicted_class = knn_classifier.predict(new_fruit)

print("Predicted class:", predicted_class)

```

In this example, the KNN algorithm is trained on a dataset with labeled instances of fruits.
When given a new fruit with sweetness 7 and red color, the algorithm predicts that it is an
apple.

b) Applications of Supervised Learning in Healthcare:

Application: Disease Diagnosis

**Explanation:**
Supervised learning is extensively used in healthcare for disease diagnosis. Medical
professionals can collect datasets with features such as patient symptoms, test results, and
demographic information, along with corresponding labels indicating the presence or
absence of a particular disease.
**Example:**
Consider the application of supervised learning in diagnosing diabetes. A dataset may include
features like blood sugar levels, age, BMI, and family medical history, with labels indicating
whether the patient has diabetes or not. A supervised learning algorithm, such as a support
vector machine (SVM) or a decision tree, can be trained on this data to predict diabetes in
new patients.

**Benefits:**
1. **Early Detection:** Supervised learning models can assist in early detection of diseases,
enabling timely intervention and treatment.
2. **Personalized Medicine:** By analyzing patient-specific data, models can recommend
personalized treatment plans based on the individual's characteristics.
3. **Resource Optimization:** Efficient allocation of medical resources, such as prioritizing
screenings for individuals at higher risk, can be achieved using predictive models.

In healthcare, the application of supervised learning contributes to more accurate and timely
diagnoses, personalized patient care, and overall improvements in the efficiency of
healthcare processes.

Q.4. a) Elaborate the applications of unsupervised learning in marketing domain.

b) Distinguish between decision trees & linear regression technique with suitable example.

a) Applications of Unsupervised Learning in Marketing:

Unsupervised learning in marketing is valuable for discovering patterns, segmenting

customer groups, and gaining insights without labeled data. Here are some applications:

1. **Customer Segmentation:**
- **Objective:** Divide customers into meaningful segments based on their behavior,
preferences, or characteristics.
- **Example:** Clustering algorithms can group customers who exhibit similar purchasing
patterns. Marketers can then tailor specific strategies for each segment, improving the
effectiveness of targeted campaigns.
2. **Market Basket Analysis:**
- **Objective:** Identify associations and relationships between products frequently
purchased together.
- **Example:** Association rule mining algorithms can reveal that customers who buy
diapers are likely to purchase baby wipes. Retailers can use this information for strategic
product placement and bundling.

3. **Anomaly Detection:**
- **Objective:** Detect unusual or anomalous patterns in customer behavior that may
indicate fraud or other issues.
- **Example:** Unsupervised algorithms can flag unusual transactions or activities, helping
prevent fraudulent activities and enhancing security in e-commerce platforms.

4. **Content Recommendation:**
- **Objective:** Recommend products, services, or content based on users' preferences
and behavior.
- **Example:** Collaborative filtering algorithms can suggest movies, products, or articles
based on the preferences of users with similar behavior, leading to a more personalized user
experience.

5. Social Media Analysis:

- **Objective:** Analyze user-generated content to understand sentiment and identify
trends.
- **Example:** Clustering algorithms can group social media posts based on similar themes
or sentiments. Marketers can use this information to gauge public opinion and tailor their
messaging accordingly.

6. **Attribution Modeling:**
- **Objective:** Understand the contribution of each marketing channel to conversions.
- **Example:** Unsupervised learning techniques can help identify the most influential
touchpoints in the customer journey, allowing marketers to allocate resources effectively and
optimize their marketing mix.

Unsupervised learning techniques empower marketers to uncover hidden patterns and

insights in their data, leading to more informed decision-making and targeted strategies.

b) Distinguishing Decision Trees and Linear Regression:

**Decision Trees:**
- Decision trees are a versatile supervised learning algorithm used for both classification and
regression tasks.
- They make decisions by recursively splitting the dataset based on the most significant
attribute at each node, aiming to create homogeneous subsets.
- The final result is a tree structure where each leaf node represents a class (for classification)
or a predicted value (for regression).

**Linear Regression:**
- Linear regression is a supervised learning algorithm used for predicting a continuous target
variable based on one or more independent features.
- It assumes a linear relationship between the input features and the output variable, fitting a
line to the data that minimizes the sum of squared errors.
- The model equation takes the form $Y = mX + b$, where $Y$ is the output, $X$ is the
input, $m$ is the slope, and $b$ is the y-intercept.

**Differences:**
1. **Output Type:**
- Decision trees can be used for both classification and regression tasks.
- Linear regression is specifically designed for regression tasks, predicting continuous
numeric values.
2. **Model Representation:**
- Decision trees are represented as tree structures with nodes and branches.
- Linear regression is represented by a linear equation, typically a line in two dimensions or
a hyperplane in higher dimensions.

**Example:**
Consider predicting the price of a house based on its size:
- Decision Tree: The tree would make decisions at each node based on features like size,
location, and number of bedrooms, ultimately leading to a predicted price at the leaf node.
- Linear Regression: The linear regression model would find the best-fitting line that
minimizes the difference between the predicted and actual prices based on the size of the
house.

In summary, decision trees are versatile and suitable for both classification and regression,
while linear regression is specifically designed for regression tasks, predicting continuous
values based on a linear relationship between features and the target variable.

Q.5. a) Write a Python code for calculating factorial of a given number.

b) How machine learning techniques will be useful for fraud analysis for credit card. Explain.

a) Python Code for Calculating Factorial:

Here's a simple Python code to calculate the factorial of a given number using a recursive
function:

```python
def factorial(n):
if n == 0 or n == 1:
return 1
else:
return n * factorial(n - 1)

# Example: Calculate factorial of 5

number = 5
result = factorial(number)
print(f"The factorial of {number} is: {result}")
```

In this code, the `factorial` function is defined recursively. It returns 1 for the base cases
(when $n$ is 0 or 1) and calculates the factorial for other values.

b) Machine Learning for Fraud Analysis in Credit Cards:

Machine learning techniques are highly beneficial for fraud analysis in credit cards due to
their ability to detect patterns and anomalies in large datasets. Here's how they can be
useful:

1. **Anomaly Detection:**
- **Technique:** Unsupervised learning algorithms, such as clustering or isolation forests,
can detect anomalies in transaction patterns.
- **Use:** Identify unusual patterns that may indicate fraudulent activities, such as large
transactions, transactions from unfamiliar locations, or unusual spending behavior.

2. **Predictive Modeling:**
- **Technique:** Supervised learning algorithms, like decision trees or support vector
machines, can be trained on labeled datasets to predict the likelihood of a transaction being
fraudulent.
- **Use:** Predict and prioritize transactions with a higher likelihood of fraud, allowing for
proactive measures and timely intervention.
3. **Behavior Analysis:**
- **Technique:** Machine learning models can analyze the historical spending and
transaction patterns of users to establish a baseline of normal behavior.
- **Use:** Identify deviations from the established patterns, triggering alerts for
transactions that significantly differ from the user's typical behavior.

4. **Real-time Monitoring:**
- **Technique:** Stream processing and real-time analytics using machine learning models
enable immediate detection of suspicious activities.
- **Use:** Quickly identify and block potentially fraudulent transactions as they occur,
minimizing the impact on both cardholders and financial institutions.

5. **Feature Engineering:**
- **Technique:** Extract relevant features from transaction data, such as time of day,
transaction amount, location, and frequency.
- **Use:** Provide valuable information for machine learning models to identify patterns
associated with legitimate and fraudulent transactions.

6. **Ensemble Methods:**
- **Technique:** Combine multiple models (ensemble methods) to enhance the overall
fraud detection accuracy.
- **Use:** Improve the robustness of the fraud detection system by leveraging the
strengths of different algorithms.

Machine learning techniques enable credit card companies to adapt and evolve their fraud
detection strategies continuously. By learning from new patterns and emerging fraud tactics,
these models can enhance their accuracy over time, providing a more proactive and effective
defense against fraudulent activities.

From CPP To COM
No ratings yet
From CPP To COM
60 pages
Amos Annotated Output Sem Cfa PDF
No ratings yet
Amos Annotated Output Sem Cfa PDF
31 pages
Essbase Interview Questions
No ratings yet
Essbase Interview Questions
43 pages
305 BA PYTHON - APR 2022 ANSWER Key
No ratings yet
305 BA PYTHON - APR 2022 ANSWER Key
14 pages
MADA
No ratings yet
MADA
48 pages
ML QB Answers
No ratings yet
ML QB Answers
11 pages
Report
No ratings yet
Report
11 pages
ML
No ratings yet
ML
5 pages
Tushar ML
No ratings yet
Tushar ML
52 pages
LAB MANUAL For Machine Learning
No ratings yet
LAB MANUAL For Machine Learning
15 pages
Module 3 Data Science Machine Learning
No ratings yet
Module 3 Data Science Machine Learning
53 pages
Types of Machine Learning Algorithms
No ratings yet
Types of Machine Learning Algorithms
14 pages
14401172022_tanu raman ml lab file
No ratings yet
14401172022_tanu raman ml lab file
21 pages
Unit III - I
No ratings yet
Unit III - I
15 pages
Machine Learning Most Important Question For Mid Term Ipu University
No ratings yet
Machine Learning Most Important Question For Mid Term Ipu University
36 pages
Machine learning_question bank
No ratings yet
Machine learning_question bank
45 pages
RLDL File
No ratings yet
RLDL File
31 pages
CS3491 Lab Manual
No ratings yet
CS3491 Lab Manual
21 pages
Intro ML Linear Classifier
No ratings yet
Intro ML Linear Classifier
18 pages
PUT MLT
No ratings yet
PUT MLT
12 pages
BCS602 Model Question Paper Solved(Search Creators)-2-37
0% (2)
BCS602 Model Question Paper Solved(Search Creators)-2-37
36 pages
LP I ML Viva Questions
100% (1)
LP I ML Viva Questions
9 pages
ML Lab Record
No ratings yet
ML Lab Record
27 pages
InterviewMaterial
No ratings yet
InterviewMaterial
14 pages
Diya Basera
No ratings yet
Diya Basera
15 pages
Rohit Unit 1 ML Notes
No ratings yet
Rohit Unit 1 ML Notes
27 pages
Machine Learning
No ratings yet
Machine Learning
9 pages
Silver Oak College of Computer Application: Subject:Machine Learning
No ratings yet
Silver Oak College of Computer Application: Subject:Machine Learning
15 pages
ML Lab Manual
No ratings yet
ML Lab Manual
38 pages
Crash Course Sul Machine Learning ?
No ratings yet
Crash Course Sul Machine Learning ?
13 pages
ML ASS ppt
No ratings yet
ML ASS ppt
16 pages
UNIT-1,2,3
No ratings yet
UNIT-1,2,3
30 pages
ml lab
No ratings yet
ml lab
23 pages
ML Unit 1
No ratings yet
ML Unit 1
9 pages
ML_notion_1
No ratings yet
ML_notion_1
18 pages
Supervised - ML Complete Book
No ratings yet
Supervised - ML Complete Book
153 pages
Machine Learning
No ratings yet
Machine Learning
21 pages
Unit 5 Intro To Machine Learning
No ratings yet
Unit 5 Intro To Machine Learning
25 pages
Machine Learning Qs
No ratings yet
Machine Learning Qs
10 pages
Machine Learning Strategies
No ratings yet
Machine Learning Strategies
59 pages
An Introduction To Machine Learning
No ratings yet
An Introduction To Machine Learning
136 pages
Coursera Machine Learning Specialization
No ratings yet
Coursera Machine Learning Specialization
46 pages
BCS602 Model Question paper Solved(Search Creators)
No ratings yet
BCS602 Model Question paper Solved(Search Creators)
37 pages
What Is Machine Learning_ _ Python Data Science Handbook
No ratings yet
What Is Machine Learning_ _ Python Data Science Handbook
11 pages
ClassNote One
No ratings yet
ClassNote One
2 pages
ML
No ratings yet
ML
8 pages
Machine Learning
No ratings yet
Machine Learning
16 pages
UNIT1@
No ratings yet
UNIT1@
4 pages
L03 The Regression Pipeline - 2
No ratings yet
L03 The Regression Pipeline - 2
58 pages
AI unit 2
No ratings yet
AI unit 2
14 pages
6th_SEM Machine Learning Notes PDF
100% (1)
6th_SEM Machine Learning Notes PDF
36 pages
Ai Solved
No ratings yet
Ai Solved
15 pages
Unit 1 Machine Learning - PDF Lands
No ratings yet
Unit 1 Machine Learning - PDF Lands
5 pages
ML-1
No ratings yet
ML-1
48 pages
Computer Vision-Lec 3
No ratings yet
Computer Vision-Lec 3
11 pages
ML Sem
No ratings yet
ML Sem
24 pages
Python Code For AI
100% (3)
Python Code For AI
219 pages
Lecture 17&18 - Introduction To Machine Learning
No ratings yet
Lecture 17&18 - Introduction To Machine Learning
51 pages
Chapter 01 machine learning
No ratings yet
Chapter 01 machine learning
22 pages
Deep Learning
No ratings yet
Deep Learning
25 pages
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
Advanced C++ Interview Questions You'll Most Likely Be Asked
From Everand
Advanced C++ Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
Harsha Valwani - 2023-05-01
No ratings yet
Harsha Valwani - 2023-05-01
1 page
May_Jun_2024_Revised
No ratings yet
May_Jun_2024_Revised
3 pages
Nov_Dec_2023
No ratings yet
Nov_Dec_2023
3 pages
May_Jun_2024
No ratings yet
May_Jun_2024
3 pages
Nov_Dec_2023
No ratings yet
Nov_Dec_2023
2 pages
SAS Introduction
No ratings yet
SAS Introduction
13 pages
Working As A Data Librarian Eric O Johnson pdf download
No ratings yet
Working As A Data Librarian Eric O Johnson pdf download
76 pages
Model Paper BIG DATA (KOE097)
No ratings yet
Model Paper BIG DATA (KOE097)
8 pages
DWDM Online Bits
No ratings yet
DWDM Online Bits
3 pages
Sandeep Updated Resume
No ratings yet
Sandeep Updated Resume
6 pages
AJP-Test 2 Question Bank
No ratings yet
AJP-Test 2 Question Bank
14 pages
IT Security Assignment
No ratings yet
IT Security Assignment
2 pages
Converting CLOBs 2 VARCHAR
No ratings yet
Converting CLOBs 2 VARCHAR
15 pages
Lab 03
No ratings yet
Lab 03
12 pages
Blast
No ratings yet
Blast
19 pages
231 Rudransh Sharma
No ratings yet
231 Rudransh Sharma
253 pages
Chapter 3 Part 1
No ratings yet
Chapter 3 Part 1
43 pages
Splunk PDF
No ratings yet
Splunk PDF
11 pages
Big Data in Iot: July 2019
No ratings yet
Big Data in Iot: July 2019
8 pages
Time Calculation Rule Fast Formula Reference Guide: Oracle Fusion Time and Labor
100% (1)
Time Calculation Rule Fast Formula Reference Guide: Oracle Fusion Time and Labor
60 pages
Unit 9.2 Database Models
No ratings yet
Unit 9.2 Database Models
14 pages
IPower For IFIX 6.1 Demonstration Guide
No ratings yet
IPower For IFIX 6.1 Demonstration Guide
58 pages
Oracle Concepts and Architecture Database Structures
No ratings yet
Oracle Concepts and Architecture Database Structures
102 pages
Da Types
No ratings yet
Da Types
21 pages
Adbms: Database Recovery Techniques in DBMS
No ratings yet
Adbms: Database Recovery Techniques in DBMS
5 pages
Application of GIS in Construction Management
No ratings yet
Application of GIS in Construction Management
10 pages
Relevant Information
No ratings yet
Relevant Information
6 pages
Dbms Multiple Choice Questions For Midterm LPU
No ratings yet
Dbms Multiple Choice Questions For Midterm LPU
5 pages
Amazon RDS
No ratings yet
Amazon RDS
9 pages
DP-300 V12.35 (6855)
No ratings yet
DP-300 V12.35 (6855)
51 pages
DBMS Ppts Till MTE
No ratings yet
DBMS Ppts Till MTE
264 pages
Types of Keys in Database Management System: Sos in Computer Science and Application Pgdca 203: Dbms
No ratings yet
Types of Keys in Database Management System: Sos in Computer Science and Application Pgdca 203: Dbms
11 pages

305_BA_MachineLearning_And_Cognitive_Intellegence_using_Python_1

Uploaded by

305_BA_MachineLearning_And_Cognitive_Intellegence_using_Python_1

Uploaded by

305 BA: MACHINE LEARNING & COGNITIVE INTELLIGENCE USING PYTHON 5860

b) Two features of machine learning are:

c) Various loops in Python include:

d) Two differences between lists and sets in Python:

g) Steps of CRISP-DM Methodology (Cross-Industry Standard Process for Data Mining):

Q.2. a) Describe Numpy Arrays. Explain with example.

b) Distinguish between clustering and classification in machine learning.

c) Discuss the Reinforcement learning with example.

# Creating a 1-dimensional array

# Creating a 2-dimensional array

b) **Clustering vs. Classification:**

- **Clustering:** Clustering is an unsupervised learning technique where the algorithm tries to

- **Classification:** Classification is a supervised learning technique where the algorithm learns

c) **Reinforcement Learning with Example:**

def choose_action(self, state):

def update_q_values(self, state, action, reward, next_state):

for step in range(max_steps_per_episode):

print("Episode {}: Total Reward: {}".format(episode, total_reward))

b) Explain the concept of simple and multiple regression.

b) **Simple and Multiple Regression:**

\[ Y = b_0 + b_1X_1 + b_2X_2 + \ldots + b_nX_n \]

Q.4. a) Discuss how the clustering is useful in marketing domain?

b) Analyse K - Nearest Neighbour algorithm for machine learning.

3. **Market Basket Analysis:**

4. **Targeted Marketing Campaigns:**

b) **K-Nearest Neighbors (KNN) Algorithm:**

# Example usage of KNN for classification

# Predicting the class of a new point

Q.5. a) Design a code in python to print the following pattern.

Here's a simple Python code to print the given pattern:

for i in range(rows - 1, 0, -1):

# Set the number of rows for the pattern

b) **Justification for the Statement:**

3. **Personalization and Customer Engagement:**

6. **Fraud Detection and Security:**

7. **Supply Chain Management:**

b) **Need for Machine Learning:**

Machine learning is needed for several reasons:

- **Healthcare and Medicine:**

c) **Basic Operators in Python:**

d) **Differences between Lists and Tuples:**

e) **Function Overloading in Python:**

print(result1, result2, result3)

g) **Steps of KDD Framework in Machine Learning:**

b) Explain anyone Supervised Learning algorithm.

c) Describe SEMMA process model of machine learning.

a) **Reading and Writing Files with the `open` Statement:**

**Reading from a File:**

b) **Supervised Learning Algorithm: Linear Regression**

# Reshape the data to fit the model

# Create and fit the model

# Make predictions for new data

# Plot the data and the regression line

c) **SEMMA Process Model of Machine Learning:**

Q.3. a) Explain Supervised Learning technique using K-Nearest Neighbour method.

a) **Supervised Learning using K-Nearest Neighbors (KNN):**

**K-Nearest Neighbors (KNN):**

**Example of KNN for Classification:**

# Create a KNN classifier

# Predict the class of a new fruit

print("Predicted class:", predicted_class)

b) **Applications of Supervised Learning in Healthcare:**

**Application: Disease Diagnosis**

Q.4. a) Elaborate the applications of unsupervised learning in marketing domain.

a) **Applications of Unsupervised Learning in Marketing:**

Unsupervised learning in marketing is valuable for discovering patterns, segmenting

5. **Social Media Analysis:**

Unsupervised learning techniques empower marketers to uncover hidden patterns and

b) **Distinguishing Decision Trees and Linear Regression:**

Q.5. a) Write a Python code for calculating factorial of a given number.

a) **Python Code for Calculating Factorial:**

# Example: Calculate factorial of 5

b) **Machine Learning for Fraud Analysis in Credit Cards:**

You might also like

b) Clustering vs. Classification:

- Clustering: Clustering is an unsupervised learning technique where the algorithm tries to

- Classification: Classification is a supervised learning technique where the algorithm learns

c) Reinforcement Learning with Example:

b) Simple and Multiple Regression:

3. Market Basket Analysis:

4. Targeted Marketing Campaigns:

b) K-Nearest Neighbors (KNN) Algorithm:

b) Justification for the Statement:

3. Personalization and Customer Engagement:

6. Fraud Detection and Security:

7. Supply Chain Management:

b) Need for Machine Learning:

- Healthcare and Medicine:

c) Basic Operators in Python:

d) Differences between Lists and Tuples:

e) Function Overloading in Python:

g) Steps of KDD Framework in Machine Learning:

a) Reading and Writing Files with the `open` Statement:

Reading from a File:

b) Supervised Learning Algorithm: Linear Regression

c) SEMMA Process Model of Machine Learning:

a) Supervised Learning using K-Nearest Neighbors (KNN):

K-Nearest Neighbors (KNN):

Example of KNN for Classification:

b) Applications of Supervised Learning in Healthcare:

Application: Disease Diagnosis

a) Applications of Unsupervised Learning in Marketing:

5. Social Media Analysis:

b) Distinguishing Decision Trees and Linear Regression:

a) Python Code for Calculating Factorial:

b) Machine Learning for Fraud Analysis in Credit Cards: