
Data analytics for Cyber Security

Chapter Eight
Future Direction in data analytics for cyber security
School of Information Technology and Engineering
Addis Ababa Institute of Technology
Addis Ababa University
Jan 2025

by Senait Desalegn
Content
 Cyber-physical systems
 IoT vs CPS
 Vulnerabilities and security challenges to CPS
 Multi Domain Mining
 Deep Learning
 Deep Learning Challenges
 Generative Adversarial Networks
 Ethical Thinking in the Data Analytics Process

What are Cyber-Physical Systems (CPS)?

Cyber-Physical Systems (CPS) are engineered systems that integrate
computational elements with physical processes.
CPS leverage sophisticated algorithms and real-time data analysis to monitor
and control physical processes. By embedding intelligence into physical
objects and environments, CPS make machines capable of making autonomous
decisions, adapting to changing conditions, and optimizing performance in
numerous ways.
CPS hold great potential for innovation and transformation across various
sectors. They enable interaction between digital and physical components,
which drives advancements in manufacturing, transportation, healthcare,
energy, and beyond.
Architectural overview of Cyber-Physical Systems
(CPS)
Cyber-physical systems have an architecture that integrates hardware and
software components. This architecture makes the dynamic interaction between
physical processes and computational elements possible, and it forms the
foundation for the intelligent and responsive behavior of CPS. A CPS
architecture consists of the following:
Physical Process: The real-world system or environment that the CPS interacts
with.
Sensors: Devices that collect data (e.g., temperature, pressure, motion) from
the physical process.
Communication Network: Wired or wireless infrastructure that transmits data
between the physical and cyber domains.
Architectural overview of Cyber-Physical Systems
(CPS)
Computational Nodes: Devices (e.g., microprocessors, servers) that process
and analyze data from sensors.
Actuators: Devices that control the physical process based on commands from
the computational nodes.
Control Algorithms: Software that analyzes sensor data, makes decisions, and
generates control signals.
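The sense-decide-act feedback loop formed by these components can be sketched in a few lines of Python. This is a minimal illustration, not a real controller: the thermostat setpoint and the heating/drift rates are invented for the example.

```python
# Minimal closed-loop CPS sketch (hypothetical on/off thermostat):
# a sensor reads the physical process, a control algorithm decides,
# and an actuator changes the physical process.
def sensor(room_temp):
    """Sensor: report the current temperature of the physical process."""
    return room_temp

def control_algorithm(measured, setpoint=21.0):
    """Control algorithm: decide whether the heater should be on."""
    return measured < setpoint  # simple on/off (bang-bang) control

def actuator(room_temp, heater_on):
    """Actuator: heating raises the temperature; otherwise it drifts down."""
    return room_temp + 0.5 if heater_on else room_temp - 0.1

def run_loop(room_temp, steps=50):
    """The sense -> decide -> act feedback loop."""
    for _ in range(steps):
        heater_on = control_algorithm(sensor(room_temp))
        room_temp = actuator(room_temp, heater_on)
    return room_temp
```

Starting from 15 degrees, the loop drives the room toward the 21-degree setpoint and then oscillates slightly around it, which is the closed-loop behavior the architecture describes.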

Examples Of Cyber-Physical Systems

Operational Technology (OT): This is a combination of hardware and software


systems that monitor and control physical devices and processes in industrial
settings. OT is important for managing infrastructure like power plants,
manufacturing facilities, and transportation systems.
Industrial Internet of Things (IIoT): The IIoT refers to the network of
interconnected sensors, instruments, and other devices used in industrial
applications. It enables real-time data collection, analysis, and automation,
leading to increased efficiency, productivity, and safety in various industries.
Smart Manufacturing: In smart factories, robots collaborate with humans.
Algorithms analyze data to predict maintenance needs, preventing failures.
Examples Of Cyber-Physical Systems

Autonomous Vehicles: Self-driving cars and trucks use sensors and AI for
navigation.
Smart Grids: The physical power grid infrastructure is interconnected with
digital systems for real-time monitoring and control.
Medical Devices: Implantable devices like pacemakers use sensors to monitor
and adjust therapy automatically. These devices integrate sensors and
actuators within the human body to maintain physiological functions.
Building Automation: Smart buildings use CPS to control HVAC, lighting, and
security systems.
Examples Of Cyber-Physical Systems

Aerospace and Defense: Drones, missile defense systems, and autonomous
underwater vehicles are all examples of CPS used in these sectors.
Traffic Management: Adaptive traffic signals and intelligent transportation
systems use CPS to optimize traffic flow and improve safety.
Process Control: Industrial Control Systems (ICS) incorporate CPS to monitor
and control complex processes in critical infrastructure, ensuring safety and
efficiency.

Key Features Of Cyber-Physical Systems

Cyber-physical systems (CPS) have distinct characteristics that set them apart
from computer systems and embedded devices. These features drive their
transformative impact across various domains:
Integration of Cyber and Physical Components: CPS are not merely software
running on hardware; they are integrated systems where the cyber and
physical components are deeply intertwined. This integration allows for real-
time interaction and feedback between the digital and physical worlds.
Real-Time Operation: CPS operate in real time, responding to changes in the
physical environment with minimal delay. This is crucial for applications
that require immediate action, such as autonomous vehicles or industrial
control systems.
Key Features Of Cyber-Physical Systems
Networking and Communication: CPS are connected through networks. This
enables them to exchange data, coordinate actions, and learn from each
other. This interconnectedness leads to scalability and adaptability.
Adaptability and Autonomy: CPS adapts to changing conditions and operates
autonomously to some extent. CPS can make decisions based on real-time
data, learn from their experiences, and optimize their behavior over time.
Heterogeneity: CPS consist of diverse components, such as sensors, actuators,
processors, and communication devices. This heterogeneity requires
sophisticated integration and coordination mechanisms for seamless operation.
Data-Driven Decision Making: CPS rely on data collected from sensors and
other sources, using it to make informed decisions and adapt to changing
conditions.
IoT vs CPS

The Internet of Things (IoT) and Cyber-Physical Systems (CPS) have some
similarities, but there are key distinctions in their capabilities and applications.
Internet-of-Things (IoT)
Focus: IoT connects everyday objects to the internet to collect and exchange
data.

Functionality: Involves simple tasks like sensing and transmitting data, such as
a smart thermostat adjusting temperature based on occupancy.

Control: Limited control over the physical environment, focusing more on data
collection and communication.

Examples: Smart home devices, wearable fitness trackers, environmental
sensors.
IoT vs CPS

Cyber-Physical Systems (CPS)


Focus: CPS integrate computational elements with physical processes. These
intelligent systems have monitoring and controlling capabilities.
Functionality: Performs complex tasks like real-time data analysis, decision-
making, and control actions, such as a self-driving car navigating traffic.
Control: Exerts a higher degree of control over the physical environment,
involving closed-loop feedback systems.
Examples: Smart factories, autonomous vehicles, medical devices, power grids.
IoT vs CPS

Vulnerabilities and security challenges to CPS
Complexity: CPS are complex, with diverse components and intricate
interactions, making them difficult to secure. Attackers can find vulnerabilities
in individual components or in the communication networks that connect
them.
Connectivity: The increasing connectivity of CPS exposes them to a wider
range of cyber threats. Attackers can gain access to CPS through the internet
or other networks, causing significant damage.
Legacy Systems: Many infrastructures rely on legacy OT systems that were not
designed with cyber security in mind. These outdated systems are often
poorly patched and lack basic security features, making them easy targets for
attackers.
Temporal Evaluation of a sensor network
Data analytics in Cyber Physical Systems
(CPS)?
Sensor networking and its applications can be adapted to
Industrial Control Systems (ICS) and computer networks for
supporting cyber security
Let us consider a part of the sensor network with the nodes A, B, C, and D
We evaluate the nodes based on connections or number of
links in (a)
Here nodes A and C can be considered important as they are
connecting hubs, where a large number of edges are incident,
indicating a high degree of communication
Similarly node B can be considered important because it is a
connector between the two hubs
Data analytics in Cyber Physical Systems
(CPS)?
However, in (b), we see a different scenario, where it is difficult
to determine the clear importance based on links
Thus, simply using metrics such as centrality, focusing on the
number of edges incident on a node, may not be the best or
even feasible
Evaluating the critical nodes based on their behavior over time
is much more useful here
This behavior can be captured in terms of relationship based on
edges between nodes, which could represent data transfers
between nodes, such as in a computer network or an ICS
network. For example, in (c), the data transfers at time t1, t2, t3
for the network in (b) are shown
Data analytics in Cyber Physical Systems
(CPS)?

These can now be used for link based patterns, or node based
patterns to produce a ranking of links and nodes by importance, in
terms of how many times they appear in the temporal windows as
shown in (d)
These can be mined in an association rule based method
Certain time periods may be more important and thus, may be given
more weight, which may be mined using quantitative association rule
based methods
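The idea of ranking links and nodes by how often they appear across temporal windows can be sketched as follows. The windows and edges below are hypothetical stand-ins for the data transfers at t1, t2, t3 in the figure.

```python
from collections import Counter

# Hypothetical data transfers (edges) observed in three temporal windows
# for a network over nodes A-D.
windows = {
    "t1": [("A", "B"), ("B", "C")],
    "t2": [("A", "B"), ("A", "D")],
    "t3": [("A", "B"), ("B", "C"), ("C", "D")],
}

# Link-based patterns: count how often each edge appears across windows
link_counts = Counter(edge for edges in windows.values() for edge in edges)

# Node-based patterns: count how often each node participates in a transfer
node_counts = Counter(node for edges in windows.values()
                      for edge in edges for node in edge)

# Rank links and nodes by frequency of appearance over time
ranked_links = link_counts.most_common()
ranked_nodes = node_counts.most_common()
```

More weight could be given to important time periods by replacing the unit count with a per-window weight, in the spirit of the quantitative association-rule methods mentioned above.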
Internet of Things(IoT)
Internet-of-Things (IoT) is used interchangeably with CPS and considered
complementary to it in some studies

Other studies have distinguished IoT as a class of CPS or seen it as


intersecting with CPS

IoT has indeed become a big share of the CPS space due to the
explosion in the number of devices and advancements in smart
devices and sensors

Even though estimates vary, they are projected to grow anywhere


from 18 billion to 50 billion devices worldwide by 2020

This is a very large number of devices by any estimate and creates


more security challenges given the highly unregulated and non-
standard market for IoT devices
Internet of Things(IoT)
One such space where smart devices have created this interesting
intersection between cyber physical and Internet of Things is a
smart car.

We are making our vehicles smart and fully connected to the Internet
to view real-time traffic and weather, talk on the phone via
Bluetooth, listen to the radio, watch video, and get real-time
status of the automobile’s mechanical functions

However, this smart interface comes with a price, which is the


vulnerability to threats as well as malfunctions to mechanical parts
of the vehicle. Another area where IoT plays a big role is home
security systems, with fire alarms, CO monitors, door and garage
sensor alarms, and temperature control sensors
Car and Home sensors
IoT

Let us consider a smart home with a series of devices


measuring various phenomena around them

In such a setting, depicted, several smart devices collect the


information from a location with different levels of precision,
collecting different data streams, perhaps using different
standards

We may want to evaluate behavioral aspects such as: How are


we using our devices? Are there behavioral trends?

We may also want to evaluate aspects of something anomalous


in this complex space such as: Are there deviations from these
trends? How do we discover threats and attacks?
IoT

The data collected here is often of different modalities including


spatial, temporal, image and text data

The data can also be sliced in a multi-dimensional view over


time. The attack space is simply unmanageable

Let us consider a recent case study evaluating an incident at a
university, where close to 5000 systems were making DNS lookups
every 15 minutes

In our current connected environments, edge devices, vending
machines, environmental systems, alarm systems, light bulbs,
and every other connected device on a university campus can
lead to a massive attack space. This truly becomes a
needle-in-the-haystack problem
IoT

Responsibility for IoT security lies both with the device maker and the user

This still creates difficult scenarios where IoT use is occurring in a


public or shared space

This landscape is one of the frontiers of cybersecurity and calls
for developing novel data analytics solutions for such a space
Multi Domain Mining

Data in the real world is generated by multiple sources and is often
heterogeneous in terms of the types of attributes in each dataset.
To be preemptive and provide actionable insights, data from multiple
sources needs to be analyzed.
Such type of mining is referred to as multi-domain mining, where
domain refers to distinct sources of data and these distinct sources
may be completely disparately generated
A couple of examples are discussed to highlight the challenges and
potential solutions to analyzing disparate data sources to provide
actionable knowledge for events
Multi Domain Mining: Integrating multiple
heterogeneous data
In a computer network there are various mechanisms to allow for
analyzing the network traffic data

There may be scenarios where we want to expand the decision criteria


especially when we may not have access to any traffic data, such as
payload, but only header information

In such scenarios we can augment the header information with other
types of data

One such viewpoint is that of geospatial data, which can enhance
knowledge of the IP session or even the IP reputation score itself
Multi Domain Mining: Integrating
multiple heterogeneous data
Current reputation systems pursue classification into a white and black
list, i.e., binary categorization
Separate lists for URLs and IP addresses are maintained
Some tools that provide such lists include Cisco SenderBase
(https://www.senderbase.org/), VirusTotal IP reputation
(https://www.virustotal.com/), and the Spam and Open Relay Blocking
System (SORBS) (http://www.sorbs.net/)
Most of these tools and lists are based on single dimensional features
with no correlation among them
Such shortcoming degrades a system’s effectiveness for detecting
sophisticated attacks and terminating malicious activities
Multi Domain Mining: Integrating
multiple heterogeneous data
However, the set of attributes that the reputation scoring considers
can be enriched, providing an expressive scoring system that enables
an administrator to understand what is at stake, and increasing
robustness by correlating the various pieces of information while
factoring in the trustworthiness of their sources
Multi Domain Mining: Integrating multiple
heterogeneous data-IP Reputation scoring
The IP reputation scoring model can be enriched using network session
features and geo-contextual features, such that the incoming session IP
is labeled based on the most similar IP addresses, in terms of both
network features and geo-contextual features

This can provide better threat assessment by considering not only the
network features but also additional context, such as the geospatial
context information collected from external sources

Indeed in some countries, networks may encounter or even host large


quantities of attacks as compared to others
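A minimal sketch of this similarity-based labeling, assuming a plain Euclidean nearest-neighbor vote over combined network and geo-contextual features. The feature vectors, labels, and the `country_risk` attribute are all invented for illustration.

```python
import math

# Hypothetical labeled history: each known IP is described by network
# features (bytes sent, session seconds) plus a geo-contextual feature
# (a country risk score), with an invented benign/malicious label.
known = [
    # (bytes_sent, session_secs, country_risk), label
    ((1200, 30, 0.2), "benign"),
    ((90000, 2, 0.9), "malicious"),
    ((1500, 25, 0.3), "benign"),
    ((85000, 3, 0.8), "malicious"),
]

def distance(a, b):
    """Euclidean distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def knn_label(query, k=3):
    """Label an incoming session IP by majority vote of its k nearest IPs."""
    nearest = sorted(known, key=lambda item: distance(query, item[0]))[:k]
    labels = [label for _, label in nearest]
    return max(set(labels), key=labels.count)  # majority vote
```

In practice the features would need normalization so that large-valued attributes such as byte counts do not dominate the distance.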
Multi Domain Mining: Integrating multiple
heterogeneous data-IP Reputation scoring
This may be due to shortage of cyber security expertise, level of
development, the abundance of resources, corruption levels, or the
computing culture in these countries. Identifying these factors and
quantifying them can provide insights into security policies and have a
positive impact on the attack incidents

These scenarios not only impact the countries facing such cyber
security crises but also impact other countries and end users due to
the level of connectivity in today’s day and age

Studies have also identified regions across the world which are prone
to hosting certain types of attacks
Multi Domain Mining: Integrating multiple
heterogeneous data-IP Reputation scoring
For example, studies have indicated that Trojans, worms, and viruses
are most prevalent in Sub-Saharan Africa, and some families of malware
preferentially target Europe and the US. Yet other studies have explained
broad categories of worldwide systemic risks and country-specific risks,
where country-specific risks include aspects of economy, technology,
industry, and international cooperation in enforcement of laws and
policies
Multi Domain Mining: Integrating multiple
heterogeneous data
Geospatial data not only provides additional context but provides
a framework to accommodate additional geo-political information
which often plays a big role in hacktivism or politically inspired
attacks. The figure provides a set of rich sources to access
geospatial data for countries and in some cases even at a granular
level of cities

Some of these sources such as Ip2location provide a way to


identify a user location based on IP address in a non-intrusive
manner. Several other data sources, such as World Bank Open Data
(https://data.worldbank.org/), PRIO-GRID (http://grid.prio.org/#/), and
ACLED (https://www.acleddata.com/), provide socio-political and
geopolitical conflict data
Multi Domain Mining: Integrating multiple
heterogeneous data
Such data can be used to create Geospatial characterization of
regions (for example, using methods proposed by Janeja et al. 2010)

When an IP address is encountered, it can be geolocated using IP
location databases such as Ip2location or MaxMind. Based on its
geolocation, the location score from the characterization can be
attributed to it

The geospatial attributes for this region can be appended to the


network attributes for this IP (Sainani 2018)
Multi Domain Mining: Integrating multiple
heterogeneous data

Any additional security intelligence can be appended to provide an


aggregate reputation score to this IP

The data heterogeneity in terms of types of attributes, namely


categorical vs. continuous can be addressed using methods which
are capable of handling mixed attribute datasets (such as Misal
2016)
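One simple way to handle mixed categorical and continuous attributes is a Gower-style dissimilarity: range-normalized differences for numeric columns and 0/1 mismatch for categorical ones. This is a generic sketch, not necessarily the method of Misal 2016, and the example records and ranges are invented.

```python
# Gower-style dissimilarity sketch for mixed-attribute records.
def mixed_distance(a, b, kinds, spans):
    """kinds[i] is 'num' or 'cat'; spans[i] is the value range of
    numeric column i (ignored for categorical columns)."""
    total = 0.0
    for x, y, kind, span in zip(a, b, kinds, spans):
        if kind == "num":
            total += abs(x - y) / span       # normalized numeric difference
        else:
            total += 0.0 if x == y else 1.0  # categorical mismatch
    return total / len(a)

# Example: (bytes_sent, protocol, country) records with an assumed byte range
d = mixed_distance((1000, "tcp", "US"), (2000, "tcp", "DE"),
                   ("num", "cat", "cat"), (10000, 1, 1))
```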
Multi Domain Mining: Integrating multiple
heterogeneous data
Integrated alerts from multiple sources

Computer networks are increasingly facing the threat of


unauthorized access

Other networks such as sensor networks, industrial control


systems also face similar threats

Intrusion detection aims at identifying such threats using


signatures of unauthorized access or attacks

There are very few systems which address the issue of ‘zero-day’
attacks, where the attack signature is not known beforehand
Integrated alerts from multiple sources

Let us consider a scenario where the threat is two-pronged: first,
there is an attack on the organization, and second, there is an
attack on a partner which shares key resources

In the first part of the attack intruders take advantage of


vulnerabilities in public-facing web servers

In addition, hackers secretly scout the network from


compromised workstations which have already been targeted
beforehand as part of a coordinated prolonged attack

The second part of the attack starts with spear-phishing


Integrated alerts from multiple sources

groups of people with something in common such as common


employer, similar banking or financial institution, same college,
etc. The e-mails are deceptive since they appear to be from
organizations from which victims are expecting emails

Potentially a second group of hackers institutes a spear-


phishing attack on the organization’s major business partners,
with which it shares network resources

The hackers are able to obtain a privileged account and compromise
a root domain controller that is shared by the organization and its
partner. When the intruders try to recreate and assign privileges,
it triggers an alarm
Deep Learning
Deep learning is a type of machine learning that learns the
features through multiple layers of abstractions

For example, if the task is learning to recognize a picture of an


individual, the deep learning model may start with various levels
of abstractions, starting with the most basic pixels in an image, to
an abstraction of an outline of the nose to the outline of the facial
structure

Deep learning algorithms compute the representations of one


layer by tuning the parameters from the previous layers (LeCun 2015)
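A tiny forward pass illustrates the idea of layer-by-layer representations. The layer sizes are arbitrary and the weights are random rather than trained, so this shows only the structure that training would tune.

```python
import numpy as np

# Minimal sketch of layered representations: each layer transforms the
# previous layer's output through weights and a nonlinearity.
rng = np.random.default_rng(0)

def relu(x):
    """Nonlinearity applied at each hidden layer."""
    return np.maximum(0.0, x)

def forward(x, weights):
    h = x
    for w in weights[:-1]:
        h = relu(h @ w)        # each hidden layer builds a new representation
    return h @ weights[-1]     # final layer produces the prediction

# Three layers: 4 input features -> 8 -> 8 -> 1 output (sizes arbitrary)
weights = [rng.normal(size=(4, 8)), rng.normal(size=(8, 8)),
           rng.normal(size=(8, 1))]
y = forward(rng.normal(size=(2, 4)), weights)  # two samples, one output each
```

Training would backpropagate a loss through these layers to tune the weights, which is the "tuning the parameters from the previous layers" described above.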
Deep Learning
As the amount of data increases, the performance of most machine
learning algorithms plateaus. However, performance of deep
learning algorithms increases as the amount of input data
increases. An example deep learning model is shown in figure
where the input is translated into several features or
representations in layer 1

Some of these representations can be dropped in subsequent


layers, throughout the layers the representations are weighted
based on the reduction in a loss function until the model
converges to the output which is the prediction task
Deep Learning
Deep learning has found a major application in computer vision
where images can be labelled based on their most basic of
features and abstracting to the higher level composition of the
images

Deep learning has also found applications in anomaly detection


Deep learning emulates how the human brain learns through
connections of neurons

The most fundamental level of learning comes from neural


networks which were in vogue in the early 1960’s and have now
had a renewed interest due to the deep learning algorithms
Deep Learning
The difference is now we have the availability of massive amounts
of data and computing capacity which has resulted in stronger
models and learning algorithms
Deep Learning: Challenges
Deep learning models have several hyperparameters that need to be
predetermined and tuned, including the number of layers, the number
of nodes in each layer, network weights, and dropouts in each
layer.
Some of these factors are also interdependent and can also
impact the learning rate of the model
If the input data is not large, the model cannot be trained well.
The true strength of deep learning emerges on massive datasets
and requires heavy parameter tuning
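The scale of the tuning problem can be seen by simply enumerating a small hyperparameter grid; the candidate values below are arbitrary.

```python
from itertools import product

# Sketch of enumerating candidate hyperparameter settings before tuning.
# Layer count, width, dropout, and learning rate values are illustrative.
grid = {
    "layers": [2, 4, 8],
    "width": [64, 128],
    "dropout": [0.0, 0.5],
    "learning_rate": [1e-2, 1e-3],
}

configs = [dict(zip(grid, values)) for values in product(*grid.values())]
# 3 * 2 * 2 * 2 = 24 candidate configurations
```

Each of the 24 configurations would need to be trained and evaluated, and the settings also interact with each other, which is why tuning deep models on large data is expensive.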
Deep Learning: Challenges
A further challenge is explainability: understanding how these
parameters impact the outputs, and how the final outcomes should
be interpreted
Generative Adversarial Networks (GAN)

Let us break GAN into its three parts:

Generative – learns a generative model, which describes how data is
generated in terms of a probabilistic model. In simple words, it
explains how data is generated visually.
Adversarial – the training of the model is done in an adversarial
setting.
Networks – deep neural networks are used for training purposes.
Generative Adversarial Networks (GAN)

GAN Techniques have shown impressive results in various


domains like image synthesis, text generation, and video
generation, enhancing the field of generative modeling and
enabling new creative applications in artificial intelligence.
Generator Network: Takes random input to generate samples
resembling training data.
Discriminator Network: Distinguishes between real and
generated samples.
Adversarial Training: Generator tries to fool the discriminator,
and the discriminator improves its distinguishing skills.
Generative Adversarial Networks (GAN)

Progression: Generator produces more realistic samples;


discriminator becomes better at identifying them.
Applications: Image synthesis, text generation, video
generation, creating deepfakes, enhancing low-resolution
images.
Types of GANs
DCGAN (Deep Convolutional GAN): One of the most used, powerful, and
successful types of GAN architecture. It is implemented with the help of
ConvNets in place of a multi-layer perceptron. The ConvNets use
convolutional strides, are built without max pooling, and the layers in
this network are not fully connected.
Conditional GAN (CGAN): A deep learning neural network in which some
additional parameters are used. Labels are also fed into the inputs of the
discriminator in order to help it classify the input correctly and not be
easily fooled by the generator.
Types of GANs
Least Squares GAN (LSGAN): A type of GAN that adopts the least-squares
loss function for the discriminator. Minimizing the objective function of
LSGAN results in minimizing the Pearson chi-squared divergence.
Auxiliary Classifier GAN (ACGAN): The same as CGAN and an advanced
version of it. The discriminator should not only classify the image as
real or fake but should also provide the source or class label of the
input image.
Dual Video Discriminator GAN (DVD-GAN): A generative adversarial network
model for video generation built upon the BigGAN architecture. DVD-GAN
uses two discriminators: a spatial discriminator and a temporal
discriminator.
Types of GANs
SRGAN: Its main function is to transform low-resolution images into
high-resolution ones, known as domain transformation.
CycleGAN: Released in 2017, it performs the task of image-to-image
translation. For example, trained on a horse image dataset, it can
translate horse images into zebra images.
InfoGAN: An advanced version of GAN which is capable of learning
disentangled representations in an unsupervised approach.
Training & Prediction of Generative
Adversarial Networks (GANs)
Step 1: Define a Problem
The problem statement is key to the success of the project, so the
first step is to define your problem. GANs work on different classes
of problems, so you need to define what you are creating: audio,
poetry, text, or images.
Step 2: Select the GAN Architecture
There are many different types of GAN architecture, as discussed
above. We have to define which type of GAN architecture we are using.
Training & Prediction of Generative
Adversarial Networks (GANs)
Step 3: Train the Discriminator on Real Data
The discriminator is first trained on the real dataset over n epochs;
while it is trained, the generator's weights are frozen. The real data
is provided without noise, and for fake examples the discriminator uses
instances created by the generator as negative samples. During
discriminator training:
 It classifies both real and fake data.
 The discriminator loss penalizes it when it misclassifies real as
fake or vice versa.
 The weights of the discriminator are updated through the
discriminator loss.
Training & Prediction of Generative
Adversarial Networks (GANs)
Step 4: Train the Generator
Provide some noise as input to the generator; from this random noise it
generates fake outputs. When the generator is trained, the discriminator
is idle, and when the discriminator is trained, the generator is idle.
During generator training, the generator tries to transform random noise
into meaningful data; getting meaningful output takes time and runs over
many epochs. The steps to train the generator are:
• Get random noise and produce a generator output on the noise sample.
• Predict from the discriminator whether the generator output is
original or fake.
• Calculate the discriminator loss.

Training & Prediction of Generative
Adversarial Networks (GANs)
• Perform backpropagation through both the discriminator and the
generator to calculate gradients.
• Use the gradients to update the generator's weights.
Step 5: Train the Discriminator on Fake Data
The samples generated by the generator are passed to the discriminator,
which predicts whether the data passed to it is fake or real and
provides feedback to the generator again.
Training & Prediction of Generative
Adversarial Networks (GANs)
Step 6: Train the Generator with the Output of the Discriminator
The generator is again trained on the feedback given by the
discriminator and tries to improve its performance.
This is an iterative process that continues until the generator
succeeds in fooling the discriminator.
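Steps 3-6 can be condensed into a toy, runnable sketch: a linear generator learns to imitate samples from a normal distribution with mean 3, against a logistic-regression discriminator. Every modeling choice here (1-D data, linear generator, learning rate, step count) is an assumption made to keep the example tiny.

```python
import numpy as np

# Toy 1-D GAN: generator g(z) = a*z + b vs discriminator
# D(x) = sigmoid(w*x + c), trained with alternating gradient steps.
rng = np.random.default_rng(42)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

a, b = 1.0, 0.0        # generator parameters
w, c = 0.0, 0.0        # discriminator parameters
lr, batch = 0.05, 64

for _ in range(2000):
    real = rng.normal(3.0, 1.0, batch)   # real data ~ N(3, 1)
    z = rng.normal(0.0, 1.0, batch)      # random noise input
    fake = a * z + b                     # generated samples

    # Steps 3/5: train the discriminator on real and generated samples
    d_real, d_fake = sigmoid(w * real + c), sigmoid(w * fake + c)
    grad_w = np.mean((d_real - 1.0) * real) + np.mean(d_fake * fake)
    grad_c = np.mean(d_real - 1.0) + np.mean(d_fake)
    w -= lr * grad_w
    c -= lr * grad_c

    # Steps 4/6: train the generator using the discriminator's feedback
    d_fake = sigmoid(w * fake + c)
    dL_dg = -(1.0 - d_fake) * w          # non-saturating generator loss
    a -= lr * np.mean(dL_dg * z)
    b -= lr * np.mean(dL_dg)

generated_mean = float(np.mean(a * rng.normal(0.0, 1.0, 1000) + b))
```

After training, the mean of the generated samples typically ends much closer to the real mean of 3 than its starting point of 0: the generator "fooling" the discriminator in miniature.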
Training & Prediction of Generative
Adversarial Networks (GANs)
Generative Adversarial Networks (GAN)

Generative Adversarial Networks (GAN)

We can see that the two components are pitted against each
other. Here an input (combination of noise and random images)
is provided to the generator, which generates samples
On the other hand the discriminator which is trained on the real
world images examines these generated samples
These samples are labelled as real or fake. In addition, the
discriminator also learns from the loss of labelling the samples
and corrects the weights in the discriminator
Generative Adversarial Networks (GAN)

Ethical Thinking in the Data Analytics Process

Ethics in data analytics involves more than just following legal
guidelines; it encompasses a commitment to fairness, transparency,
accountability, and respect for the privacy and rights of individuals.
Without a strong ethical framework, data analytics can lead to biased
outcomes, discrimination, and a loss of trust among stakeholders.
Best Practices for Ethical Data Use

1. Establish a Data Governance Framework


Develop a comprehensive data governance framework that outlines policies
and procedures for data collection, storage, analysis, and sharing. This
framework should include:
Data Ownership: Clearly define who owns the data and who is responsible
for ensuring its ethical use.
Data Stewardship: Assign data stewards to oversee the ethical management
of data within specific domains.
Compliance with Regulations: Ensure that your data practices comply with
relevant laws and regulations, such as GDPR or CCPA.
Best Practices for Ethical Data Use

2. Ensure Transparency and Accountability


Transparency is essential for building trust with stakeholders. Ensure that data
collection and usage practices are transparent, and that accountability
mechanisms are in place. This includes:
Data Transparency: Clearly communicate how data will be used, who will
have access to it, and for what purpose.
Accountability Structures: Establish roles and responsibilities to ensure
individuals are accountable for ethical data use.
Regular Audits: Conduct regular audits to assess compliance with ethical
standards and identify areas for improvement.
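A minimal sketch of the kind of access audit trail that supports these practices (the field names and in-memory list are illustrative assumptions; a real system would write to durable, tamper-evident storage):

```python
import datetime

# Append-only record of who accessed which dataset, and why.
AUDIT_LOG = []

def log_access(user: str, dataset: str, purpose: str) -> None:
    """Record a data access event with a UTC timestamp."""
    AUDIT_LOG.append({
        "timestamp": datetime.datetime.now(datetime.timezone.utc).isoformat(),
        "user": user,
        "dataset": dataset,
        "purpose": purpose,
    })

log_access("analyst_1", "customer_purchases", "quarterly churn analysis")
```

An auditor can then filter the log by dataset or purpose when assessing compliance.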
Best Practices for Ethical Data Use

3. Implement Data Minimization Principles


Data minimization is a key principle in ethical data use, focusing on collecting
only the data necessary for the intended purpose. This can be achieved by:
Purpose Limitation: Collect data only for specific, legitimate purposes and
avoid over-collection.
Data Retention Policies: Establish policies for how long data will be kept and
ensure it is deleted when no longer needed.
Anonymization and Pseudonymization: Where possible, anonymize or
pseudonymize data to protect individual identities.
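As an illustration of pseudonymization, the sketch below (with hypothetical identifiers and key handling) replaces a raw identifier with a keyed hash, so records remain joinable across tables without exposing the original value:

```python
import hashlib
import hmac

# Assumption: in practice this key lives in a secrets manager and is rotated.
SECRET_KEY = b"rotate-me-regularly"

def pseudonymize(user_id: str) -> str:
    # A keyed hash (HMAC-SHA256) rather than a plain hash: without the key,
    # identifiers cannot be recovered by brute-forcing common values.
    return hmac.new(SECRET_KEY, user_id.encode(), hashlib.sha256).hexdigest()

record = {"user_id": "alice@example.com", "purchase": "book"}
safe_record = {**record, "user_id": pseudonymize(record["user_id"])}
```

Because the mapping is deterministic, the same person maps to the same pseudonym in every table, which preserves analytic utility while protecting the raw identifier.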
Best Practices for Ethical Data Use

4. Address Bias and Fairness


Bias in data analytics can lead to unfair outcomes and discrimination. To
promote fairness, consider the following practices:
Bias Detection Tools: Utilize tools and techniques to detect and mitigate
biases in your data and algorithms.
Diverse Data Sources: Ensure that the data used in analytics is representative
of the population to avoid skewed results.
Inclusive Decision-Making: Involve diverse stakeholders in the decision-making process to gain different perspectives.
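A first-pass bias check along these lines can be as simple as comparing positive-outcome rates across groups (a demographic-parity check). The data below is purely illustrative:

```python
from collections import defaultdict

# Hypothetical model outputs: group membership and binary decision.
predictions = [
    {"group": "A", "approved": True},  {"group": "A", "approved": True},
    {"group": "A", "approved": False}, {"group": "B", "approved": True},
    {"group": "B", "approved": False}, {"group": "B", "approved": False},
]

totals, positives = defaultdict(int), defaultdict(int)
for p in predictions:
    totals[p["group"]] += 1
    positives[p["group"]] += int(p["approved"])

# Approval rate per group, and the gap between best- and worst-treated groups.
rates = {g: positives[g] / totals[g] for g in totals}
disparity = max(rates.values()) - min(rates.values())
```

A large disparity does not prove discrimination by itself, but it flags where the data and model deserve closer scrutiny with dedicated fairness tooling.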
Best Practices for Ethical Data Use

5. Educate and Train Employees


Ensure that all employees involved in data handling are aware of ethical
standards and best practices. This can be achieved through:
Regular Training: Provide regular training sessions on ethical data use, privacy
regulations, and best practices.
Ethical Guidelines: Develop and distribute clear ethical guidelines that
employees can refer to when handling data.
Ethics Committees: Establish ethics committees to provide oversight and
guidance on complex ethical issues.
Best Practices for Ethical Data Use

6. Foster a Culture of Ethics


Promote a culture of ethics within your organization where ethical
considerations are a core part of decision-making processes. Strategies to
achieve this include:
Leadership Commitment: Ensure that leadership is committed to ethical
practices and sets a strong example for others.
Ethical Decision-Making: Incorporate ethical considerations into all data-
related decisions, from the top down.
Open Communication: Encourage open dialogue about ethical issues and provide channels for reporting unethical practices.
Case Studies: Ethical Dilemmas in Data
Analytics
1. The Target Pregnancy Prediction Controversy
In 2012, Target made headlines when it was revealed that their data analytics
team could predict when a customer was pregnant based on purchasing
patterns. The company used this information to send personalized
advertisements, which led to a highly publicized incident where a father
discovered his teenage daughter’s pregnancy through these ads. This case
raises significant ethical questions about data privacy, consent, and the
unintended consequences of predictive analytics.
Ethical Issues: Invasion of privacy, lack of informed consent, potential
emotional harm.
Outcome: Target revised its data practices, including how it uses predictive
analytics and communicates with customers.
Lessons Learned: Organizations must consider the potential impact of using
predictive analytics on individuals’ privacy and ensure that data is used
responsibly and transparently.
Case Studies: Ethical Dilemmas in Data
Analytics
2. Cambridge Analytica and the 2016 U.S. Election
One of the most infamous cases in recent history, the Cambridge Analytica
scandal involved the misuse of data from millions of Facebook users to
influence voter behavior during the 2016 U.S. presidential election. The
company exploited data obtained without proper user consent to create
psychographic profiles and target individuals with personalized political ads.
This case highlights severe ethical breaches in data analytics, including data
exploitation, lack of transparency, and the manipulation of public opinion.
Ethical Issues: Data misuse, lack of informed consent, manipulation of
personal information, impact on democracy.
Outcome: The scandal led to widespread criticism, multiple investigations,
significant financial penalties for Facebook, and the eventual closure of
Cambridge Analytica.
Lessons Learned: The importance of transparency in data collection and use,
the necessity for robust data protection laws, and the ethical responsibility of
companies to avoid manipulating personal data for deceptive purposes.
Thank you!