Unidad 2
Unidad 2
Artificial
BASADA EN DATOS
Introduction
Data is the lifeblood of AI. The more data these
algorithms consume, the more proficient
they become at understanding, predicting and
delivering personalized solutions. But
beneath this promise lies a complex world of
challenges, from ethical concerns to data
quality and privacy issues.
Data science
fundamentals
Data science combines mathematics and
statistics, specialized programming,
advanced
analytics, artificial intelligence (AI) and
machine learning with subject-specific
knowledge
to uncover actionable insights hidden in
an organization's data. These insights can
be used
to guide decision making and strategic
planning
Data
engineering.
Data engineering is a key discipline
in the field of data science, as it is
responsible for
designing, building, and maintaining
the infrastructure and tools needed
to collect, store, and process data.
This includes tasks such as data
warehousing, data integration and
data
quality management.
Data mining
Data mining is the use of machine learning and statistical analysis to discover patterns and
other valuable information from large data sets.
Data mining techniques that underpin data analytics can be deployed for two main
purposes. They can describe the target dataset or predict results using machine learning
algorithms.
Machine learning
Machine learning (ML) is the subsection of artificial intelligence (AI) that focuses on
developing systems that learn, or improve performance, based on the data they consume.
Artificial intelligence is a broad term that refers to systems or machines that mimic human
intelligence. Machine learning and AI are often named together, and the terms are
sometimes used interchangeably, but they do not mean the same thing. An important point
to note is that while all machine learning is AI, not all AI is machine learning.
Supervised learning
Supervised machine learning algorithms are the most commonly used. With this model, a
data scientist acts as a guide and teaches the algorithm what conclusions to draw. Like a
child learning to identify fruits by memorizing them with a picture book, in supervised
learning, the algorithm is trained by a data set that is already labeled and has a
predefined
outcome.
Examples of supervised machine learning include algorithms such as linear and logistic
regression, multiclass classification, and support vector machines
Unsupervised learning
Unsupervised machine learning uses a more independent approach, in which a
computer
learns to identify complex processes and patterns without close and constant
guidance from
a human being. Unsupervised machine learning involves training based on data that has
neither labels nor a specific defined outcome.
Examples of unsupervised machine learning algorithms include k-means clustering,
principal and independent component analysis, and association laws
Reinforcement Learning