App of Python for DS & AI Lecture 1

Lecture 1
Application of Python language for Data Science and Artificial Intelligence

• Data Science
• Artificial Intelligence
• Python language

What Is Data Science (DS)?


Data science is a relatively new knowledge domain.
Data Science is a combination of multiple disciplines:
• statistics, statistical modeling, visualization
• linear algebra,
• machine learning,
• data storage, data retrieval,
• data analysis, business intelligence
Data Science is used to analyze data and to extract knowledge and insights from it.

Data Science is about:


• data gathering,
• analysis,
• decision-making.
Data Science is also about:
• finding patterns in data,
• making predictions about the future.
By using Data Science, we are able to make:
• better decisions,
• predictive analysis,
• pattern discoveries.
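
As a small illustration of "finding a pattern" and "making a prediction", the sketch below fits a straight line to a handful of points by ordinary least squares and uses it to predict an unseen value. The data (hours studied vs. exam score) and the function name `fit_line` are made up for this example; real projects would normally use a statistics or ML library.

```python
# Hypothetical data, invented for illustration: hours studied vs. exam score.
hours = [1, 2, 3, 4, 5]
scores = [52, 54, 56, 58, 60]


def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (the common 1/n factor cancels).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Pattern discovery: describe the data with a simple rule.
slope, intercept = fit_line(hours, scores)
print(f"pattern: score = {slope:.1f} * hours + {intercept:.1f}")

# Prediction: apply the rule to a value that was not in the data.
predicted = slope * 6 + intercept
print(f"predicted score for 6 hours: {predicted:.1f}")
```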

A data scientist requires expertise in several areas:


• Machine Learning,
• Statistics,
• Programming (Python, R, Matlab, Julia),
• Mathematics,
• Databases.

Data Science Venn Diagram


(Drew Conway 2010)

“Mathematics is not for wimps, it is a sharp tool with which it is easy to cut yourself” (S. Banach,
Polish mathematician, co-creator of functional analysis).
By analogy: “Data Science is not for wimps, it is a sharp tool with which it is easy to cut yourself.”

Data is a commodity:
• traded electronically,
o to be in this market you need to be a “hacker”
▪ at least: text file manipulation, vectorized operations, algorithmic thinking,
• processed mathematically and statistically,
o to be in this market you need to know mathematics and statistics
▪ at least baseline familiarity with math and stat
• (for example square root regression)

Piotr Zaremba Summer 2025 1/4 Lect_01_DS_intro.docx

• requires expert knowledge in selected fields (those to which the data relates)
o to be in this market you need to know which questions to ask and which hypotheses to test, and you
need to understand the test results.
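
The "hacker" baseline mentioned above (text file manipulation, operations applied to whole columns) can be sketched with the standard library alone. The CSV content and column names below are hypothetical, and the list comprehension only imitates a vectorized operation; libraries such as NumPy perform the same kind of whole-array operation without an explicit Python loop.

```python
import csv
import io

# Hypothetical CSV content; in practice this would be read from a file on disk.
raw_text = """city,temp_c
Warsaw,21.5
Krakow,23.0
Gdansk,19.5
"""

# Text file manipulation: parse the text into a list of row dictionaries.
rows = list(csv.DictReader(io.StringIO(raw_text)))

# Pull one column out and convert it from text to numbers.
temps_c = [float(row["temp_c"]) for row in rows]

# Apply one formula to the whole column at once (Celsius to Fahrenheit).
temps_f = [c * 9 / 5 + 32 for c in temps_c]

# Algorithmic thinking: find the row with the highest temperature.
warmest = max(rows, key=lambda row: float(row["temp_c"]))["city"]

print(temps_f)
print("warmest city:", warmest)
```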

In the Venn diagram above, the overlapping regions are:


• Machine Learning,
• Traditional Research,
• Danger Zone,
• Data Science.

Data science is fundamentally an interdisciplinary subject.


It comprises three distinct and overlapping skill areas:
• those of a statistician, who knows how to model and summarize datasets,
• those of a computer scientist, who can design and use algorithms to process and visualize data,
• those of a domain expert, who can formulate the right questions and put their answers in context.

What is artificial intelligence (AI)?


For our purposes, we treat AI as a technology that enables computers and machines to simulate human:
• learning,
• comprehension,
• problem solving,
• decision making,
• creativity,
• autonomy.

It is a technology that deals with:


• modeling
o knowledge,
o data,
• developing algorithmic systems.

The goal of this technology is to obtain a relatively automated system for:


• acquiring,
• processing,
• analyzing data.

Such a system should be able to:


• autonomously improve itself,
• predict behavior,
• act on the basis of the analysis of collected data and correlations between them.

Applications and devices equipped with AI can:


• see and identify objects,
• understand and respond to human language,
• learn from new information and experience,
• make detailed recommendations to users and experts,
• act independently, replacing the need for human intelligence or intervention.

Currently, the attention of scientists and AI researchers is focused on generative AI (GenAI), a
subset of AI technology that can create original:
• text,
• images,
• video,
• other forms of data.
These models:
• learn the underlying patterns and structures from their training data,
• use them to produce new data based on the input
o input data may take the form of natural language prompts.

Generative AI tools are built on:


• machine learning (ML),
• deep learning.

Machine learning (ML)


ML is a subset of AI:
• that aims to create software systems that can learn and improve their performance based on the data
they process,
• concerned with the development and study of statistical algorithms that can learn from data and
generalize to unseen data, and thus perform tasks without explicit instructions.

This means that every ML solution is an AI solution but not all AI solutions are ML solutions.
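
To make "learn from data and generalize to unseen data without explicit instructions" concrete, here is a minimal sketch of one of the simplest ML methods, a 1-nearest-neighbour classifier. The training examples, labels and function name are hypothetical. No rule such as "taller than 170 cm means large" is ever written down; the label for a new point comes only from the stored examples.

```python
# Hypothetical labelled examples: (height_cm, weight_kg) -> label.
training_data = [
    ((150, 50), "small"),
    ((155, 55), "small"),
    ((180, 85), "large"),
    ((185, 90), "large"),
]


def predict(point, examples):
    """Return the label of the training example closest to the given point."""

    def squared_distance(a, b):
        return sum((ai - bi) ** 2 for ai, bi in zip(a, b))

    _, nearest_label = min(examples, key=lambda ex: squared_distance(point, ex[0]))
    return nearest_label


# Neither of these points appears in the training data.
print(predict((152, 53), training_data))
print(predict((178, 80), training_data))
```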

Neural network (machine learning)


In machine learning, a neural network (artificial neural network) is a model inspired by the structure and
function of biological neural networks in animal brains.
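
A single artificial neuron is the building block of such a network. The sketch below is an illustrative assumption rather than part of the lecture: it trains one neuron with the classic perceptron learning rule to reproduce logical AND. The function names, the number of epochs and the learning rate are chosen only for this example.

```python
# Truth table for logical AND: ((input1, input2), expected_output).
AND_EXAMPLES = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]


def neuron_output(weights, bias, inputs):
    """Fire (return 1) if the weighted sum of the inputs plus the bias is positive."""
    weighted_sum = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1 if weighted_sum > 0 else 0


def train_neuron(examples, epochs=20, learning_rate=1):
    """Perceptron rule: after each mistake, shift the weights toward the target."""
    weights, bias = [0, 0], 0
    for _ in range(epochs):
        for inputs, target in examples:
            error = target - neuron_output(weights, bias, inputs)
            weights = [w + learning_rate * error * x for w, x in zip(weights, inputs)]
            bias += learning_rate * error
    return weights, bias


weights, bias = train_neuron(AND_EXAMPLES)
for inputs, target in AND_EXAMPLES:
    print(inputs, "->", neuron_output(weights, bias, inputs))
```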

Deep learning
Deep learning is a subset of machine learning that focuses on using multilayered neural networks to perform tasks such as:
• classification,
• regression,
• representation learning.
The adjective "deep" refers to the use of multiple layers in the network (from a few up to hundreds or even thousands of layers).
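
Why stack layers at all? A single threshold neuron cannot compute XOR, but two layers can. The sketch below is a toy with hand-picked weights (not learned, and not taken from the lecture) that passes inputs through a hidden layer and then an output layer. Real deep networks follow the same layer-by-layer pattern with many more neurons and layers and with weights learned from data.

```python
def step(z):
    """Threshold activation: 1 if the input is positive, otherwise 0."""
    return 1 if z > 0 else 0


def layer(inputs, weights, biases):
    """One fully connected layer: every neuron sees every input."""
    return [
        step(sum(w * x for w, x in zip(neuron_weights, inputs)) + b)
        for neuron_weights, b in zip(weights, biases)
    ]


# Hand-picked weights (an assumption for illustration only).
HIDDEN_W, HIDDEN_B = [[1, 1], [1, 1]], [-0.5, -1.5]  # neurons acting as OR and AND
OUTPUT_W, OUTPUT_B = [[1, -1]], [-0.5]               # fires for "OR but not AND"


def xor_network(x1, x2):
    hidden = layer([x1, x2], HIDDEN_W, HIDDEN_B)   # layer 1
    return layer(hidden, OUTPUT_W, OUTPUT_B)[0]    # layer 2


for a in (0, 1):
    for b in (0, 1):
        print(a, b, "->", xor_network(a, b))
```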

Timeline (IBM version)


• 1950s – Artificial intelligence (AI)
o Human intelligence exhibited by machines
• 1980s – Machine learning (ML)
o AI systems that learn from historical data
• 2010s – Deep learning
o ML models that mimic human brain function
• 2020s – Generative AI (GenAI)
o Deep learning models that create original content

Concepts related to AI and especially GenAI do not have a single interpretation accepted by all.

Python
A high-level, general-purpose, dynamically typed programming language with automatic garbage collection.
Designed by Guido van Rossum in the late 1980s and first released in 1991.
The design of the language emphasizes code readability.
An important element of the language's syntax is indentation (tabs or groups of spaces), which marks
where a block of code begins and ends.
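
A short example of indentation acting as syntax (the function name is made up for illustration). There are no braces or `end` keywords; which statements belong to the loop, to the `if`, or to the function is decided only by how far they are indented.

```python
def label_numbers(numbers):
    labels = []
    for n in numbers:              # everything indented below is the loop body
        if n % 2 == 0:
            labels.append("even")  # runs only when the condition is true
        else:
            labels.append("odd")
    return labels                  # dedented: runs once, after the loop ends


print(label_numbers([1, 2, 3]))
```

The usual convention is four spaces per level; mixing tabs and spaces inconsistently in one block is an error in Python 3.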

Supports multiple programming paradigms:


• object-oriented,
• procedural,
• functional.
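
The three paradigms can be compared on one small task, here the sum of the squares of the even numbers in a list. All names below are hypothetical and the task itself is chosen only for illustration.

```python
from functools import reduce

data = [1, 2, 3, 4, 5, 6]


# Procedural: a sequence of steps that update a variable.
def sum_even_squares_procedural(numbers):
    total = 0
    for n in numbers:
        if n % 2 == 0:
            total += n * n
    return total


# Object-oriented: the data and the behaviour live together in a class.
class EvenSquareSummer:
    def __init__(self, numbers):
        self.numbers = list(numbers)

    def total(self):
        return sum(n * n for n in self.numbers if n % 2 == 0)


# Functional: compose filter, map and reduce without mutating any state.
def sum_even_squares_functional(numbers):
    evens = filter(lambda n: n % 2 == 0, numbers)
    squares = map(lambda n: n * n, evens)
    return reduce(lambda acc, n: acc + n, squares, 0)


print(sum_even_squares_procedural(data))
print(EvenSquareSummer(data).total())
print(sum_even_squares_functional(data))
```

All three calls produce the same result (56 for this list); in practice Python code often mixes the styles freely.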
Release history:
• Python 0.9.0 (first release) – 1991,
• Python 2.0 – 2000,
• Python 3.0 – 2008.

Python ranks as one of the most popular programming languages.


Widely used in Data Science and Machine Learning.
One of the advantages of this language is its very extensive standard library.
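
As a small taste of the standard library, the sketch below counts words and computes summary statistics using only the built-in `collections` and `statistics` modules, with nothing to install. The sample sentence is made up for illustration.

```python
from collections import Counter
import statistics

# Hypothetical sample text.
words = "data science is about data and about patterns in data".split()

# Counter tallies how often each item occurs.
counts = Counter(words)
most_common_word, occurrences = counts.most_common(1)[0]

# The statistics module provides basic descriptive statistics.
lengths = [len(w) for w in words]
mean_len = statistics.mean(lengths)
median_len = statistics.median(lengths)

print(most_common_word, occurrences)
print(mean_len, median_len)
```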
