Data Science Day 1
[Timeline: 1700s, astronomer Tobias Mayer makes a quantitative argument that more data is better and is considered the first unofficial data scientist; 1952, IBM pioneer Arthur Samuel coins the term “machine learning”; 1997, chess grandmaster Garry Kasparov is easily defeated by an IBM supercomputer in 19 moves; 2008, Dr. DJ Patil and Jeff Hammerbacher coin the term “data science”; today, data experts learn how to build machine learning models.]
In the 1300s, William of Ockham, a philosopher and friar, argued that scientists should prefer simpler theories
over more complex ones. The principle that bears his name, known as “Ockham's razor,” can be applied to
machine learning by looking for the simplest solution.
In the 1750s, astronomer Tobias Mayer made a quantitative argument that more data is better. While studying
the motions of the moon, he collected nine times as many data points as necessary, claiming this made his
observations more accurate. Because of this, he is often considered the first true “data scientist.”
In 1952, Arthur Samuel, an IBM pioneer in computing, gaming, and AI, coined the term “machine learning.” He
designed a checkers-playing program and discovered that the more the computer played, the more winning
strategies it learned from experience.
In 1962, mathematician John W. Tukey predicted the effect of modern-day electronic computing on data analysis
as an empirical science. However, Tukey’s predictions occurred decades before the explosion of big data and the
ability to perform complex and large-scale analyses.
In 1997, an IBM supercomputer called Deep Blue defeated chess grandmaster Garry Kasparov in only 19 moves,
and Kasparov resigned after the match. The highly advanced supercomputer could calculate as many as 100 billion to
200 billion positions in the three minutes traditionally allotted to a player per move in standard chess.
In 2008, Dr. DJ Patil of LinkedIn and Jeff Hammerbacher of Facebook coined the term “data science” to describe
an emerging field of study focused on teasing out the hidden value in data collected from the retail and business
sectors.
Organizations have many types of unique, and often unstructured, data from many
different sources: things like equipment sensors, mobile apps, social media, customer
interactions via voice and text, videos, images, documents, and more.
Organizations want to use all data to produce new insights and new data products. They
want to improve their business operations by creating better customer experiences,
anticipating service demand, and preventing avoidable equipment outages.
The next generation of business problems, or scenarios, requires being able to use all data, and that calls for the
capabilities provided by data science, machine learning, and AI to understand and use that data.
Importance of Data Science and AI
[Diagram: the “all data” opportunity. Next-gen scenarios must be able to use and understand data of increasing complexity, from structured DB/app data through semi-structured to unstructured data; examples include orders, retention details, weather, DNA, phone call scripts, medical forms, and social data.]
What is Oracle AI?
Unified cloud services for AI and machine learning (ML)
[Diagram layers, top to bottom:
Applications
AI Services: Oracle Digital Assistant, OCI Language, OCI Speech, OCI Vision, OCI Anomaly Detection, OCI Forecasting
ML Services: Oracle Cloud Infrastructure Data Science, Machine Learning in Oracle Database, OCI Data Labeling
Data]
“Oracle AI” is this portfolio of cloud services for helping organizations take advantage of all data for the next
generation of scenarios.
The foundation of all of this is data. Obviously, AI and machine learning work on data and require data.
The top layer of this diagram is applications, and this loosely refers to all the ways AI is consumed. That could be an
application, a business process, or an analytics system.
Between the application and data layers you see two groups here, the AI services (on top) and the machine learning
services (on the bottom). The difference between the two groups is that machine learning services are used primarily
by data scientists to build, train, deploy, and manage machine learning models.
Data scientists can work with familiar open-source frameworks in OCI Data Science, and that’s the cloud service that’s
the focus of this course. Data scientists and database specialists can take advantage of machine learning algorithms
built in to Oracle Database. An important service that supports both the machine learning and AI services is OCI Data
Labeling, because when you’re building machine learning models that work on images, text, or speech, you need
labeled data to train the models.
The AI services contain prebuilt machine learning models for specific uses. Some of the AI services are pretrained, and
some are trained by the customer with their own data. All are used by simply calling the API for the service, passing in
data to be processed; the service returns a result.
OCI Services That Support AI and ML
[Diagram: the AI services are supported by Streaming, Data Integration, Data Catalog, GoldenGate and Oracle Data Integrator, Big Data Service, Data Flow, Object Storage, and Autonomous Database, all running on the cloud infrastructure.]
• Business analytics and graph analytics, and many forms of data integration and data
management, all running on the basic cloud infrastructure.
What is Oracle Cloud Infrastructure Data Science?
[Diagram: a JupyterLab notebook interface and model explanation tools running on managed infrastructure: CPU, GPU, storage, and network.]
Core Principles of Oracle Cloud Infrastructure Data Science
Accelerated
Collaborative
Enterprise-Grade
Accelerated
Allow data scientists to work how they want, and provide access to automated workflows, the best of open-source
libraries, and a streamlined approach to building models
The first principle is about accelerating the work of the individual data scientist. Data scientists coming out of
universities today have been trained using open-source tools and that's what they're most comfortable with. But using
open-source tools on a laptop means managing lots of libraries from different sources and being limited to the compute
power on the laptop.
OCI Data Science provides data scientists with open-source libraries along with easy access to a range of compute
power without having to manage any infrastructure. It also includes Oracle's own library to help streamline many
aspects of their work.
Collaborative
Enable data science teams to work together, with ways to share and reproduce models in a structured, secure way for
enterprise-grade results
The second principle is collaboration. It goes beyond individual data scientist productivity to enable data science
teams to work together.
This is done through the sharing of assets, reducing duplicative work, and supporting reproducibility and auditability
of models for collaboration and risk management.
Enterprise-Grade
Provide a fully managed platform built to meet the needs of the modern enterprise
The third principle is about being enterprise-grade. That means it's integrated with all the OCI
security and access protocols. The underlying infrastructure is fully managed. The customer doesn't
have to think about provisioning compute and storage, and the service handles all the maintenance,
patching, and upgrades so users can focus on solving business problems with data science.
It serves data scientists throughout the full machine learning life cycle, with support for Python and open-source libraries. Users work in a familiar JupyterLab notebook interface where they write Python code.
[Diagram: key concepts of the service: notebook sessions (JupyterLab, pre-installed libraries, conda environments), the Accelerated Data Science (ADS) SDK, models, the model catalog, model deployments, jobs, and projects.]
Projects
• Projects are containers that enable data science teams to organize their work. They represent
collaborative workspaces for organizing and documenting data science assets, such as notebook sessions
and models. Note that a tenancy can have as many projects as needed without limits.
Notebook Sessions
• Notebook sessions are where data scientists work. Notebook sessions provide a JupyterLab environment
with pre-installed open-source libraries and the ability to add others. Notebook sessions are interactive
coding environments for building and training models. Notebook sessions run in managed infrastructure
and the user can select CPUs or GPUs, the compute shape, and amount of storage without having to do
any manual provisioning of environments.
Conda Environments
• Conda is an open-source environment and package management system and was
created for Python programs. It is used in the service to quickly install, run, and update
packages and their dependencies. Conda easily creates, saves, loads, and switches
between environments in your notebook sessions.
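As a brief, hypothetical illustration (the environment name and packages below are examples, not part of the service), a conda environment can be described declaratively in an environment.yml file:

```yaml
# Hypothetical environment definition; name, channel, and versions are illustrative.
name: my-ds-env
channels:
  - conda-forge
dependencies:
  - python=3.10
  - pandas
  - scikit-learn
```

Running `conda env create -f environment.yml` builds the environment, and `conda activate my-ds-env` switches the session to it.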
OCI Console
Provides easy-to-use browser-based interface; enables access to notebook sessions and all service
features
The most common method is the OCI Console. The OCI Console provides an easy-to-use, browser-
based interface that enables access to Notebook sessions and all the features of the service.
Language SDKs
Provides programming language SDKs for Java, Python, .NET, Go, Ruby, and
TypeScript/JavaScript
OCI also provides programming language SDKs for Java, Python, TypeScript/JavaScript, .NET, Go,
and Ruby. These SDKs enable the user to write code to manage Data Science resources. We’ll
provide some examples of how the Python SDK can be used to deploy models and create jobs.
REST API
Provides access to service functionality; requires programming expertise
The REST API provides access to service functionality but requires programming expertise. An API
reference is provided in the product documentation.
CLI
Provides quick access and full functionality without the need for scripting
The Command Line Interface provides both quick access and full functionality without the
need for scripting.
Where to Find Data Science: OCI Region Availability
Oracle is frequently adding new regions, so visit Oracle.com/cloud to get the latest information on cloud regions.
Commercial regions
Government regions
Dedicated regions
• Oracle Cloud Infrastructure ADS SDK covers the end-to-end life cycle of machine learning models, from data acquisition to model evaluation.
Accelerated Data Science (ADS) SDK is an Oracle Cloud Infrastructure Data Science and Machine Learning
SDK (software development kit). It covers the end-to-end life cycle of machine learning models, from data
acquisition to model evaluation.
Ways to Access ADS SDK
[Diagram: ADS SDK can be accessed through conda environments, a local environment installation, or a container (modules, libraries, interpreter, programs).]
There are many helpful features of ADS SDK. Some of them include:
ADS supports loading data from multiple sources. Here is a list of data sources you
can load data from:
• Local storage
• Object storage
• Oracle Database
• Other cloud providers such as S3, Google Cloud Service, Azure
• MongoDB and NoSQL DB instance
• OCI Big Data service / HDFS
• HTTP(S) sources
• Elasticsearch instances
• Blob
Data Visualization
• Data visualization is an important part in performing exploratory data analysis (EDA) to help
gain a better understanding of the data set you are working with.
• ADS has a method show_in_notebook() that automatically creates visualizations for a data
set.
• It provides basic information about a data set including:
• Predictive data type (i.e., binary classification, multi-class classification, regression)
• Number of columns and rows of the data set
• Summary visualization of each feature
• Correlation map of the features
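To make the idea concrete, here is a minimal, plain-Python sketch of the kind of summary information show_in_notebook() reports (the helper names summarize and pearson are illustrative, not part of ADS):

```python
# Illustrative sketch of an EDA summary: row/column counts, per-feature
# means, and a pairwise correlation. ADS renders this as rich visualizations.
from statistics import mean

def pearson(xs, ys):
    """Pearson correlation between two equal-length numeric columns."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)

def summarize(dataset):
    """dataset: dict mapping column name -> list of numeric values."""
    n_rows = len(next(iter(dataset.values())))
    return {
        "n_rows": n_rows,
        "n_columns": len(dataset),
        "means": {col: mean(vals) for col, vals in dataset.items()},
    }

data = {"age": [25, 32, 47, 51], "income": [30, 42, 61, 70]}
print(summarize(data))
print(round(pearson(data["age"], data["income"]), 3))
```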
Feature Engineering
• Feature engineering is the process of transforming existing features into new ones.
• Why is this helpful? Transforming existing features into new ones can improve the quality of machine
learning models. ADS has built-in tools to simplify the process of data transformation and feature
engineering.
• ADS has the functionality to turn a data set into an ADSDataset object. Any operation that can be performed
for a Pandas dataframe can also be applied to an ADSDataset. This makes it easy to apply data
transformations.
• ADS has a built-in automatic data transformation tool that provides recommended transformations for a data set.
• ADS has built-in functions that support categorical encoding, null value handling, and imputation.
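As a concrete sketch of two transformations mentioned above, here is plain Python (no ADS; the helper names are illustrative) for mean imputation of nulls and one-hot categorical encoding:

```python
# Illustrative sketch of feature engineering: mean imputation and
# one-hot encoding, written in plain Python for clarity.
from statistics import mean

def impute_mean(values):
    """Replace None entries with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in values]

def one_hot(values):
    """Encode a categorical column as a dict of 0/1 indicator columns."""
    categories = sorted(set(values))
    return {c: [1 if v == c else 0 for v in values] for c in categories}

print(impute_mean([25, None, 47]))       # the None is filled with the mean, 36
print(one_hot(["red", "blue", "red"]))   # two indicator columns, blue and red
```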
Model Training
Allows for comparison between your trained ML models using standard metrics
• Evaluation is where you compare the different machine learning models you have trained with industry-
standard metrics and try to understand the trade-offs between them.
• ADS has an evaluation class that provides a collection of tools, metrics and charts to help with model
evaluation.
• The ADS evaluation class supports evaluation for binary classification, multi-class classification, and regression.
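For a concrete sense of the metrics involved, here is a plain-Python sketch of common binary-classification metrics (ADS computes these for you; the helper functions below are illustrative, not part of the ADS evaluation class):

```python
# Illustrative sketch: confusion-matrix counts and the accuracy, precision,
# and recall metrics derived from them for a binary classifier.
def confusion(y_true, y_pred):
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn

def metrics(y_true, y_pred):
    tp, tn, fp, fn = confusion(y_true, y_pred)
    return {
        "accuracy": (tp + tn) / len(y_true),
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }

y_true = [1, 0, 1, 1, 0]
y_pred = [1, 0, 0, 1, 1]
print(metrics(y_true, y_pred))
```

Comparing these numbers across two trained models is exactly the kind of trade-off analysis the text describes.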
Model Interpretation and Explainability
Global
Explain the general behavior of an ML model
Model Deployment
ADS has a model deployment module, ads.model.deployment, which allows you to deploy models with OCI
Data Science’s managed resource model deployments
You can use ADS to deploy a model artifact saved in the Data Science model catalog or the URI of a directory in
the local block storage or object storage.
Model deployments integrate with the OCI Logging service. You can use it to store the access and prediction
logs from model deployments. ADS provides APIs to make interacting with the Logging service simple.
• Oracle Cloud Infrastructure concepts for Data Science: compartments, user groups (a group of users), dynamic groups, and policies.
Compartments
A logical grouping of resources that can be accessed only by certain groups that have received administrator
permission
Tip: When configuring tenancies, decide how you will organize your Data Science resources, then create
compartments for those resources through the Identity Console.
Compartments allow you to organize and control access to your cloud resources.
A compartment is a logical grouping of resources that can be accessed only by certain groups that have been
given permission by an administrator.
When configuring your tenancy, the first step is to make a plan of how you will organize your data science
resources going forward.
Once you’ve made a plan, you can create one or more compartments for Data Science resources through the Identity
Console. To do that, go to Identity > Compartments and click “Create Compartment.” Enter a name and
description, and then click Create Compartment.
See https://docs.oracle.com/en-us/iaas/Content/GSG/Concepts/settinguptenancy.htm#Setting_Up_Your_Tenancy for more details.
Creating a Compartment
Cloud Console
Individual users are grouped in OCI and granted access to Data Science resources within compartments.
Tip: When configuring groups, decide how users will access resources in the compartments.
Dynamic Groups
Dynamic groups are a special type of group that contains resources (such as data science notebook sessions, job runs,
and model deployments) that match rules that you define.
These matching rules allow group membership to change dynamically as resources that match those rules are created
or deleted. These resources act as "principal" actors and can make API calls to services according to policies that you
write for the dynamic group.
For example, using the resource principal of a Data Science Notebook Session, you could make a call to the Object
Storage API to read data from a bucket, if the dynamic group of the notebook session has a policy which enables
object storage access.
Dynamic groups have matching rules, where <compartment-ocid> is replaced by the identifier of the
compartment created for Data Science.
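For example, a matching rule that places every Data Science notebook session in a given compartment into the dynamic group could look like the following (the compartment OCID placeholder is kept as-is):

```
ALL {resource.type = 'datasciencenotebooksession', resource.compartment.id = '<compartment-ocid>'}
```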
• <group-name> - This will be filled in with the name of the user group or dynamic group
• <verb> - This will define the level of access
• <resource-type> - This will specify the type of resource or resource family to be accessed
• <compartment-name> - This will be filled in with the name of the compartment
Policy Basics
Verbs define the level of access that would be permitted to the resource/resource family.
Verbs include (from least to most permissive) inspect, read, use, and manage.
• Resource type in the policy defines which specific resource you are writing the policy for. For example, Data Science
includes resources such as data-science-models or data-science-jobs.
• You can write a policy for an individual resource type; however, to make writing policies for related resource easier,
there are aggregate resource types which contain a family of related resources. The aggregate resource type for data
science is data-science-family.
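The placeholder structure described above can be sketched in Python (the group and compartment names below are hypothetical examples; only the verb ordering and statement shape come from the text):

```python
# Illustrative sketch: assemble an OCI policy statement from the
# placeholders described in the text. Group/compartment names are examples.
VERBS = ["inspect", "read", "use", "manage"]  # least to most permissive

def policy(group_name, verb, resource_type, compartment_name):
    # Guard against typos: the verb must be one of the four access levels.
    if verb not in VERBS:
        raise ValueError(f"unknown verb: {verb}")
    return (f"Allow group {group_name} to {verb} "
            f"{resource_type} in compartment {compartment_name}")

print(policy("data-scientists", "manage", "data-science-family", "ds-compartment"))
```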
Required Data Science Policies
To allow data scientists to manage all Data Science resources in a specific compartment:
Allow group <user-group-name> to manage data-science-family in compartment <compartment-name>
To allow Data Science resources, such as a notebook session, in a dynamic group to manage all Data Science
resources:
Allow dynamic group <dynamic-group-name> to manage data-science-family in compartment <compartment-
name>
These are the most critical of the required Data Science policies, not just policy examples.
More Required Data Science Policies
The following policies are required to enable users access to metrics and logging for data science resources:
To use custom networking in Data Science you will need the following policies:
They have matching rules, where <compartment-ocid> is replaced by the identifier of the
compartment created for Data Science.
They define what Data Science principals, such as users and resources, have access to in OCI.
(policy definition)
They are individual users that are grouped in OCI by administrators and granted access to Data
Science resources within compartments (user groups definition)
They are a logical grouping of resources that can be accessed only by certain groups that have
received administrator permission (compartment definition)