Chiru Internship
Presented By:
K.Chiru Sai
23555A4404
CSE(DS)
GIER
Lecture Details:
DATA SCIENCE
Internship using the skilldzire platform
Contents
Introduction to Data Science
Python
NumPy
Pandas
Matplotlib
SQL
Machine Learning
Linear Regression
Logistic Regression
Data Science
aggregation tools.
SQL (Structured Query Language) is the standard language for managing and manipulating data in
relational database management systems (RDBMS).
It is declarative, focusing on specifying what data to retrieve or manipulate rather than the specifics
of how to do it.
Key commands include SELECT for retrieving data, INSERT for adding new records, UPDATE for
modifying existing records, and DELETE for removing records.
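As a minimal sketch of the four commands above, the following uses Python's built-in sqlite3 module with an in-memory database. The students table, its columns, and the sample rows are hypothetical and exist only for illustration.

```python
import sqlite3

# In-memory database; the "students" table and its data are hypothetical.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute(
    "CREATE TABLE students (id INTEGER PRIMARY KEY, name TEXT, score INTEGER)"
)

# INSERT: add new records.
cur.executemany(
    "INSERT INTO students (id, name, score) VALUES (?, ?, ?)",
    [(1, "Asha", 82), (2, "Ravi", 67), (3, "Meena", 91)],
)

# UPDATE: modify an existing record.
cur.execute("UPDATE students SET score = ? WHERE name = ?", (70, "Ravi"))

# DELETE: remove a record.
cur.execute("DELETE FROM students WHERE id = ?", (3,))

# SELECT: state which data is wanted, not how to fetch it.
rows = cur.execute("SELECT name, score FROM students ORDER BY id").fetchall()
print(rows)  # [('Asha', 82), ('Ravi', 70)]

conn.close()
```

The "?" placeholders pass values separately from the SQL text, which is the usual way to avoid building queries by string concatenation.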
Structured Query Language (SQL)
SQL supports various data types like INTEGER, VARCHAR, and DATE, and enables powerful
operations such as joins to combine data from different tables based on related columns,
aggregations for calculating values across rows, and constraints to ensure data integrity.
Following best practices, such as using meaningful names and formatting queries for readability,
makes SQL code easier to maintain in database management and application development.
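The sketch below illustrates data types, constraints, a join, and an aggregation together, again using Python's sqlite3 module. The departments and employees tables and their contents are assumptions made up for this example. Note that SQLite treats VARCHAR and DATE loosely as type affinities, so other database systems may enforce them more strictly.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite enforces foreign keys only when this setting is switched on.
conn.execute("PRAGMA foreign_keys = ON")

# Hypothetical tables; constraints: PRIMARY KEY, NOT NULL, UNIQUE, CHECK, REFERENCES.
conn.execute(
    "CREATE TABLE departments ("
    " dept_id INTEGER PRIMARY KEY,"
    " dept_name VARCHAR(50) NOT NULL UNIQUE)"
)
conn.execute(
    "CREATE TABLE employees ("
    " emp_id INTEGER PRIMARY KEY,"
    " emp_name VARCHAR(50) NOT NULL,"
    " hired DATE,"
    " salary INTEGER CHECK (salary > 0),"
    " dept_id INTEGER REFERENCES departments(dept_id))"
)

conn.executemany(
    "INSERT INTO departments VALUES (?, ?)",
    [(1, "Analytics"), (2, "Engineering")],
)
conn.executemany(
    "INSERT INTO employees VALUES (?, ?, ?, ?, ?)",
    [
        (1, "Asha", "2023-01-10", 50000, 1),
        (2, "Ravi", "2023-03-05", 60000, 1),
        (3, "Meena", "2024-02-20", 75000, 2),
    ],
)

# Join the two tables on the related column, then aggregate per department.
summary = conn.execute(
    "SELECT d.dept_name, COUNT(*), AVG(e.salary)"
    " FROM employees AS e"
    " JOIN departments AS d ON e.dept_id = d.dept_id"
    " GROUP BY d.dept_name"
    " ORDER BY d.dept_name"
).fetchall()
print(summary)

# A row that violates the CHECK constraint is rejected by the database.
try:
    conn.execute("INSERT INTO employees VALUES (4, 'Kiran', '2024-05-01', -5, 1)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True
print("negative salary rejected:", rejected)

conn.close()
```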
Machine Learning
• Machine learning is a branch of
artificial intelligence concerned with building
algorithms that learn from data and use what
they learn to make predictions or decisions.
It focuses on models that improve with
experience rather than following explicitly
programmed rules. Machine learning
applications range from predictive
analytics to natural language processing,
computer vision, and autonomous systems.
Linear Regression
Linear regression is a statistical method for
modeling the relationship between a dependent
variable and one or more independent variables. It
helps predict how the dependent variable will
change when the independent variables change.
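As a minimal sketch of the idea above, assuming a single independent variable, the least-squares slope and intercept can be computed directly from the data. The function name fit_line and the hours/scores values are hypothetical, chosen so that the data lies exactly on a straight line.

```python
def fit_line(xs, ys):
    """Ordinary least squares for one independent variable.

    Returns (slope, intercept) of the line y = intercept + slope * x.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (the 1/n factors cancel).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: scores follow 50 + 2 * hours exactly.
hours = [1, 2, 3, 4, 5]
scores = [52, 54, 56, 58, 60]

slope, intercept = fit_line(hours, scores)
print(slope, intercept)  # 2.0 50.0

# Predict the dependent variable for a new value of the independent variable.
predicted = intercept + slope * 6
print(predicted)  # 62.0
```

The slope is the predicted change in the dependent variable for a one-unit change in the independent variable.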
Multiple linear regression is a form of linear
regression that uses two or more independent
variables to predict a dependent variable. It
describes how the dependent variable changes
when several independent variables
change at the same time.
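One way to sketch multiple linear regression without extra libraries is to solve the normal equations (X^T X) b = X^T y. The helper names solve_linear_system and fit_multiple, and the small data set, are assumptions for illustration; in practice a numerical library would normally do this step.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [a | b], copied so the inputs are not modified.
    m = [list(map(float, a[i])) + [float(b[i])] for i in range(n)]
    for col in range(n):
        # Move the row with the largest pivot candidate into place.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_multiple(rows, ys):
    """Least-squares coefficients [intercept, b1, b2, ...]."""
    design = [[1.0] + list(r) for r in rows]  # leading 1 models the intercept
    k = len(design[0])
    xtx = [[sum(row[i] * row[j] for row in design) for j in range(k)]
           for i in range(k)]
    xty = [sum(row[i] * y for row, y in zip(design, ys)) for i in range(k)]
    return solve_linear_system(xtx, xty)


# Hypothetical data generated from y = 10 + 2 * x1 + 3 * x2.
features = [(1, 1), (2, 1), (3, 2), (4, 3), (5, 5)]
targets = [15, 17, 22, 27, 35]

coefficients = fit_multiple(features, targets)
print([round(c, 6) for c in coefficients])
```

Each coefficient after the intercept estimates the change in the dependent variable for a one-unit change in that independent variable while the others are held fixed.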
Logistic Regression
Logistic regression is a statistical model that
estimates the probability of a binary outcome (often
coded as 0 or 1) from one or more predictor
variables.
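The following is a rough sketch of such a model with one predictor, fitted by plain gradient descent on the log loss. The function names, the learning rate, the number of epochs, and the hours/passed data are all assumptions for illustration, not a recommended training setup.

```python
import math


def sigmoid(z):
    """Map any real number to a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(xs, labels, learning_rate=0.1, epochs=10000):
    """Fit p(y = 1 | x) = sigmoid(w * x + b) by batch gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        # Prediction error for every sample under the current parameters.
        errors = [sigmoid(w * x + b) - y for x, y in zip(xs, labels)]
        # Gradients of the average log loss with respect to w and b.
        grad_w = sum(e * x for e, x in zip(errors, xs)) / n
        grad_b = sum(errors) / n
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b


# Hypothetical data: hours studied and whether the student passed (1) or not (0).
hours = [1, 2, 3, 4, 5, 6, 7, 8]
passed = [0, 0, 0, 1, 0, 1, 1, 1]

w, b = fit_logistic(hours, passed)

p_low = sigmoid(w * 1 + b)   # estimated probability of passing after 1 hour
p_high = sigmoid(w * 8 + b)  # estimated probability of passing after 8 hours
print(round(p_low, 3), round(p_high, 3))
```

A probability above 0.5 is commonly read as class 1 and below 0.5 as class 0, although the threshold can be adjusted to suit the application.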
Logistic regression is widely used in
many sectors, including healthcare (predicting
whether a disease is present), marketing (predicting
customer behavior), and finance (predicting loan
defaults), where understanding and predicting
binary outcomes is crucial.
THANK YOU