Bachelor of Technology IN Artificial Intelligence and Data Science
Machine learning
INTERNSHIP REPORT
Submitted by
NAME: KAVIYALAXMI.P REG NO:312621243017
In partial fulfilment for the award of the degree of
BACHELOR OF TECHNOLOGY
IN
ARTIFICIAL INTELLIGENCE AND
DATA SCIENCE
THANGAVELU ENGINEERING
COLLEGE
KARAPAKKAM
CHENNAI 600097
ANNA UNIVERSITY
CHENNAI 600025
OCTOBER 2023
ACKNOWLEDGEMENT
The success and final outcome of learning machine learning required a lot of
guidance and assistance from many people, and I am extremely privileged to
have received this throughout the completion of my course and a few of the projects. All
that I have done is only due to such supervision and assistance, and I would not
forget to thank them.
I am thankful for, and fortunate to have received, constant
encouragement, support and guidance from all the teaching staff, which helped me in
successfully completing my internship and project work.
(Signature of Student)
Name:Kaviyalaxmi.P
Reg no:312621243017
Date:
TABLE OF CONTENTS:
Confirmation letter
Internship Certificate
Platform and Instructor
Introduction
Overview
Day 1: Introduction to Python
A Taste of Machine Learning
Relation to Data Mining.
Relation to Optimization.
Relation to Statistics.
Day 2:Future of Machine Learning.
Technology Learnt.
Introduction to Artificial Intelligence and Machine Learning.
Day 3: Definition of Artificial Intelligence.
Definition of Machine Learning.
Machine Learning Algorithms.
Day 4: Applications of Machine Learning.
Techniques of Machine Learning.
Supervised Learning.
Unsupervised Learning.
Day 5: Semi-supervised Learning.
Reinforcement Learning.
Some Important Considerations in Machine Learning.
Day 6: Data Preprocessing.
Data Preparation.
Day 7: Feature Engineering.
Feature Scaling.
Day 8: Datasets.
Dimensionality Reduction with Principal Component Analysis.
Day 9: Math Refresher.
Concept of Linear Algebra.
Day 10: Eigenvalues, Eigenvectors, and Eigendecomposition.
Introduction to Calculus.
Day 11: Probability and Statistics.
Day 12: Supervised learning
Day 13: Regression.
Linear Regression.
Day 14: Multiple Linear Regression.
Polynomial Regression
Day 15: Decision Tree Regression.
Random Forest Regression.
Day 16: Classification.
Linear Models.
Day 17: Logistic Regression.
Support Vector Machines.
Day 18: Nonlinear Models
K-Nearest Neighbors (KNN).
Kernel Support Vector Machines (SVM).
Day 19: Naive Bayes
Decision Tree Classification.
Conclusion
21 JUNE 2023
RE: Confirmation of Internship
To Ms KAVIYALAXMI.P :
This letter serves as confirmation that GLOBAL TECHNO SOLUTIONS would
like to offer KAVIYALAXMI.P an internship. The company will provide the student
an educational, work-based learning experience directly related to the student's
major field of study, fulfilling the internship training on MACHINE LEARNING
requirement to complete her qualification.
The period of the internship will be from 26 June to 14 July 2023.
Please do not hesitate to contact me should you have any questions or concerns
on +91 44 4203 3422.
Regards,
H.R.Manager
COMPLETED CERTIFICATE
INTRODUCTION
This report provides a summary of my internship experience in the Python
Machine Learning course offered by Global Techno Solutions. The course lasted
for 3 weeks, from June 26, 2023 to July 14, 2023. During this time, I had the
opportunity to learn about aspects of machine learning ranging from the basics of
Python programming to more advanced topics such as data preprocessing and
project work.
LEARNING OBJECTIVES
My primary learning objectives for the internship were to:
Learn about Python programming and its features
Learn the basic concepts of machine learning
Work effectively in a team environment to deliver high-quality software
applications.
LEARNING PATH
INTRODUCTION TO PYTHON
Python is a widely used general-purpose, high level programming language. It was created by
Guido van Rossum in 1991 and further developed by the Python Software Foundation. It was
designed with an emphasis on code readability, and its syntax allows programmers to express
their concepts in fewer lines of code.
Python is a programming language that lets you work quickly and integrate systems more
efficiently.
Python is a computer programming language often used to build websites and software,
automate tasks, and conduct data analysis. Python is a general-purpose language, meaning it
can be used to create a variety of different programs and isn't specialized for any specific
problems.
Python is an interpreted, interactive, object-oriented programming language. It incorporates
modules, exceptions, dynamic typing, very high level dynamic data types, and classes. It
supports multiple programming paradigms beyond object-oriented programming, such as
procedural and functional programming.
Python provides several built-in data types. The most important ones are: strings (text),
numbers (integers and floating point numbers), tuples (simple sequences), lists (more flexible
sequences), and dictionaries (mappings from keys to values).
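The data types listed above can be illustrated with a short sketch. The variable names and values below are made up for illustration and are not taken from the course material.

```python
# String (text)
course = "Machine Learning"

# Numbers: an integer and a floating point number
weeks = 3
hours_per_day = 2.5  # illustrative value, not from the report

# Tuple: a simple sequence that cannot be changed after creation
internship_period = ("26 June 2023", "14 July 2023")

# List: a more flexible sequence that can grow and change
topics = ["Python", "Regression"]
topics.append("Classification")

# Dictionary: maps keys to values
day_to_topic = {1: "Introduction to Python", 13: "Regression"}
day_to_topic[16] = "Classification"

print(course, weeks, hours_per_day)
print(internship_period[0], "to", internship_period[1])
print(topics)
print(day_to_topic[16])
```

Tuples suit fixed records such as a start and end date, while lists and dictionaries suit collections that change as a program runs.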
ADVANTAGES:
• Presence of third-party modules
• Extensive support libraries (NumPy for numerical calculations, Pandas for data analytics, etc.)
• Open source with a large, active community
• Versatile and easy to read, learn and write
• User-friendly data structures
• High-level language
• Dynamically typed language (no need to declare a data type; the type is taken from the
assigned value)
• Object-oriented and procedural programming language
• Portable and interactive
• Ideal for prototypes: provides more functionality with less coding
• Highly efficient (Python's clean object-oriented design provides enhanced process control,
and the language is equipped with excellent text processing and integration capabilities)
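As a small illustration of the dynamic typing mentioned in the list above, the sketch below rebinds one name to values of different types. The helper `type_name` is a hypothetical function written only for this example.

```python
def type_name(value):
    """Return the name of the type Python has inferred for a value."""
    return type(value).__name__

item = 42            # no type declaration; Python treats this as an int
print(type_name(item))

item = "forty-two"   # the same name can now refer to a str
print(type_name(item))

item = [4, 2]        # and later to a list
print(type_name(item))
```

The type belongs to the value rather than to the variable name, which is why no declaration is needed.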
A TASTE OF MACHINE LEARNING
Machine learning is the field of study that gives computers the capability to learn without
being explicitly programmed. ML is one of the most exciting technologies that one could
come across. As is evident from the name, it gives the computer the ability that makes it more
similar to humans: the ability to learn. Machine learning is actively being used today, perhaps
in many more places than one would expect.
Features of machine learning: Machine learning is a data-driven technology. Organizations
generate large amounts of data on a daily basis, and by finding notable relationships in that
data they can make better decisions. A machine can learn from past data and improve
automatically, and from a given dataset it detects various patterns. For big organizations
branding is important, and machine learning makes it easier to target a relevant customer
base. It is similar to data mining because it also deals with huge amounts of data.
Some of the most common examples are:
• Image Recognition
• Speech Recognition
• Recommender Systems
• Fraud Detection
• Self Driving Cars
• Medical Diagnosis
• Stock Market Trading
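To make the idea of "learning from past data" concrete, here is a minimal sketch that fits a straight line to a few past observations and uses it to predict a new value. The function `fit_line` and the numbers are assumptions chosen for illustration; the technique is ordinary least squares for a single input, not a method prescribed by the course.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums around the means
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Made-up "past data": the pattern hidden in it is y = 2x + 1
hours = [1.0, 2.0, 3.0, 4.0]
scores = [3.0, 5.0, 7.0, 9.0]

slope, intercept = fit_line(hours, scores)
prediction = slope * 5.0 + intercept  # prediction for an unseen input
print(slope, intercept, prediction)
```

The rule relating inputs to outputs is never written into the program; it is estimated from the examples, which is the sense in which the machine "learns".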
RELATION TO DATA MINING:
Data mining is the process of extracting useful information from large sets of
data. It involves using various techniques from statistics, machine learning, and
database systems to identify patterns, relationships, and trends in the data. This
information can then be used to make data-driven decisions, solve business
problems, and uncover hidden insights. Applications of data mining include
customer profiling and segmentation, market basket analysis, anomaly detection,
and predictive modeling. Data mining tools and technologies are widely used in
various industries, including finance, healthcare, retail, and telecommunications.
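Market basket analysis, one of the applications named above, can be sketched in a few lines by counting which pairs of items appear together in the same transaction. The function `pair_counts` and the transactions are hypothetical examples, and real data mining tools use more scalable algorithms.

```python
from collections import Counter
from itertools import combinations

def pair_counts(transactions):
    """Count how often each unordered pair of items is bought together."""
    counts = Counter()
    for basket in transactions:
        # sorted(set(...)) removes duplicates and gives each pair one fixed order
        for pair in combinations(sorted(set(basket)), 2):
            counts[pair] += 1
    return counts

# Made-up shopping baskets
transactions = [
    ["bread", "milk"],
    ["bread", "butter", "milk"],
    ["butter", "milk"],
    ["bread", "milk", "eggs"],
]

counts = pair_counts(transactions)
most_common_pair, frequency = counts.most_common(1)[0]
print(most_common_pair, frequency)
```

The pair that occurs most often is a candidate pattern that a shop could use, for example when deciding which products to place together.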
RELATION TO OPTIMIZATION:
Optimization is a collection of mathematical
principles and methods used for solving quantitative problems in many
disciplines,
including physics, biology, engineering, economics, and business. The subject
grew from a realization that quantitative problems in manifestly different
disciplines have important mathematical elements in common. Because of this
commonality, many problems can be formulated and solved by using the unified
set of ideas and methods that make up the field of optimization.
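Many machine learning models are trained by minimizing an error function, which is where optimization enters. The sketch below uses gradient descent, one common optimization method, on the simple function f(x) = (x - 3)^2. The function, starting point, learning rate and step count are all assumptions chosen for illustration.

```python
def gradient_descent(grad, start, learning_rate=0.1, steps=100):
    """Repeatedly step against the gradient to approach a minimum."""
    x = start
    for _ in range(steps):
        x -= learning_rate * grad(x)
    return x

# f(x) = (x - 3)^2 has its minimum at x = 3; its derivative is 2 * (x - 3)
def grad_f(x):
    return 2 * (x - 3)

minimum = gradient_descent(grad_f, start=0.0)
print(minimum)
```

Each step moves x a little in the direction that lowers f, so the result ends up very close to 3 without the answer being supplied in advance.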
RELATION TO STATISTICS:
Statistics is the study of the collection, analysis,
interpretation, presentation, and organization of data. In other words, it is a
mathematical discipline for collecting and summarizing data, and it can also be
described as a branch of applied mathematics. Two important and basic
ideas are involved in statistics: uncertainty and variation. The uncertainty
and variation in different fields can be quantified through statistical
analysis, and this uncertainty is measured using probability, which
plays an important role in statistics.
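The ideas of summarizing data and measuring variation can be shown with Python's built-in `statistics` module. The scores below are made-up sample values, not data from the internship.

```python
import statistics

# Made-up sample of five test scores
scores = [62, 70, 70, 78, 85]

mean = statistics.mean(scores)          # central value of the sample
median = statistics.median(scores)      # middle value when sorted
variance = statistics.variance(scores)  # sample variance (divides by n - 1)
spread = statistics.stdev(scores)       # sample standard deviation

print(mean, median, variance, round(spread, 2))
```

The mean and median summarize the data, while the variance and standard deviation describe how much the values vary around the mean.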
FUTURE OF MACHINE LEARNING:
Machine learning, as we know, is becoming very
popular. It is now becoming a top player in the industry. This field has a lot of
research potential and is among the most active subjects for research papers in
computer science.
Machine learning is a part of the much bigger field of artificial intelligence. There are
many types of AI. Today, various industries and researchers use ML. Machine
learning is growing due to one key factor in particular: computational and
processing power. The more advanced the GPUs and processors become, the
more ML evolves. With better GPUs, larger amounts of data can be processed at
a faster rate. ML has now grown into something remarkable. The impact it has on
the workforce is impressive. It is one of the fastest-growing fields in computer
science. With this pace and growing influence on the market, it has a bright
future.
1. Increased Commercial Applications For "Federated ML"
2. Promising AI Applications Within The Health Sector
3. Hyper-Personalization Within E-Commerce
4. New AI And ML Innovations With NLG
INTRODUCTION TO ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING
Artificial intelligence (AI) is the theory and development of computer systems
capable of performing tasks that historically required human intelligence, such as
recognizing speech, making decisions, and identifying patterns. AI is an umbrella
term that encompasses a wide variety of technologies, including machine
learning, deep learning, and natural language processing (NLP). Although the term
is commonly used to describe a range of different technologies in use today,
many disagree on whether these actually constitute artificial intelligence. Instead,
some argue that much of the technology used in the real world today actually