0% found this document useful (0 votes)

21 views18 pages

School of Engineering and Technology: Data Science"

The document is an internship report submitted by B. Mani Kowshik for the Bachelor of Technology in Computer Science and Engineering at CMR University, detailing his internship experience at Internshala focused on Data Science. It outlines the training modules, including foundational concepts, programming in Python, statistics, and machine learning, emphasizing the significance of data-driven insights in various industries. The report also includes acknowledgments, a declaration of originality, and certification of successful completion of the internship.

Uploaded by

meghanamegu447

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views18 pages

School of Engineering and Technology: Data Science"

Uploaded by

meghanamegu447

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

SCHOOL OF ENGINEERING AND TECHNOLOGY

An Internship Report
On
“DATA SCIENCE”

Submitted in partial fulfillment of the requirements for the award of degree in

Bachelor of Technology
in
Computer Science and Engineering
Of CMR University, Bangalore
Submitted by:
B. Mani Kowshik
21BBTCS056

Internship carried out

at
INTERNSHALA

Internal Guide External Guide:

Dr Parameswaran T Dr KOUSHIK
Associate Professor Asst. Professor
Dept. of CSE, SOET Dept. of CSE,
IIT Bhuvaneshwar

Department of Computer Science and Engineering

Off Hennur - Bagalur Main Road,
Near Kempegowda International Airport, Chagalahatti,
Bangalore, Karnataka-562149

2023-2024
SCHOOL OF ENGINEERING AND TECHNOLOGY
Department of Computer Science and Engineering

CERTIFICATE

This is to certify that the Internship work entitled “Data Science”, submitted to the CMR
University, Bangalore, in partial fulfillment of the requirements for the award of the degree of Bachelor
of Technology in Computer Science and Engineering is a record of work done by B. Mani Kowshik
bearing university register number 21BBTCS056 during the academic year 2023-24 at School of
Engineering and Technology, CMR University, Bangalore under my supervision and guidance. The
Internship report has been approved as it satisfies the academic requirement in respect of internship work
prescribed for the said degree.

Internal Guide External Guide:

Dr Parameswaran T Dr KOUSHIK
Associate Professor Asst. Professor
Dept. of CSE, SOET Dept. of CSE,
IIT Bhuvaneshwar

Signature of the HOD Signature of the Dean

(Dr Rubini P) (Dr N Kannan)

Examiners Signature with date

(1)
Name Signature

(2)
Name Signature
DECLARATION

I, B. Mani Kowshik bearing USN 21BBTCS056, student of Bachelor of Technology,

Computer Science and Engineering, CMR University, Bengaluru, hereby declare that the
internship work entitled “Data Science” submitted by me, for the award of the Bachelor’s degree
in Computer Science and Engineering to CMR University is a record of Bonafide work carried
out independently by me under the supervision and guidance of Dr Parameswaran T,
Associate Professor, CSE Dept., CMR University.

I further declare that the work reported in this internship has not been submitted and will
not be submitted, either in part or in full, for the award of any other degree in this university or
any other institute or University.

Place: Bengaluru B. Mani Kowshik

Date: 21BBTCS056

i
ACKNOWLEDGEMENT

The satisfaction that accompanies the successful completion of this project would be incomplete
without the mention of the people who made it possible, without whose constant guidance and
encouragement would have made efforts go in vain.

I consider myself privileged to express gratitude and respect towards all those who guided me through
the completion of the project. I express my thanks to my Internal Internship Guide Dr KOUSHIK, Asst.
Professor, Department of Computer Science and Engineering, IIT Bhuvaneshwar for his constant
support.

I express my sincere gratitude to my internship external guide Dr Parameswaran T, Associate

Professor, Department of CSE, School of Engineering and Technology, CMR University for his
constant support.

I would like to express my thanks to Dr Rubini P, Professor and Head, Department of CSE, School of
Engineering and Technology, CMR University, Bangalore, for her encouragement that motivated me
for the successful completion of internship work.

I express my heartfelt sincere gratitude to Dr N Kannan, Dean, Lake Side Campus, School of
Engineering and Technology, CMR University, for his support.

I would like to express my sincere thanks and gratitude to our internship coordinator for his support,
invaluable guidance and encouragement throughout the tenure of this internship.

B. Mani Kowshik (21BBTCS056)

ii
Sai Charan Lakum
from CMR UNIVERSITY BANGALORE has successfully completed a 6-week online training on Data Science. The
training consisted of Introduction to Data Science, Python for Data Science, Understanding the Statistics for
Data Science, Predictive Modeling and Basics of Machine Learning, and The Final Project modules.
Sai Charan scored 100% marks in the final assessment and is a top performer in the training.
We wish Sai Charan all the best for future endeavours.

Date of certification: 2023-09-04 Certificate no. : b8scsi1g353

For certificate authentication, please visit https://trainings.internshala.com/verify_certificate

ABSTRACT
INTERNSHALA is a free education internship focused on underrepresented communities that helps
adult learners, high school and university students, and faculty develop valuable new skills and access
career opportunities. The program includes an online platform complemented by customized practical
learning experiences that aim to respond to learners’ needs as they progress through their education
and career journeys. This effort is in collaboration with a network of world-class education partners,
including public high schools, non-profit organizations, governments, and corporations. They also
offer cybersecurity, data analysis, data science, cloud computing, and many other technical
disciplines. The online learning platform also includes courses on workplace skills such as Design
Thinking.
In a variety of industries, data science solutions are being used to prevent fraud and improve security.
In banking, machine learning may be used to create super-accurate predictive maintenance models
that can recognize and prioritize all types of potential fraudulent actions. Businesses can then build a
data-driven queue for high-priority incidents to be investigated. It aids in improving customer
satisfaction by safeguarding their accounts and ensuring that valid transactions are not challenged.
The field of data science encompasses a wide range of techniques and methodologies aimed at
extracting insights and knowledge from complex and often massive datasets. Just as artificial
intelligence leverages computer systems to perform tasks that traditionally required human
intelligence, data science employs various tools and methods to uncover patterns, make predictions,
and generate valuable insights from data. Machine learning plays a crucial role in enabling data science
processes, allowing systems to adapt and improve over time, much like the way humans learn from
experience.

The data science internship journey uncovered these pivotal findings, illuminating the power and
potential of data-driven approaches. From the critical role of data quality to the diversity of machine
learning techniques and the ethical considerations that shape the field, the experience showcased that
data science is a dynamic and impactful discipline with the ability to transform how we perceive,
analyse, and make decisions based on data.

The institute provide this opportunity in order to build different skills which are currently or will be
in the software sector. The skills such as artificial intelligence, machine learning, and deep learning
are the currently in the market which is rapidly growing and in almost everywhere the application is
used. The data science is majorly used to solve problem solving through having wide range of data
and through prediction and analysing. As we have worked together, we got different perspective on
same topic and tried our best to gain the perfect solution.

1
TABLE OF CONTENTS

Chapter Contents Page

No. No.
1 Introduction 4-5

2 Internship Discussion 6-9

3 Conclusion 10-11

4 Bibliography 12

2
LEARNING OBJECTIVES

Foundational Concepts: Understand the fundamental concepts of data science, including data
types, structures, and formats. Comprehend the importance of data quality, cleaning, and
preprocessing.
Learn to explore and visualize data to gain insights and identify patterns.

Programming and Tools: Gain proficiency in programming languages commonly used in

data science, such as Python or R. Familiarize yourself with data manipulation libraries (e.g.,
pandas) and data visualization tools (e.g., Matplotlib, Seaborn) to analyse and present data
effectively.
Learn to use version control systems (e.g., Git) for collaborative projects.

Statistics and Mathematics: Develop a solid understanding of statistical concepts, including

probability, hypothesis testing, and inference.
Learn regression analysis for predictive modelling and linear algebra for data manipulation.
Acquire knowledge of basic probability distributions and statistical techniques.

Basics of Machine Learning: Explore supervised learning techniques, such as regression and
classification, and their applications. Understand unsupervised learning methods, including
clustering and dimensionality reduction.
Gain exposure to ensemble methods and model evaluation techniques.

Ethics and Bias in Data Science: Understand ethical considerations surrounding data privacy,
security, and bias mitigation. Learn to identify and address potential biases in data and models.

Communication and Visualization: Develop the ability to communicate technical findings to

both technical and non-technical stakeholders. Learn data visualization best practices to create
compelling and informative visualizations.

Real-world Applications: Apply data science techniques to real-world problems and datasets
across various domains. Gain experience in collaborating on team projects to solve complex data-
driven challenges.

3
1. INTRODUCTION
1.1 About Internshala

Internshala is an internship and online training platform, based in Gurgaon, India.[1][2] Founded by
Sarvesh Agrawal, an IIT Madras alumnus, in 2011, the website helps students find internships with
Organisations in India.

Our Vision
Our Vision lies to bring in a technology-oriented career-driven Industrial Experience into the
aspirant’s career with Great Value.
Data science entails the utilization of advanced techniques to extract meaningful insights and
predictions from complex datasets. It doesn't seek to replace human decision-making; rather, it
amplifies human judgment by providing valuable information. Data science tools, often referred to
as "analytical services," are sophisticated computational engines designed to process and analyze
data. These tools excel in pattern recognition and predictive modeling, enhancing their predictive
capabilities. Some employ machine learning algorithms, progressively improving predictions with
new data inputs. Others utilize intricate neural network structures, mirroring the human brain's
information processing.

Data science isn't about thinking; it's about intricate calculations. These calculations, fueled by data
analysis, hold immense potential. By uncovering hidden patterns and trends, data science empowers
us to make informed decisions. It might appear as simple predictions, but these insights hold
significant sway over human lives. From healthcare diagnostics to financial forecasting, the impact
of data science extends far beyond calculations – it influences the way we understand and shape the
world around us.

Classical Machine Learning

Classical machine learning is AI systems learned by ingesting data and getting better at recognizing
patterns. The AI systems could predict things like the distance between points or the intensity of
values. Like all machine learning, the classical form depends on algorithms. Recall that algorithms
are mathematical expressions that output a result. Classical machine learning uses a small number of
algorithms in a relatively simple arrangement. Sometimes machine learning algorithms are binary,
which means that they output one of only two values. Typical binary results might be a 1 or a 0, a
YES or a NO, and a TRUE or a FALSE.

Machine learning has evolved into a collection of powerful applications called the deep learning

4
ecosystem. The foundation for many of applications is called a neural network. A neural network
uses electronic circuitry inspired by the way neurons communicate in the human brain. In a neural
network, a building block, called a perceptron, acts as the equivalent of a single neuron. A perceptron
has an input layer, one or more hidden layers, and an output layer. A signal enters the input layer and
the hidden layers run algorithms on the signal. Then, the result is passed to the output layer.

5
2. INTERNSHIP DISCUSSION

Module 1: Introduction to Data Science

Data Science is an interdisciplinary field that focuses on extracting insights from vast amounts of
structured and unstructured data. It involves a combination of statistical methods, computer science,
and domain expertise to analyze and interpret data trends effectively. The first step in Data Science
is data collection, which involves gathering data from multiple sources, such as databases, APIs, or
web scraping techniques. This is followed by preprocessing, cleaning, and transforming raw data into
meaningful and structured information.

Once data is cleaned, exploratory data analysis (EDA) techniques are used to understand patterns,
detect outliers, and summarize key characteristics of the dataset. Visualization techniques such as
histograms, scatter plots, and box plots are employed to better understand distributions and
relationships among variables. Predictive modeling and machine learning play a crucial role in Data
Science, where algorithms such as regression analysis, classification, and clustering are applied to
derive meaningful insights from data.

Python and R are the most commonly used programming languages in data science, offering a wide
range of libraries for data manipulation, visualization, and predictive modeling. Data Science
applications are vast, ranging from healthcare and finance to marketing, social media analysis, and
urban planning. Organizations leverage data science to improve efficiency, optimize operations, and
make informed decisions based on real-time data. This module provided an overview of fundamental
concepts, illustrating how data-driven insights power modern industries.

The increasing reliance on data in modern industries has made data science an essential field for
technological advancement and innovation. Companies use data science techniques to enhance
business intelligence, optimize decision-making, and drive competitive strategies. The ability to
harness and analyze large-scale data enables organizations to predict future trends, mitigate risks, and
improve overall efficiency. This module laid the foundation for understanding the significance of data
science and its impact across various domains.

Module 2: Python for Data Science

Python is a versatile and widely used programming language in the field of data science due to its
simplicity, flexibility, and powerful libraries. This module introduced the foundational elements of
Python programming, including variables, data types, operators, and control structures such as loops

6
and conditional statements. Additionally, it covered functions, modules, and file handling, which are
essential for developing scalable and modular code.

A significant portion of the module was dedicated to data manipulation using Pandas, a powerful
library that allows for easy handling and processing of structured datasets. Dataframes and series
were explored, along with essential operations like filtering, merging, and aggregation. NumPy,
another crucial library, was introduced for numerical computing, including array operations,
mathematical functions, and linear algebra.

Visualization techniques were explored using Matplotlib and Seaborn, enabling graphical
representation of data for better interpretation. The module also introduced Scikit-learn for machine
learning, covering topics such as supervised and unsupervised learning, model training, and
evaluation. Understanding Python’s role in data science is essential for processing large datasets
efficiently and building predictive models. This module provided hands-on experience in writing
Python programs, handling data, and implementing machine learning models to solve real-world
problems.

Python's adaptability and ease of use have contributed to its widespread adoption in data science and
analytics. Its rich ecosystem of libraries and frameworks has made it an indispensable tool for data
professionals, researchers, and developers. With increasing demand for data-driven solutions,
proficiency in Python has become a critical skill in the industry. This module provided a strong
foundation for understanding how Python supports data manipulation, visualization, and machine
learning applications.

Module 3: Understanding Statistics for Data Science

Statistics is the backbone of data science, providing essential tools to analyze, interpret, and make
inferences from data. This module covered the fundamental concepts of descriptive and inferential
statistics, highlighting their importance in understanding data patterns and trends. Descriptive
statistics focused on measures of central tendency (mean, median, mode) and dispersion (range,
variance, standard deviation, and interquartile range) to summarize and describe datasets.

The module then introduced probability theory, explaining concepts such as probability distributions,
random variables, and expected values. Common probability distributions, including normal,
binomial, and Poisson distributions, were explored in depth. Hypothesis testing, an integral part of
inferential statistics, was discussed to determine the statistical significance of observed data.
Concepts such as confidence intervals, p-values, and t-tests were introduced to help evaluate
hypotheses and draw meaningful conclusions from sample data.
7
Additionally, data visualization techniques such as histograms, box plots, and scatter plots were
covered to analyze distributions and relationships between variables. The module emphasized the
importance of statistical methods in data-driven decision-making, helping businesses and researchers
gain insights from data. Understanding these statistical techniques is crucial for model evaluation,
risk assessment, and predictive analytics in real-world applications.

As businesses and organizations increasingly rely on data-driven strategies, a solid understanding of

statistics has become essential for data professionals. Statistical methods provide the foundation for
making data-driven decisions, ensuring accuracy, reliability, and meaningful insights. The knowledge
gained in this module is vital for performing effective data analysis, interpreting results, and applying
statistical models to real-world scenarios.

Module 4: Introduction to Machine Learning

Machine Learning (ML) is a subset of artificial intelligence that enables systems to learn from data
and improve their performance over time without explicit programming. This module provided an
introduction to key machine learning concepts and different learning paradigms, including
supervised, unsupervised, and reinforcement learning.

Supervised learning was explored in detail, covering algorithms such as linear regression, logistic
regression, decision trees, support vector machines, and ensemble methods like random forests. These
algorithms are widely used in applications such as fraud detection, recommendation systems, and
customer segmentation. Unsupervised learning techniques, including K-means clustering,
hierarchical clustering, and principal component analysis (PCA), were introduced to identify patterns
and group similar data points without labeled outputs.

The module also covered reinforcement learning, where an agent interacts with an environment and
learns optimal strategies through rewards and penalties. This approach is used in robotics, game
playing, and autonomous decision-making.

Additionally, the module focused on the essential steps in building a machine learning model, such
as data preprocessing, feature selection, model training, hyperparameter tuning, and evaluation using
metrics like accuracy, precision, recall, and F1-score. Machine learning has transformed industries
by automating decision-making processes, enhancing predictive capabilities, and optimizing business
operations. This module provided hands-on experience in implementing ML models using Python
libraries such as Scikit-learn and TensorFlow. Machine learning has revolutionized numerous fields,
from healthcare and finance to marketing and automation. Its ability to process vast amounts of data
and identify patterns has led to advancements in artificial intelligence and predictive analytics. This
8
module laid the foundation for understanding machine learning techniques and their applications in
solving real-world problems.

Module 5: The Final Project

The final project allowed learners to apply all the skills acquired throughout the course to solve a
real-world business problem. The case study focused on a retail banking institution aiming to improve
its marketing strategies for selling term deposits.

The project involved extensive data preprocessing, including handling missing values, outlier
detection, and feature engineering to prepare the dataset for machine learning models. Exploratory
data analysis was conducted to identify key trends and insights, helping define the most relevant
features for predictive modeling.

Various machine learning algorithms, such as logistic regression, decision trees, random forests, and
support vector machines, were implemented to predict customer behavior and optimize marketing
outreach. The models were evaluated using performance metrics like confusion matrix, ROC curves,
and classification reports.

The project also emphasized model deployment, demonstrating how trained models can be integrated
into business applications for real-time decision-making. The hands-on nature of this project provided
practical experience in applying data science methodologies to real-world problems, reinforcing the
importance of data-driven decision-making in today’s competitive business landscape.

Working on a practical project provided valuable experience in tackling real-world data challenges.
The hands-on approach ensured a deep understanding of data science workflows, reinforcing critical
problem-solving skills. The project served as a bridge between theoretical concepts and industry
applications, preparing learners for future opportunities in data science and analytics.

Furthermore, this project highlighted the importance of iterative model improvement and
optimization. Hyperparameter tuning and feature selection were performed to enhance model
accuracy and efficiency. Additionally, the integration of advanced techniques such as ensemble
learning and cross-validation helped ensure robust and reliable predictions. Beyond technical
execution, this project underscored the significance of effective communication of data insights.
Visualizations, dashboards, and detailed reports were created to present findings in a clear and
actionable manner, ensuring that stakeholders could make informed decisions based on data-driven
insights. This comprehensive experience equipped learners with the necessary skills to tackle industry
challenges and develop scalable, real-world data science solutions.

9
3. CONCLUSION
In conclusion, data science stands as a transformative discipline that has reshaped the landscape of
decision-making, innovation, and problem-solving across various industries. Through the strategic
collection, analysis, and interpretation of data, data science empowers organizations to extract
valuable insights, make informed decisions, and drive positive outcomes. The journey from raw data
to actionable insights involves a series of intricate steps, including data collection, preprocessing,
feature engineering, modeling, and interpretation.

Data science has proven its significance in addressing complex challenges, from predictive analytics
and customer segmentation to medical diagnosis and fraud detection. Its ability to uncover hidden
patterns, trends, and correlations within data has led to advancements in artificial intelligence,
machine learning, and deep learning. Furthermore, data science serves as a bridge between
technology and human understanding, enabling individuals to comprehend complex phenomena and
make data-driven choices.

However, the realm of data science is not without its challenges. From data quality and preprocessing
complexities to ethical considerations and model interpretability, navigating these obstacles demands
a holistic approach that combines technical expertise, domain knowledge, and effective
communication. Despite these challenges, data science's impact is undeniable, contributing to
enhanced efficiency, innovation, and competitive advantage.

As data continues to grow exponentially and technology evolves, the future of data science holds the
promise of unlocking even deeper insights and pushing the boundaries of what is achievable. With
its ability to transform raw data into meaningful knowledge, data science remains a cornerstone of
modern decision-making, shaping a world where information is not just a resource but a powerful
tool for progress. The continuous advancements in computational power, algorithmic development,
and data availability will further enhance the capabilities of data science. Organizations and
professionals must stay adaptive and proactive in harnessing these advancements to drive further
innovation and societal progress.

Moreover, as industries continue to integrate AI-driven solutions, data science will play an even more
critical role in automation, predictive analytics, and personalized experiences. The emergence of real-
time data processing and edge computing will enable faster and more efficient decision-making,
reducing latency and improving responsiveness in critical applications. Ethical AI and responsible
data usage will also become central themes, prompting professionals to address biases, ensure data
10
privacy, and build transparent machine learning models. With an increasing emphasis on
explainability and fairness, the future of data science will not only be about innovation but also about
fostering trust and accountability in AI-driven decision systems. As a result, interdisciplinary
collaboration between data scientists, ethicists, policymakers, and industry leaders will be essential
to shaping a responsible and forward-looking data science ecosystem.

11
BIBILOGRAPHY

1. Casella G, Berger RL. Statistical inference. Second edition. Delhi: : Cengage

Learning 2017.
2. Abu-Mostafa YS, Magdon-Ismail M, Lin H-T. Learning from data: a short
course. [United States]: : AMLBook.com 2012.

3. Grimmett G, Stirzaker D. Probability and random processes. Third edition.

Oxford: : Oxford University Press 2001.
4. Mood AM, Graybill FA, Boes DC. Introduction to the theory of statistics. Third
edition. [Auckland?]: : McGraw-Hill Book Company 1974.

5. Hastie T, Tibshirani R, Friedman JH. The elements of statistical learning:

data mining, inference, and prediction. 2nd ed. New York: : Springer 2009.

Internshala Summer Training Report On Data Science
77% (22)
Internshala Summer Training Report On Data Science
70 pages
Internship Report 40 Pages
No ratings yet
Internship Report 40 Pages
40 pages
Altair Data Science Internship
No ratings yet
Altair Data Science Internship
48 pages
Database Notes
100% (1)
Database Notes
38 pages
Internship Report
No ratings yet
Internship Report
61 pages
Data Science Training Report.
100% (1)
Data Science Training Report.
73 pages
FactoryTalk View SE - Adding The Data Server Name and Tag Address To The Tag Address Syntax For Third Party OPC Servers
No ratings yet
FactoryTalk View SE - Adding The Data Server Name and Tag Address To The Tag Address Syntax For Third Party OPC Servers
7 pages
Sindhu Internship Report
No ratings yet
Sindhu Internship Report
38 pages
Building Modern GUIs with tkinter and Python: Building user-friendly GUI applications with ease (English Edition)
From Everand
Building Modern GUIs with tkinter and Python: Building user-friendly GUI applications with ease (English Edition)
Saurabh Chandrakar
No ratings yet
Data Science-Logbook
No ratings yet
Data Science-Logbook
101 pages
Bandit Torrism 1-3
100% (1)
Bandit Torrism 1-3
50 pages
Kickstart Quantum Computing and Communication Fundamentals: Master Quantum Computing Principles, Unlock Cutting-Edge Communication Protocols, and Build Future-Ready Solutions with Quantum Algorithms
From Everand
Kickstart Quantum Computing and Communication Fundamentals: Master Quantum Computing Principles, Unlock Cutting-Edge Communication Protocols, and Build Future-Ready Solutions with Quantum Algorithms
Paras Nath
No ratings yet
Kickstart Quantum Computing and Communication Fundamentals: Master Quantum Computing Principles, Unlock Cutting-Edge Communication Protocols, and Build Future-Ready Solutions with Quantum Algorithms (English Edition)
From Everand
Kickstart Quantum Computing and Communication Fundamentals: Master Quantum Computing Principles, Unlock Cutting-Edge Communication Protocols, and Build Future-Ready Solutions with Quantum Algorithms (English Edition)
Paras Nath Barwal
No ratings yet
Data Science Report - Compress
No ratings yet
Data Science Report - Compress
31 pages
Galileo!!! Ticketing Traning!!!!!!!!!!!!!!! Learn Free!!!!!!!!! - Air Ticketing (GDS)
100% (1)
Galileo!!! Ticketing Traning!!!!!!!!!!!!!!! Learn Free!!!!!!!!! - Air Ticketing (GDS)
9 pages
BUAN6320 - Chapter - 1-4 & 9
No ratings yet
BUAN6320 - Chapter - 1-4 & 9
191 pages
HFS+ File System Format Reference Sheet: HFS+ Data Is Big Endian GPT Is Li2le Endian
No ratings yet
HFS+ File System Format Reference Sheet: HFS+ Data Is Big Endian GPT Is Li2le Endian
2 pages
E.venkatasai Ir
No ratings yet
E.venkatasai Ir
204 pages
Lesson 4. Spatial Data Input and Editing
No ratings yet
Lesson 4. Spatial Data Input and Editing
9 pages
Dbms Questions Addon
No ratings yet
Dbms Questions Addon
7 pages
Internship Report
No ratings yet
Internship Report
64 pages
Data Science & Machine Learning: Prajapati Dipkumar Ramabhai
No ratings yet
Data Science & Machine Learning: Prajapati Dipkumar Ramabhai
53 pages
Virtual Lifelong Learning: Educating Society with Modern Communication Technologies
From Everand
Virtual Lifelong Learning: Educating Society with Modern Communication Technologies
Neha
No ratings yet
DataScience Internship
No ratings yet
DataScience Internship
87 pages
Internshala Summer Training Report On Data Science
No ratings yet
Internshala Summer Training Report On Data Science
70 pages
Security Part II: Auditing Database Systems: IT Auditing, Hall, 4e
No ratings yet
Security Part II: Auditing Database Systems: IT Auditing, Hall, 4e
37 pages
My Internship Document
No ratings yet
My Internship Document
41 pages
English Sample Exam Exin CCC BDF 201606 PDF
No ratings yet
English Sample Exam Exin CCC BDF 201606 PDF
28 pages
Coding Invaders DA
No ratings yet
Coding Invaders DA
31 pages
Project Report
No ratings yet
Project Report
58 pages
C0 Report
No ratings yet
C0 Report
50 pages
Data Mining (Module-1)
No ratings yet
Data Mining (Module-1)
14 pages
Akshay Final Internship Report
No ratings yet
Akshay Final Internship Report
64 pages
CHM Project Report
No ratings yet
CHM Project Report
23 pages
Vignesh's Documentation
No ratings yet
Vignesh's Documentation
59 pages
Data
No ratings yet
Data
36 pages
Exploring-Strategies Final
No ratings yet
Exploring-Strategies Final
24 pages
Data Science Intern Report Meena
No ratings yet
Data Science Intern Report Meena
24 pages
Skill Report
No ratings yet
Skill Report
36 pages
Data Science Intern Report Sheena
No ratings yet
Data Science Intern Report Sheena
24 pages
Data Valley 21VV1A0510
No ratings yet
Data Valley 21VV1A0510
85 pages
Internship Surekha
No ratings yet
Internship Surekha
47 pages
Internship Report
No ratings yet
Internship Report
73 pages
Rescue 1
No ratings yet
Rescue 1
26 pages
Data Science Report
No ratings yet
Data Science Report
46 pages
Data Science Intern
No ratings yet
Data Science Intern
19 pages
Training Report On Data Sciencep
No ratings yet
Training Report On Data Sciencep
80 pages
Sameer111 PDF
No ratings yet
Sameer111 PDF
20 pages
Seminar Report Maddu Ravindra 19103335 - Ravindra Babu
No ratings yet
Seminar Report Maddu Ravindra 19103335 - Ravindra Babu
21 pages
Project Report Guidelines
No ratings yet
Project Report Guidelines
20 pages
Manoj Intern Data Science
No ratings yet
Manoj Intern Data Science
37 pages
Ict Notes
No ratings yet
Ict Notes
17 pages
21.6 Conclusions: 488 M. Vasconcellos
No ratings yet
21.6 Conclusions: 488 M. Vasconcellos
10 pages
Increasing The Value of Quality Management Systems: Ida Gremyr
No ratings yet
Increasing The Value of Quality Management Systems: Ida Gremyr
14 pages
Sushil 7th (1 PDF
No ratings yet
Sushil 7th (1 PDF
29 pages
Internship (CS015) Report - ANGELINA MATHEWS
No ratings yet
Internship (CS015) Report - ANGELINA MATHEWS
26 pages
Ap Internship Last
No ratings yet
Ap Internship Last
30 pages
Rishisathrughnadata
No ratings yet
Rishisathrughnadata
15 pages
Internship
No ratings yet
Internship
28 pages
Project File For Internship Report
No ratings yet
Project File For Internship Report
17 pages
It Report
No ratings yet
It Report
24 pages
Avinash PDF
No ratings yet
Avinash PDF
23 pages
PDF 1
No ratings yet
PDF 1
20 pages
Fazli Bipin
No ratings yet
Fazli Bipin
24 pages
Data Science Report
No ratings yet
Data Science Report
32 pages
Data Science Report
No ratings yet
Data Science Report
32 pages
Ss2 Ict Second Term Exam 24
No ratings yet
Ss2 Ict Second Term Exam 24
5 pages
Internship Progress Report Template PG
No ratings yet
Internship Progress Report Template PG
14 pages
FINAL INTERN DOCUMENT Dhanunjai
No ratings yet
FINAL INTERN DOCUMENT Dhanunjai
26 pages
Godavari Engg College 24-25 Internship Report
No ratings yet
Godavari Engg College 24-25 Internship Report
19 pages
Final Industrial Report
No ratings yet
Final Industrial Report
34 pages
FOC Project
No ratings yet
FOC Project
41 pages
Neural Networks 16 Mark Answers
No ratings yet
Neural Networks 16 Mark Answers
13 pages
Harsh Synopsis
No ratings yet
Harsh Synopsis
21 pages
Data Science Report
No ratings yet
Data Science Report
32 pages
Data Science
No ratings yet
Data Science
11 pages
Mid Course Summative Assessment - Data Vizualization Tools
No ratings yet
Mid Course Summative Assessment - Data Vizualization Tools
4 pages
Napo
No ratings yet
Napo
21 pages
Ayush Cse Synopsis2
No ratings yet
Ayush Cse Synopsis2
11 pages
Big Data Driven Marketing
No ratings yet
Big Data Driven Marketing
5 pages
Array 1 - 2024 2
No ratings yet
Array 1 - 2024 2
15 pages
Internship Report
No ratings yet
Internship Report
9 pages
Kibana Fa
No ratings yet
Kibana Fa
4 pages
HANA Configuration Overview
No ratings yet
HANA Configuration Overview
15 pages
Data Science Approach To Stock Prices Forecasting
No ratings yet
Data Science Approach To Stock Prices Forecasting
10 pages
Chapter III Continued....
No ratings yet
Chapter III Continued....
7 pages
Chandan MS SOP V2
No ratings yet
Chandan MS SOP V2
3 pages
Shazam For Dummies: A Step-By-Step Guide To Using Shazam
No ratings yet
Shazam For Dummies: A Step-By-Step Guide To Using Shazam
7 pages
Literacy and Reward: Teachers' Effort To Build Children Reading Habit
No ratings yet
Literacy and Reward: Teachers' Effort To Build Children Reading Habit
7 pages
Workshop Practice Manual
From Everand
Workshop Practice Manual
Jatinder Madan
No ratings yet

School of Engineering and Technology: Data Science"

Uploaded by

School of Engineering and Technology: Data Science"

Uploaded by

SCHOOL OF ENGINEERING AND TECHNOLOGY

Submitted in partial fulfillment of the requirements for the award of degree in

Internship carried out

Internal Guide External Guide:

Department of Computer Science and Engineering

Internal Guide External Guide:

Signature of the HOD Signature of the Dean

Examiners Signature with date

I, B. Mani Kowshik bearing USN 21BBTCS056, student of Bachelor of Technology,

Place: Bengaluru B. Mani Kowshik

I express my sincere gratitude to my internship external guide Dr Parameswaran T, Associate

B. Mani Kowshik (21BBTCS056)

Date of certification: 2023-09-04 Certificate no. : b8scsi1g353

For certificate authentication, please visit https://trainings.internshala.com/verify_certificate

Chapter Contents Page

2 Internship Discussion 6-9

Programming and Tools: Gain proficiency in programming languages commonly used in

Statistics and Mathematics: Develop a solid understanding of statistical concepts, including

Communication and Visualization: Develop the ability to communicate technical findings to

Classical Machine Learning

Module 1: Introduction to Data Science

Module 2: Python for Data Science

Module 3: Understanding Statistics for Data Science

As businesses and organizations increasingly rely on data-driven strategies, a solid understanding of

Module 4: Introduction to Machine Learning

Module 5: The Final Project

1. Casella G, Berger RL. Statistical inference. Second edition. Delhi: : Cengage

3. Grimmett G, Stirzaker D. Probability and random processes. Third edition.

5. Hastie T, Tibshirani R, Friedman JH. The elements of statistical learning:

You might also like