Sachin
Department of IT
Certificate
This is to certify that the work presented in the practical training
seminar report, for practical training taken at “Celebal Technologies”
and entitled “Python Development”, submitted by Mr. Sachin Choudhary,
a student of fourth year (VII Sem) B.Tech. in AI&DS, in partial
fulfilment for the award of the degree of Bachelor of Technology, is a
record of the student’s work carried out and found satisfactory for
submission.
Candidate’s Declaration
I hereby declare that the work, which is being presented in the Industrial
Training report, entitled “Python Developer”, in partial fulfillment for the
award of the Degree of “Bachelor of Technology” in the Department of
Information Technology, with specialization in Information Technology, and
submitted to the Department of Computer Science & Engineering, Arya
College of Engineering, is a record of my own investigations carried out under
the guidance of Mr. Ankur Dutt, Assistant Professor, Department of
Computer Science & Engineering.
(Signature Candidate)
Candidate Name
Sachin Choudhary
Roll No: 21EAIAD036
Abstract
During my internship at Celebal Technologies, I gained extensive, hands-on experience in Python
development, focusing on building scalable, efficient, and user-centric applications. This
internship provided a deep understanding of Python's versatile frameworks and libraries, enabling
me to solve real-world problems with robust, maintainable code.
Another focus area was data processing and analysis. I worked on projects that required
manipulating large datasets using libraries such as Pandas and NumPy, and I employed Matplotlib
and Seaborn for data visualization. These tasks enhanced my ability to extract insights from data,
streamline workflows, and optimize application performance.
During this internship, I also explored automation and scripting. I developed Python scripts to
automate repetitive tasks, such as data entry and report generation, improving efficiency and
accuracy.
Additionally, I gained experience in web scraping using BeautifulSoup and Scrapy to collect and
process data from external sources.
Collaboration played a central role in my internship. I worked closely with other developers in an
Agile environment, participating in sprint planning, code reviews, and team discussions. This
experience honed my ability to work in cross-functional teams, communicate effectively, and
adapt to dynamic project requirements.
This internship solidified my understanding of Python development, equipping me with the skills
to manage full-stack development, tackle complex challenges, and contribute effectively to diverse
development projects in the future.
Acknowledgement
I would like to thank the Department of Computer Science & Engineering, Arya
College of Engineering, and the Head of Department, Mrs. Shilpi Mishra ma'am, for
providing us the opportunity to undertake this training, which gave us exposure to
competing and performing with students from other colleges and universities.
I would also like to thank all the teaching assistants at Celebal Technologies;
they have been very helpful throughout the process, both in solving our doubts and
in motivating us to complete our tasks and assignments and helping us learn.
I would also like to express my deepest appreciation for Mr. Vimal Daga for guiding
me throughout the training and all the people who have directly or indirectly helped
me to successfully complete the training.
Sachin Choudhary
21EAIAD036
Learning/Internship Objectives
• Internships are generally thought of as being reserved for college students looking to
gain experience in a particular field. However, a wide array of people can benefit
from training internships in order to gain real-world experience and develop their
skills.
• An objective for this position should emphasize the skills you already possess in the
area and your interest in learning more.
• Some internships are used to allow individuals to perform scientific research while
others are specifically designed to allow people to gain first-hand experience
working.
• Utilizing internships is a great way to build your resume and develop skills that can
be emphasized in your resume for future jobs. When you are applying for a Training
Internship, make sure to highlight any special skills or talents that can make you
stand apart from the rest of the applicants so that you have an improved chance of
landing the position.
TABLE OF CONTENTS
6.6 Types of Machine Learning
Chapter 8: Conclusion
References
Chapter 1: INTRODUCTION
Machine Learning, at its core, represents the culmination of computational prowess and
statistical ingenuity, empowering computers to learn patterns and make predictions without
explicit programming. Artificial Intelligence, on the other hand, aspires to endow machines
with the cognitive abilities to mimic human intelligence, from problem-solving to
understanding natural language.
As we delve into the heart of this report, we will navigate the practical applications reshaping
industries – from healthcare diagnostics and financial predictions to the intuitive
technologies that power our smartphones. Yet, as with any technological leap, ethical
considerations loom large. The report will scrutinize the ethical dimensions of ML and AI,
shedding light on the challenges of bias, accountability, and transparency in automated
decision-making.
In essence, this report serves as a compass in the uncharted territory of Machine Learning and Artificial
Intelligence. It is an invitation to traverse the technological frontiers, where algorithms hold the promise
of innovation, but also beckon a thoughtful examination of the societal impact and ethical responsibilities
accompanying this transformative journey. Join us in the tapestry that is reshaping the future of int
Chapter 2: HISTORY
The narrative of Machine Learning (ML) and Artificial Intelligence (AI) unfolds as a
captivating odyssey through time, marked by milestones that have shaped the very fabric
of technology and human-machine interaction.
1. Roots of AI (1940s-1950s):
The seeds of AI were sown in the 1940s and 1950s, when pioneers like Alan Turing laid
the theoretical groundwork. Turing's seminal work on the Turing Test and his proposal
of a universal machine set the stage for the quest to create intelligent machines.
Chapter 3: DEFINITION
The architecture of Machine Learning (ML) and Artificial Intelligence (AI) systems varies
with the specific task, the application, and the underlying algorithms. However, most
systems share the following typical components:
1. Data Collection:
- Raw Data: The process begins with the collection of raw data relevant to the task at hand.
This data could come from various sources, including sensors, databases, or external
datasets.
2. Data Pre-processing:
- Cleaning: Raw data often contains noise, missing values, or outliers. Data cleaning
involves handling these issues to ensure high-quality input for the model.
- Feature Engineering: In some cases, new features are created or existing ones are
transformed to enhance the model's ability to learn patterns.
3. Feature Selection:
- Choosing Relevant Features: Not all features are equally informative. Feature selection
involves identifying and using the most relevant features for the task.
4. Model Selection:
5. Model Training:
- Learning from Data: The selected model is trained using the pre-processed data. During
training, the model adjusts its internal parameters to minimize the difference between its
predictions and the actual outcomes.
6. Evaluation:
- Performance Metrics: The trained model is evaluated using metrics relevant to the task
(accuracy, precision, recall, F1 score, etc.). This step helps assess how well the model
generalizes to new, unseen data.
7. Deployment:
- Integration with Applications: Once a satisfactory model is obtained, it is integrated into the
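The stages listed above can be sketched end to end in a few lines. This is a minimal illustration, not the setup used during the training: it assumes scikit-learn is installed, uses synthetic data from `make_classification` in place of a real data source, and picks logistic regression, median imputation, and `k=4` selected features purely as example choices.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# 1. Data collection: synthetic data stands in for sensors, databases, etc.
X, y = make_classification(n_samples=500, n_features=10, n_informative=4, random_state=0)

# Simulate the missing values that raw data often contains (about 5% of cells).
rng = np.random.default_rng(0)
X[rng.random(X.shape) < 0.05] = np.nan

# Hold out unseen data for the evaluation step.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# 2-5. Pre-processing, feature selection, model selection, and training.
pipeline = Pipeline([
    ("impute", SimpleImputer(strategy="median")),        # cleaning: fill missing values
    ("scale", StandardScaler()),                         # put features on a common scale
    ("select", SelectKBest(score_func=f_classif, k=4)),  # keep the 4 most relevant features
    ("model", LogisticRegression(max_iter=1000)),        # the (assumed) chosen model
])
pipeline.fit(X_train, y_train)

# 6. Evaluation on data the model has not seen.
y_pred = pipeline.predict(X_test)
metrics = {
    "accuracy": accuracy_score(y_test, y_pred),
    "precision": precision_score(y_test, y_pred),
    "recall": recall_score(y_test, y_pred),
    "f1": f1_score(y_test, y_pred),
}
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Wrapping the steps in a single `Pipeline` means the imputation, scaling, and feature selection are fitted on the training split only, so no information from the test split leaks into training.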
Chapter 5: APPLICATIONS
Machine Learning (ML) and Artificial Intelligence (AI) have found diverse applications across
various industries, transforming the way tasks are performed and decisions are made. Here are
some notable applications:
1. Healthcare:
- Disease Prediction and Diagnosis: ML algorithms analyze patient data to predict and
diagnose diseases, aiding healthcare professionals in early detection.
- Drug Discovery: AI accelerates the drug discovery process by predicting potential drug
candidates and optimizing molecular structures.
2. Finance:
- Credit Scoring: ML models assess creditworthiness based on historical financial data,
improving accuracy in lending decisions.
- Algorithmic Trading: AI is used for developing trading algorithms that analyse market
trends and execute trades at optimal times.
3. Retail:
- Recommendation Systems: ML powers personalized recommendations for products or
services based on customer preferences and behaviour.
- Inventory Management: AI optimizes inventory levels by predicting demand, reducing
waste, and ensuring product availability.
4. Marketing:
- Customer Segmentation: ML clusters customers based on behaviour, enabling targeted
marketing campaigns.
- Predictive Analytics: AI predicts market trends and customer preferences, informing
strategic marketing decisions.
5. Autonomous Vehicles:
- Computer Vision: AI processes visual data to enable object detection, lane tracking, and
obstacle avoidance in autonomous vehicles.
- Natural Language Processing: AI assists in voice-activated controls and communication
systems within vehicles.
6. Manufacturing:
- Predictive Maintenance: ML analyses equipment data to predict when machinery is
likely to fail, allowing for proactive maintenance.
- Quality Control: AI-powered vision systems inspect and identify defects in
manufacturing processes.
7. Education:
- Personalized Learning: ML tailors educational content to individual student needs, optimizing
the learning experience.
- Automated Grading: AI automates the grading process for assignments and exams.
8. Cybersecurity:
- Anomaly Detection: ML identifies unusual patterns and behaviours in network traffic,
helping detect potential cyber threats.
- Fraud Detection: AI algorithms analyse transaction patterns to identify and prevent
fraudulent activities.
These applications showcase the versatility and impact of ML and AI in solving complex
problems and improving efficiency across numerous domains. The ongoing advancements
in these fields continue to open new possibilities and opportunities for innovation.
Chapter 6: INTRODUCTION TO MACHINE LEARNING
Think of machine learning as a friendly robot. This robot wants to learn how to do
something new, like recognizing different types of fruits. Instead of telling the robot exactly
what each fruit looks like, you show it a bunch of fruits, like apples, bananas, and oranges.
Now, the robot doesn't know anything about fruits at first, but it's really good at learning. So,
you start showing it lots and lots of fruits, and each time you show it one, you say, "This is
an apple" or "This is a banana."
After seeing many fruits and hearing what they are, the robot starts to notice patterns by
itself. It figures out that apples are usually red or green, bananas are yellow, and oranges are
orange. The robot has learned from all the examples you gave it.
Now, when you show the robot a new fruit it has never seen before, it can make a good
guess about what it is based on the patterns it learned. That's like the robot saying, "Hmm,
this new fruit is red and looks like what I learned about apples, so I think it's an apple!"
That's kind of how machine learning works. It's like teaching computers to learn from
examples and make smart decisions on their own. Just like how you learn to recognize things
by seeing and experiencing them, machine learning helps computers learn from lots of
examples so they can do tasks or solve problems without being explicitly told what to do.
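The fruit analogy can be turned into a tiny program. The sketch below is only an illustration: the two colour scores per fruit and the name `guess_fruit` are hypothetical, and the "learning" is the simplest possible kind (remember the labelled examples and pick the label of the most similar one, a 1-nearest-neighbour rule).

```python
from math import dist

# Hypothetical labelled examples: (redness, yellowness) scores in [0, 1] and a name.
examples = [
    ((0.9, 0.1), "apple"),
    ((0.8, 0.2), "apple"),
    ((0.1, 0.9), "banana"),
    ((0.2, 0.8), "banana"),
    ((0.7, 0.6), "orange"),
    ((0.8, 0.5), "orange"),
]

def guess_fruit(features):
    """Return the label of the stored example closest to the new fruit."""
    _, label = min(examples, key=lambda example: dist(example[0], features))
    return label

# A new, unseen fruit that is very red and barely yellow.
print(guess_fruit((0.88, 0.12)))  # prints "apple"
```

Nobody wrote a rule such as "red means apple"; the guess comes entirely from the examples the program was shown.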
1. Download Python:
Visit the official Python website (https://www.python.org/downloads/) and download the
latest version of Python for your operating system (Windows, macOS, or Linux).
2. Install Python:
Follow the installation instructions provided on the Python website. Make sure to check the
box that says "Add Python to PATH" during the installation.
3. Verify Installation:
Open a command prompt or terminal and type:
4. Scikit-Learn: Scikit-Learn provides simple and efficient tools for data mining and data
analysis.
If there are no errors, your Python setup for machine learning is ready to go!
Remember, you can install additional libraries as needed for specific projects. This guide
provides a basic setup for general machine learning tasks.
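One way to confirm that the setup works is to ask Python which of the libraries it can find. The script below is a hedged sketch using only the standard library; the list `REQUIRED` and the function name `check_environment` are illustrative choices, and the list simply reflects the libraries discussed in this chapter.

```python
import sys
from importlib.util import find_spec

# Import names of the libraries used in this chapter
# (scikit-learn is imported under the name "sklearn").
REQUIRED = ["numpy", "pandas", "matplotlib", "sklearn"]

def check_environment(modules=REQUIRED):
    """Map each module name to True if Python can locate it, else False."""
    return {name: find_spec(name) is not None for name in modules}

status = check_environment()
print(f"Python {sys.version.split()[0]}")
for name, available in status.items():
    print(f"{name}: {'ok' if available else 'missing'}")
```

Any library reported as missing can then be installed with pip before continuing.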
Pandas is a powerful and widely-used Python library for data manipulation and
analysis. It provides easy-to-use data structures and functions designed to make
working with structured data seamless. The primary data structures in Pandas are
Series and DataFrame.
1. DataFrame:
- Tabular Data Structure: Pandas DataFrame is a two-dimensional, tabular data structure
resembling a spreadsheet. It consists of rows and columns, making it ideal for representing
and working with structured data.
- Column and Row Operations: Pandas allows easy manipulation of columns and rows,
enabling tasks such as selecting, filtering, and transforming data effortlessly.
2. Series:
- One-Dimensional Array: Series is a one-dimensional labelled array capable of holding
any data type. It is often used to represent a single column or row of data within a
DataFrame.
- Indexed Data: Each element in a Series has a corresponding label or index, allowing
for quick and efficient data retrieval and manipulation.
3. Key Features:
- Data Cleaning: Pandas provides powerful tools for handling missing data, removing
duplicates, and reshaping data to facilitate analysis.
- Grouping and Aggregation: The library facilitates grouping data based on specified
criteria and performing aggregation operations on groups.
- Time Series and Dates: Pandas includes functionalities for handling time series data,
making it valuable for financial and temporal analyses.
4. Data I/O:
- Read and Write Data: Pandas supports reading data from various file formats, including
CSV, Excel, SQL databases, and more. It also enables writing data back to these formats.
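The features above can be seen together in a short example. It is a sketch on made-up data: the column names and values are hypothetical, and an in-memory string stands in for a CSV file on disk.

```python
import io

import pandas as pd

# Hypothetical sales records with one missing value and one duplicated row.
csv_text = """region,product,units
North,Widget,10
North,Gadget,
South,Widget,7
South,Widget,7
South,Gadget,5
"""

# Data I/O: read CSV text into a DataFrame (a file path works the same way).
df = pd.read_csv(io.StringIO(csv_text))

# A single column of a DataFrame is a Series.
units = df["units"]

# Data cleaning: drop the duplicate row and fill the missing value with 0.
df = df.drop_duplicates()
df["units"] = df["units"].fillna(0).astype(int)

# Grouping and aggregation: total units per region.
totals = df.groupby("region")["units"].sum()
print(totals)

# Data I/O: write the result back out as CSV (here into an in-memory buffer).
out = io.StringIO()
totals.to_csv(out)
```

After cleaning, four rows remain, and the totals are 10 units for North and 12 for South.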
2. Key Features:
- Efficient Operations: NumPy provides a variety of mathematical functions that operate
on entire arrays, making computations fast and memory-efficient.
- Broadcasting: NumPy's broadcasting allows for operations between arrays of different
shapes and sizes, making code concise and readable.
- Random Number Generation: The library includes functions for generating random
numbers, essential for simulations and statistical applications.
- Linear Algebra Operations: NumPy has a comprehensive set of functions for linear
algebra, including matrix multiplication, eigenvalue decomposition, and singular value
decomposition.
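Each of these NumPy features takes only a line or two to try. The arrays below are arbitrary illustrative values, not data from the training.

```python
import numpy as np

# Efficient operations: arithmetic applies to the whole array at once.
prices = np.array([10.0, 20.0, 30.0])
with_tax = prices * 1.1

# Broadcasting: a (3, 1) column combined with a (4,) row gives a (3, 4) table.
tens = np.arange(3).reshape(3, 1) * 10   # [[0], [10], [20]]
ones = np.arange(4)                      # [0, 1, 2, 3]
table = tens + ones

# Random number generation with a fixed seed for reproducibility.
rng = np.random.default_rng(seed=42)
samples = rng.normal(loc=0.0, scale=1.0, size=1000)

# Linear algebra: matrix product, eigenvalues, and singular values.
A = np.array([[2.0, 0.0], [0.0, 3.0]])
B = np.array([[1.0, 2.0], [3.0, 4.0]])
product = A @ B                                       # [[2, 4], [9, 12]]
eigenvalues = np.linalg.eigvals(A)                    # 2 and 3 for this diagonal matrix
singular_values = np.linalg.svd(B, compute_uv=False)

print(table)
print(product)
```

In the broadcasting step neither array is copied or reshaped by hand; NumPy stretches the column and the row to a common shape automatically.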
Matplotlib is a comprehensive plotting library for Python, widely used for creating
static, interactive, and animated visualizations. Whether you're visualizing
data trends, creating charts, or generating complex plots, Matplotlib provides a
flexible and extensive set of tools for data visualization.
1. Plotting Styles:
- Line Plots: Matplotlib supports a variety of line plots, allowing users to visualize trends,
time series data, and mathematical functions.
- Scatter Plots: Scatter plots are useful for displaying relationships between two variables,
ideal for exploring correlations in data.
3. Advanced Plotting:
- 3D Plotting: Matplotlib supports 3D plotting for visualizing three-dimensional data and
surfaces.
- Contour Plots: Useful for displaying three-dimensional data in a two-dimensional space,
often used in scientific and engineering applications.
Matplotlib is a foundational library for data visualization in Python. Its versatility and
flexibility make it suitable for a wide range of applications, from exploratory data analysis to
creating publication-quality figures for scientific papers.
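As an illustration of the line and scatter plots described above, the sketch below draws both side by side. The sine data and noise level are made up for the example, and the non-interactive "Agg" backend is selected so the script also runs on a machine without a display.

```python
import io

import matplotlib
matplotlib.use("Agg")  # non-interactive backend: render to an image, not a window
import matplotlib.pyplot as plt
import numpy as np

# Illustrative data: a sine curve and a noisy version of it.
x = np.linspace(0, 2 * np.pi, 100)
rng = np.random.default_rng(0)
noisy = np.sin(x) + rng.normal(scale=0.2, size=x.size)

fig, (ax_line, ax_scatter) = plt.subplots(1, 2, figsize=(8, 3))

# Line plot: good for trends and mathematical functions.
ax_line.plot(x, np.sin(x), label="sin(x)")
ax_line.set_title("Line plot")
ax_line.legend()

# Scatter plot: good for showing the relationship between two variables.
ax_scatter.scatter(x, noisy, s=10)
ax_scatter.set_title("Scatter plot")

fig.tight_layout()

# Save as PNG into memory; pass a filename such as "plots.png" to write to disk.
buffer = io.BytesIO()
fig.savefig(buffer, format="png")
```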
Machine Learning (ML) can be broadly categorized into three main types based on the
learning process and the nature of the training data. These types are supervised
learning, unsupervised learning, and reinforcement learning.
These categories represent the fundamental paradigms of machine learning, and many
real-world applications may involve a combination of these types. Additionally, there
are hybrid approaches and specialized techniques within each category to address
specific challenges and tasks in diverse domains. Understanding the type of machine
learning needed for a particular problem is crucial for selecting the appropriate
algorithms and methodologies.
- A labelled dataset is prepared, consisting of input-output pairs. The input
represents the features or attributes of the data, and the output represents the
corresponding labels or target values.
2. Training:
- The algorithm is trained on the labelled dataset to learn the relationship between
the input features and the target output.
- During training, the algorithm adjusts its internal parameters (weights and biases)
based on the input-output pairs to minimize the difference between the predicted
output and the actual output.
3. Prediction:
- Once the model is trained, it can make predictions on new, unseen data.
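The training and prediction steps can be made concrete with the smallest possible model: one weight and one bias fitted by gradient descent. This is a sketch under stated assumptions, not the method used in the training: the data is synthetic (generated from y = 3x + 2 plus noise), and the learning rate and iteration count are arbitrary values that happen to converge for this data.

```python
import numpy as np

# Labelled dataset: inputs X and target outputs y (synthetic, y is about 3x + 2).
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=100)
y = 3.0 * X + 2.0 + rng.normal(scale=0.5, size=100)

# Training: start from zero and repeatedly adjust the internal parameters
# (weight and bias) to reduce the mean squared error of the predictions.
weight, bias = 0.0, 0.0
learning_rate = 0.01
for _ in range(5000):
    error = (weight * X + bias) - y                     # predicted minus actual
    weight -= learning_rate * 2 * np.mean(error * X)    # gradient w.r.t. weight
    bias -= learning_rate * 2 * np.mean(error)          # gradient w.r.t. bias

def predict(x_new):
    """Prediction: apply the learned parameters to new, unseen input."""
    return weight * x_new + bias

print(f"weight={weight:.2f}, bias={bias:.2f}")
print(f"prediction for x=12: {predict(12.0):.2f}")
```

The learned weight and bias end up close to the values 3 and 2 used to generate the data, which is what "learning the relationship between the input features and the target output" means in this simple case.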
Chapter 7: Project Implementation
4. Predictive Analytics:
- Machine learning models can predict future trends, behaviours, or outcomes
based on historical data.
- Predictive analytics is applied in various domains, including finance,
healthcare, and manufacturing, for forecasting and risk management.
5. Image and Speech Recognition:
- Machine learning enables accurate image and speech recognition, making
advancements in fields like computer vision and natural language processing.
Applications range from facial recognition for security to