0% found this document useful (0 votes)
59 views

Pankaj Python

This document is a training report submitted by Pankaj Kumar, a student studying Electrical and Electronics Engineering at SRM University, Sonepat from 2019-2020. The report details Kumar's training in the Python programming language, covering topics like Python's history and uses of web scraping using Python. It provides an overview of Python's features and popularity. The report concludes with discussing how Beautiful Soup can be used for web scraping in Python.

Uploaded by

Shubham Goel
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
59 views

Pankaj Python

This document is a training report submitted by Pankaj Kumar, a student studying Electrical and Electronics Engineering at SRM University, Sonepat from 2019-2020. The report details Kumar's training in the Python programming language, covering topics like Python's history and uses of web scraping using Python. It provides an overview of Python's features and popularity. The report concludes with discussing how Beautiful Soup can be used for web scraping in Python.

Uploaded by

Shubham Goel
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 15

A TRAINING REPORT

Python Training Report


Bachelor of Technology

In

Electrical and Electronics Engineering (2019-2020)

Submitted To Submitted By:-

S.Murali Pankaj Kumar

Assistant Prof (EEE) Reg No:-10516210004


TABLE OF CONTENTS
1. Declaration i
2. Certificate ii
3. INDEX iii
DECLARATION

I hereby certify that the gone under training in intershala for Python by
“PANKAJ KUMAR”, in partial fulfillment for the award of degree of
B.Tech.(ELECTRICAL AND ELECTRONICS ENGG.) submitted in the
Department of (ELECTRICAL AND ELECTRONICS ENGG.) at SRM
UNIVERSITY, SONEPAT, HARYANA is an authentic record of my own
work carried out during a period from JULY 2019 to NOVEMBER 2019
under the supervision of Mr Mukesh singh . The mater presented in this
project has not been submitted by me in any other University / Institute for
the Award of B.Tech. Degree.

Pankaj Kumar

This is to certify that the above statement made by the student is correct to the best
of our knowledge and belief.

Signature of Supervisors
CERTIFICATE

This is to certify that this report is based on PYTHON language by Pankaj Kumar
(10516210004) , is submitted of partial fulfillment of the requirement for the
degree of Bachelor of Technology in electrical and Electronics Engineering of the
SRM University Delhi-NCR, Sonepat during the academic year 2019-20, is a
bonafide record carried out under our guidance and supervision.

The result embodied in this report have not been submitted to any other University
or Institution for the award of any degree.

Mr. S.Murali Dr. M.Lydia

Assistant Professor (Head of Department)


INDEX

1. INTRODUCTION
2. HISTORY
3. Web Scrapping
4. Uses of Web Scrapping
5. Benefits of Scrapping
6. Using Requests
7. Python Code
8. Beautiful Soup
9. Conclusion and Beautiful Soup
10.Conclusion and Future Scope
11.Reference
PYTHON

Python is an interpreted high-level programming language for general-


purpose programming. Created by Guido van Rossum and first released
in 1991, Python has a design philosophy that emphasizes code
readability, notably using significant whitespace. It provides constructs
that enable clear programming on both small and large scales. In July
2018, Van Rossum stepped down as the leader in the language
community after 30 years.
Python features a dynamic type system and automatic memory
management. It supports multiple programming paradigms, including
object-oriented, imperative, functional and procedural, and has a large
and comprehensive standard library.
Python interpreters are available for many operating systems. CPython,
the reference implementation of Python, is open source software and has
a community-based development model, as do nearly all of Python's
other implementations. Python and CPython are managed by the non-
profit Python Software Foundation.
The language's core philosophy is summarized in the document The Zen
of Python (PEP 20), which includes aphorisms such as:
Beautiful is better than ugly
Explicit is better than implicit
Simple is better than complex
Complex is better than complicated
Readability counts

Users and admirers of Python, especially those considered knowledgeable or


experienced, are often referred to as Pythonists, Pythonistas, and Pythoneers.
Python's large standard library, commonly cited as one of its greatest strengths,
provides tools suited to many tasks. For Internet-facing applications, many
standard formats and protocols such as MIME and HTTP are supported. It includes
modules for creating graphical user interfaces, connecting to relational databases,
generating pseudorandom numbers, arithmetic with arbitrary precision decimals,
manipulating regular expressions, and unit testing.
Some parts of the standard library are covered by specifications (for example, the
Web Server Gateway Interface (WSGI) implementation wsgiref follows PEP 333),
but most modules are not. They are specified by their code, internal
documentation, and test suites (if supplied). However, because most of the
standard library is cross-platform Python code, only a few modules need altering or
rewriting for variant implementations.

The language's core philosophy is summarized in the document The Zen of Python
(PEP 20), which includes aphorisms such as:
Beautiful is better than ugly
Explicit is better than implicit
Simple is better than complex
Complex is better than complicated
Readability counts
Users and admirers of Python, especially those considered knowledgeable or
experienced, are often referred to as Pythonists, Pythonistas, and Pythoneers.
Python's large standard library, commonly cited as one of its greatest strengths,
provides tools suited to many tasks. For Internet-facing applications, many
standard formats and protocols such as MIME and HTTP are supported. It includes
modules for creating graphical user interfaces, connecting to relational databases,
generating pseudorandom numbers, arithmetic with arbitrary precision decimals,
manipulating regular expressions, and unit testing.
Some parts of the standard library are covered by specifications (for example, the
Web Server Gateway Interface (WSGI) implementation wsgiref follows PEP 333),
but most modules are not. They are specified by their code, internal
documentation, and test suites (if supplied). However, because most of the
standard library is cross-platform Python code, only a few modules need .

As of March 2018, the Python Package Index (PyPI), the official repository for
third-party Python software, contains over 130,000 packages with a wide range of
functionality, including:
Graphical user interfaces
Web frameworks
Multimedia
Databases
Networking
Test frameworks
Automation
Web scraping
Documentation
System administration
Scientific computing
Text processing
Image processing
Python's name is derived from the British comedy group Monty Python, whom
Python creator Guido van Rossum enjoyed while developing the language.
Since 2003, Python has consistently ranked in the top ten most popular
programming languages in the TIOBE Programming Community Index where, as
of January 2018, it is the fourth most popular language (behind Java, C, and C++).
It was selected Programming Language of the Year in 2007 and 2010.

History of Python
Guido Van Rossum published the first version of Python code (version 0.9.0) at
alt.sources in February 1991. This release included already exception handling,
functions, and the core data types of list, dict, str and others. It was also object
oriented and had a module system. Python version 1.0 was released in January
1994. The major new features included in this release were the functional
programming tools lambda, map, filter and reduce, which Guido Van Rossum
never liked. Six and a half years later in October 2000, Python 2.0 was introduced.
This release included list comprehensions, a full garbage collector and it was
supporting unicode. Python flourished for another 8 years in the versions 2.x
before the next major release as Python 3.0 (also known as "Python 3000" and
"Py3K") was released. Python 3 is not backwards compatible with Python 2.x. The
emphasis in Python 3 had been on the removal of duplicate programming
constructs and modules, thus fulfilling or coming close to fulfilling the 13th law of
the Zen of Python: "There should be one -- and preferably only one – obvious way
to do it.
WEB SCRAPING
Web scraping, web harvesting, or web data extraction is data scraping used for
extracting data from websites. Web scraping software may access the World Wide
Web directly using the Hypertext Transfer Protocol, or through a web browser.
While web scraping can be done manually by a software user, the term typically
refers to automated processes implemented using a bot or web crawler. It is a form
of copying, in which specific data is gathered and copied from the web, typically
into a central local database or spreadsheet, for later retrieval or analysis.
Web scraping a web page involves fetching it .

Uses of Web Scrapping


There are many real business examples for which web scraping is being currently
used by businesses. The following are some examples.
1. Real Estate Listings gathering - It is a huge and growing web scraping area.
This is an area where the businesses are using web scraping to gather
already listed properties. 10 All MLS companies are using it.
2. Email Address gathering - This is used by a lot of companies. The main
purpose of this is lead generation. Once the emails are collected, bulk
emails are sent.
3. Product review scrapes - this is an important one and the reason why many
companies use it is so that they can keep an eye on their competitors.
4. Scraping to create other websites. The purpose is to get similar data from
different websites and then post all that data into one.
Website creators. You might have heard of a scraper website. A good
example would be indeed, a specific website example would be
10bestquotes.com
5. Collecting data from different social media websites, what’s trending.
A lot of social media companies.
6. Getting massive amounts of data for research purposes. This could be
scraping of government websites or other websites for stats, general
information and such.
Research Companies (This is again a growing one) Research companies
pull in massive amounts of data and then make sense of it.
7. Specific task scraping / One time scraping. This is when you need data from
a particular website for a very specific purpose just one time.

Benefits of Web Scrapping


You might wonder, why I should scrape the web and I have Google? Well, we
don’t reinvent the wheel here. Web scraping is not for creating search engines
only.
You can scrape your competitor’s web pages and analyze the data and see what
kind of products your competitor’s clients are happy with from their responses. All
this for FREE.
A successful SEO tool like Moz that scraps and crawls the entire web and process
the data for you so you can see people’s interest and how to compete with others in
your field to be on the top. 12
These are just some simple uses of web scraping. The scraped data means making
money :).
FIREBASE
Firebase is a mobile and web application development platform developed by
Firebase, Inc. in 2011, then acquired by Google in 2014. As of October 2018, the
Firebase platform has 18 products[6] which are used by 1.5 million apps.

Firebase real-time database


Firebase provides a real-time database and backend as a service. The service
provides application developers an API that allows application data to be
synchronized across clients and stored on Firebase's cloud. The company provides
client libraries that enable integration with Android, iOS, JavaScript, Java,
Objective-C, Swift and Node.js applications. The database is also accessible
through a REST API and bindings for several JavaScript frameworks such as
AngularJS, React, Ember.js and Backbone.js. The REST API uses the Server-Sent
Events protocol, which is an API for creating HTTP connections for receiving push
27
Notifications from a server. Developers using the real-time database can secure
their data by using the company's server-side-enforced security rules. Cloud Fire
store which is Firebase's next generation of the Real-time Database was released
for beta use.
Python code
Conclusion and future scope
Web scraping enables a user with enormous amount of data, the whole internet act
like a database to the user.
This project sets an example for using web scraping and android and connecting
both the firebase. These technologies would definitely bring change in the future.
In coming years this project would be transformed into complete android
application.
References
1. Internshala(www.internshala.com)
2. Firebase(firebase.google.com)
3. www.python.org
4. Crummy
5. docs.python.com
6. Wikipedia(www.wikipedia.org)
7. StackOverflow(www.stackoverflow.com )

You might also like