
Project 2: Using Selenium + Python (schools)

This document outlines a project that uses Selenium and Python to scrape school data from a specified website. It covers the necessary setup, including module imports, selectors, wait strategies, and the final code that extracts the data and saves it to a CSV file. Key steps include handling dynamic elements, managing browser visibility, and extracting the data reliably.

Demo URL: https://directory.ntschools.net/#/schools

Data to be extracted:
• Name of schools
• Telephone number
• Email Address
• Physical and Postal Address
METHOD:
• pip install selenium

• Use https://selenium-python.readthedocs.io/ for further documentation
Step 1: Import the necessary modules: webdriver, Chrome, Service, By, Keys, time
• The By module is used for locating elements, mainly via:
– By.XPATH
– By.CSS_SELECTOR
• Only if the CSS selector is not working, fall back to:
– By.ID
– By.CLASS_NAME
• The Keys module is used for sending key events
• The time module is used to pause execution so the browser has time to produce the desired result; without a pause, the driver may quit before the page has finished loading.

from selenium import webdriver
from selenium.webdriver import Chrome
from selenium.webdriver.chrome.service import Service
from selenium.webdriver.common.by import By
from selenium.webdriver.common.keys import Keys
import time
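
Before the next question, it helps to know which Selenium version is installed, since executable_path was removed in Selenium 4. A quick check:

import selenium
print(selenium.__version__)  # 4.x removed the executable_path argument; use Service instead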
Ques: Nowadays executable_path is not working. What to do?

• Ans:
• # driver=Chrome(executable_path="D:/DataScientist/WebScraperPractical/chromedriver-win64/chromedriver.exe")
• # The executable_path argument has been removed, so use Service instead:
• s=Service("D:/DataScientist/WebScraperPractical/chromedriver-win64/chromedriver.exe")
• driver=webdriver.Chrome(service=s)

• CHECK DOCUMENTATION: https://selenium-python.readthedocs.io/
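
Note: in Selenium 4.6 and later, Selenium Manager can download a matching chromedriver automatically, so the Service path becomes optional. A minimal sketch, assuming Selenium >= 4.6:

from selenium import webdriver

# Selenium Manager resolves the driver binary automatically (Selenium 4.6+)
driver = webdriver.Chrome()
driver.get("https://directory.ntschools.net/#/schools")
print(driver.title)
driver.quit()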


Step 2: Add the SelectorGadget extension in Chrome

We use it to grab CSS selectors quickly, so that we don't have to repeatedly dig through the Inspect tab.
Step 3:
• Now go to your desired website and click on the extension. Then click on the element you want: it turns YELLOW. Click on anything you don't want: it turns RED.
• In the bottom-right corner, the extension shows the generated selector and a match count (a screenshot of this appeared in the original slides).
• 273 here is the total number of matched items; you can verify the count in Selenium, as shown below.
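
A small sanity check, continuing from the driver set up earlier: the count printed should match the number SelectorGadget reported.

# count how many elements the SelectorGadget CSS selector matches
selector = "#search-panel-container .nav-link"
links = driver.find_elements(By.CSS_SELECTOR, selector)
print(len(links))  # should match the extension's count (e.g. 273)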


Step 4: There are two types of WAITS:

1. Implicit wait – tells the driver to keep trying for up to the given number of seconds while locating an element.
e.g. driver.implicitly_wait(10)

# implicit wait
driver.implicitly_wait(20)

selector="#search-panel-container .nav-link"
links=driver.find_elements(By.CSS_SELECTOR, selector)

2. Explicit wait – makes WebDriver wait for a certain condition to occur before proceeding further with execution. (A hard pause like time.sleep(10) also stops the script, but it always waits the full duration; the WebDriverWait call below is the proper explicit wait.)

# NOW an optional explicit wait (needs two extra imports)
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC

links=WebDriverWait(driver,10).until(EC.presence_of_all_elements_located((By.CSS_SELECTOR, selector)))

Remember, there is a chance of an error when you run the line above: until() takes exactly one argument, and the expected condition itself takes a single locator tuple, so the parentheses nest. Check that there are three closing parentheses at the end, and pass selector without quotes (it is already a string variable).
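
One way to avoid miscounting the parentheses is to pull the locator tuple out into its own variable; a small sketch:

# the locator is a single (strategy, value) tuple
locator = (By.CSS_SELECTOR, selector)
links = WebDriverWait(driver, 10).until(
    EC.presence_of_all_elements_located(locator)
)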
Step 5: Now we have to click on the links

1. Write links[i].click() (click a single element from the list, not the list itself)
2. Click on a school name, open the INSPECT tab, and copy the school title, which is a class tag
3. Press CTRL+F in the Elements panel and paste it with a dot prefix: .school-title h1
4. Check that it is the only element of its kind; if so, this is our selector for the name
Ques: Whenever you get a "stale element reference" exception

Ans: It means the element reference you are holding is no longer attached to the current page.

Whenever we click a link, the browser moves to another page, and the previously located elements (held in memory) become stale.

So we have to re-locate the links inside the loop:

for i in range(2):
    # re-locate the links on each iteration so the references are fresh
    links=WebDriverWait(driver,10).until(EC.presence_of_all_elements_located((By.CSS_SELECTOR, selector)))
    links[i].click()
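
An alternative sketch that sidesteps stale references entirely: collect the link URLs up front, then navigate with driver.get(). This assumes each .nav-link exposes an href attribute, which may not hold on every site:

# grab the URLs first so we never hold on to stale element references
urls = [link.get_attribute("href") for link in links]
for url in urls[:2]:
    driver.get(url)
    # ... extract the details here ...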
Step 6: Here,

driver.find_element(By.XPATH,'//div[text()="Physical Address"]/following-sibling::div')

Here we are being specific: the div element whose text is "Physical Address", then the sibling div that follows it.

Or

driver.find_element(By.XPATH,'//*[text()="Physical Address"]/following-sibling::*')

where * means any element. A helper built on this pattern is sketched below.
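
Since the same label/value pattern repeats for every field, it can be wrapped in a small helper. A sketch: field_value is a hypothetical name, and the Phone field in the final code uses a slightly different XPath, so it is handled separately.

def field_value(driver, label):
    # return the text of the div that immediately follows the given label div
    xpath = f'//div[text()="{label}"]/following-sibling::div'
    return driver.find_element(By.XPATH, xpath).text

# usage:
# physical = field_value(driver, "Physical Address")
# postal = field_value(driver, "Postal Address")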
Step 7: Making the Chrome browser headless

# used to hide Chrome's visibility (headless mode)
options=ChromeOptions()
options.add_argument('--headless')

Note: options.headless=True is deprecated in recent Selenium versions; use options.add_argument('--headless') instead, as the final code does. ChromeOptions is imported from selenium.webdriver.
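
A minimal end-to-end sketch of the headless setup, reusing the chromedriver path from earlier:

from selenium import webdriver
from selenium.webdriver import ChromeOptions
from selenium.webdriver.chrome.service import Service

options = ChromeOptions()
options.add_argument('--headless')     # no visible browser window
options.add_argument('--disable-gpu')  # recommended on Windows

s = Service("D:/DataScientist/WebScraperPractical/chromedriver-win64/chromedriver.exe")
driver = webdriver.Chrome(service=s, options=options)
driver.get("https://directory.ntschools.net/#/schools")
print(driver.title)  # confirms the page loaded even without a window
driver.quit()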

Step 8: To save the output

with open('ntschools_data.csv','w', newline='', encoding='utf-8') as f:
    writer=csv.DictWriter(f, fieldnames=['name','physical_add','postal_add','phone_no'])
    writer.writeheader()
    writer.writerows(results)

Here ntschools_data.csv is the output file; 'w' means write mode; newline='' stops the csv module from inserting a blank row between records on Windows. This needs import csv.
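
To confirm the file was written correctly, it can be read back with the standard csv module; a quick sketch:

import csv

with open('ntschools_data.csv', newline='', encoding='utf-8') as f:
    for row in csv.DictReader(f):
        print(row['name'], row['phone_no'])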
FINAL CODE:
from selenium import webdriver
from selenium.webdriver import Chrome, ChromeOptions
from selenium.webdriver.chrome.service import Service
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
import time
import csv

# used to hide chrome visibility (headless mode)
options=ChromeOptions()
options.add_argument('--headless')
# options.headless=True  # deprecated; use add_argument instead
options.add_argument('--disable-gpu')

s=Service("D:/DataScientist/WebScraperPractical/chromedriver-win64/chromedriver.exe")
driver=webdriver.Chrome(service=s, options=options)
driver.get("https://directory.ntschools.net/#/schools")

selector="#search-panel-container .nav-link"
# explicit wait for the school links to appear
links=WebDriverWait(driver,30).until(EC.presence_of_all_elements_located((By.CSS_SELECTOR, selector)))

school_name_selector=".school-title h1"
results=[]
# for i in range(len(links)):  # full run; 3 schools used here for testing
for i in range(3):
    # re-locate the links each time to avoid stale element references
    links=WebDriverWait(driver,30).until(EC.presence_of_all_elements_located((By.CSS_SELECTOR, selector)))
    links[i].click()

    name_e=WebDriverWait(driver,30).until(EC.presence_of_element_located((By.CSS_SELECTOR, school_name_selector)))
    # print(name_e.text)
    details={
        'name':name_e.text,
        'physical_add':driver.find_element(By.XPATH,'//div[text()="Physical Address"]/following-sibling::div').text,
        'postal_add':driver.find_element(By.XPATH,'//div[text()="Postal Address"]/following-sibling::div').text,
        'phone_no':driver.find_element(By.XPATH,'//div[text()="Phone"]/following-sibling::*/a').text,
    }
    results.append(details)
    driver.back()  # goes one step back in browser history

# print(results)
with open('ntschools_data1.csv','w', newline='', encoding='utf-8') as f:
    writer=csv.DictWriter(f, fieldnames=['name','physical_add','postal_add','phone_no'])
    writer.writeheader()
    writer.writerows(results)

driver.quit()
