
12.using Proxies With Python Selenium - ScrapeOps

This document provides a guide on how to integrate proxies into Python Selenium for web scraping. It covers the integration of simple HTTP proxies, authenticated proxies using Selenium Wire, and the use of proxy APIs, detailing the necessary code snippets for both Chrome and Firefox browsers. Additionally, it emphasizes the importance of using proxy port integration over API endpoints for better functionality with headless browsers.

Uploaded by

Khánh Cao Minh


3/31/24, 9:33 PM Using Proxies With Python Selenium | ScrapeOps

Using Proxies With Python Selenium

Selenium is a powerful browser automation library that allows you to build bots and scrapers that can
load and interact with web pages in the browser. As a result, Selenium is very popular amongst the
Python web scraping community.

In this guide for The Python Selenium Web Scraping Playbook, we will look at how to integrate proxies
into our Python Selenium-based web scraper.

There are a number of different types of proxies, each of which you need to integrate differently with
Selenium, so we will walk through how to integrate each type:

Using Proxies With Selenium

Using Authenticated Proxies With Selenium
Integrating Proxy APIs

Using Proxies With Selenium

The first and simplest type of proxy to integrate with Python Selenium is a simple HTTP proxy (in the
form of an IP address and port) that doesn't require authentication. For example:

https://scrapeops.io/selenium-web-scraping-playbook/python-selenium-proxy/

"11.456.448.110:8080"

Depending on which browser you are using, the integration method is slightly different.

Integrating Proxy With Selenium Chrome Browser

To integrate this proxy IP into a Selenium scraper that uses a Chrome browser, we just need to set the
--proxy-server argument in our WebDriver options:

from selenium import webdriver

## Example Proxy
PROXY = "11.456.448.110:8080"

## Create WebDriver Options to Add Proxy
chrome_options = webdriver.ChromeOptions()
chrome_options.add_argument(f'--proxy-server={PROXY}')
## Selenium 4 takes the options object through the `options` keyword
chrome = webdriver.Chrome(options=chrome_options)

## Make Request Using Proxy
chrome.get("http://httpbin.org/ip")

Now when we run the script we can see that Selenium is using the defined proxy IP:

{
"origin": "11.456.448.110:8080"
}
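Before handing a proxy string to Chrome, it can help to check that it is well formed. The following is a minimal sketch using only the standard library. The helper name `build_proxy_argument` and the documentation-range address `203.0.113.10` are illustrative assumptions, not part of Selenium:

```python
import ipaddress


def build_proxy_argument(proxy: str, scheme: str = "http") -> str:
    """Validate an 'IPv4:PORT' proxy string and return a --proxy-server argument."""
    host, sep, port_text = proxy.rpartition(":")
    if not sep or not host:
        raise ValueError(f"expected 'IP:PORT', got {proxy!r}")
    # Raises ValueError if the host is not a valid IP address.
    ipaddress.ip_address(host)
    # Raises ValueError if the port is not numeric.
    port = int(port_text)
    if not 0 < port < 65536:
        raise ValueError(f"port out of range: {port}")
    return f"--proxy-server={scheme}://{host}:{port}"


if __name__ == "__main__":
    print(build_proxy_argument("203.0.113.10:8080"))
```

The returned string can be passed to `chrome_options.add_argument(...)`. Note that the placeholder address used in this guide would be rejected by this check, because 456 and 448 are not valid IPv4 octets; swap in a real proxy address before running the scraper.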

Integrating Proxy With Selenium Firefox Browser

To integrate this proxy IP into a Selenium scraper that uses a Firefox browser, we need to use the Proxy
and ProxyType classes from the Selenium WebDriver library:

from selenium import webdriver
from selenium.webdriver.common.proxy import Proxy, ProxyType
from selenium.webdriver.firefox.service import Service

## Define Proxy
proxy = Proxy({
    'proxyType': ProxyType.MANUAL,
    'httpProxy': "11.456.448.110:8080",
    'noProxy': ''
})

## Attach Proxy to Firefox Options (Selenium 4 style)
firefox_options = webdriver.FirefoxOptions()
firefox_options.proxy = proxy

## Create Driver
firefox_driver = webdriver.Firefox(
    service=Service(executable_path=r"/root/geckodriver"),
    options=firefox_options,
)

## Make Request Using Proxy
firefox_driver.get("http://httpbin.org/ip")

Now when we run the script we can see that Selenium is using the defined proxy IP:

{
"origin": "11.456.448.110:8080"
}
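The check described above can also be automated rather than read off the screen. The following is a minimal sketch, assuming you already have the response body as a JSON string and that the addresses are IPv4; the helper names are hypothetical:

```python
import json


def _host(address: str) -> str:
    """Strip surrounding whitespace and any ':PORT' suffix from an IPv4 address."""
    return address.strip().split(":", 1)[0]


def origin_matches_proxy(response_body: str, proxy: str) -> bool:
    """Return True if the reported origin contains the proxy's IP address."""
    reported = json.loads(response_body)["origin"]
    # The origin field may hold several comma-separated addresses,
    # so compare the proxy host against each of them.
    return _host(proxy) in {_host(item) for item in reported.split(",")}


if __name__ == "__main__":
    body = '{"origin": "203.0.113.10"}'
    print(origin_matches_proxy(body, "203.0.113.10:8080"))
```

With Selenium, one way to obtain the body text is `driver.find_element(By.TAG_NAME, "body").text`, although how the browser renders a JSON response can vary, so treat that step as an assumption to verify in your own setup.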

HTTP PROXY AUTHENTICATION

This method works fine when you don't need to add an authentication username and password to
the proxy. We will look at how to use authenticated proxies in the next section.

Using Authenticated Proxies With Selenium

The above method doesn't work if you need to use proxies that require username and password
authentication.

It is very common for commercial proxy providers to sell access to their proxy pools by giving you a single
proxy endpoint that you send your requests to, authenticating your account with a username and
password.

"http://USERNAME:PASSWORD@proxy-server:8080"

There are a couple of ways to solve this, but one of the easiest is to use the Selenium Wire extension, which
makes it very easy to use proxies with Selenium.
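Whichever approach you choose, the authenticated proxy URL has to be assembled from your credentials. The following is a minimal sketch of building and then re-parsing such a URL with the standard library. The helper name `build_proxy_url` and the sample credentials are hypothetical, and whether a given client decodes percent-encoded credentials is worth checking against its own documentation:

```python
from urllib.parse import quote, unquote, urlsplit


def build_proxy_url(username: str, password: str, host: str, port: int,
                    scheme: str = "http") -> str:
    """Build a proxy URL, percent-encoding characters such as '@' and ':'."""
    user = quote(username, safe="")
    secret = quote(password, safe="")
    return f"{scheme}://{user}:{secret}@{host}:{port}"


if __name__ == "__main__":
    url = build_proxy_url("user", "p@ss:word", "proxy-server", 8080)
    print(url)

    # Parsing the URL back recovers each component separately.
    parts = urlsplit(url)
    print(parts.hostname, parts.port, unquote(parts.password))
```

Encoding the credentials keeps a password containing `@` or `:` from being mistaken for the separator between the credentials and the host.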

First, you need to install Selenium Wire using pip:

pip install selenium-wire


Then update your scraper to use the seleniumwire webdriver instead of the default selenium
webdriver:

from selenium.webdriver.chrome.service import Service
from seleniumwire import webdriver
from webdriver_manager.chrome import ChromeDriverManager

## Define Your Proxy Endpoints
proxy_options = {
    'proxy': {
        'http': 'http://USERNAME:PASSWORD@proxy-server:8080',
        'https': 'http://USERNAME:PASSWORD@proxy-server:8080',
        'no_proxy': 'localhost,127.0.0.1'
    }
}

## Set Up Selenium Chrome driver (Selenium 4 passes the driver path via Service)
driver = webdriver.Chrome(
    service=Service(ChromeDriverManager().install()),
    seleniumwire_options=proxy_options,
)

## Send Request Using Proxy
driver.get('http://httpbin.org/ip')

Now when we run the script we can see that Selenium is using a proxy IP:

{
"origin": "201.88.548.330:8080"
}
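If you configure several scrapers, the options dictionary can be generated rather than typed by hand. The following is a minimal sketch; the helper name `make_seleniumwire_options` is hypothetical, and the dictionary shape simply mirrors the example above, with the bypass hosts joined by commas:

```python
from typing import Dict, Iterable


def make_seleniumwire_options(
    proxy_url: str,
    bypass_hosts: Iterable[str] = ("localhost", "127.0.0.1"),
) -> Dict[str, Dict[str, str]]:
    """Build a Selenium Wire style options dict for a single proxy endpoint."""
    return {
        "proxy": {
            # The keys name the scheme of the outgoing request; both are
            # routed through the same proxy endpoint.
            "http": proxy_url,
            "https": proxy_url,
            # Hosts that should not go through the proxy.
            "no_proxy": ",".join(bypass_hosts),
        }
    }


if __name__ == "__main__":
    options = make_seleniumwire_options("http://USERNAME:PASSWORD@proxy-server:8080")
    print(options)
```

The result would be passed as `seleniumwire_options=options` when creating the driver.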

Selenium Wire has a lot of other powerful functionality, so if you would like to learn more then check out
our full Selenium Wire guide here.

Integrating Proxy APIs

Over the last few years there has been a huge surge in proxy providers that offer smart proxy solutions
that handle all the proxy rotation, header selection, ban detection and retries on their end. These smart
APIs typically provide their proxy services in an API endpoint format.

However, these proxy API endpoints don't integrate well with headless browsers when the website
uses relative links, because Selenium will try to attach the relative URL to the proxy API endpoint rather
than to the website's root URL. This results in some pages not loading correctly.
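The relative-link problem can be reproduced with standard URL resolution, which is the same rule a browser applies. The following is a minimal sketch; the endpoint `https://proxy-api.example.com/v1/` and its `api_key` and `url` query parameters are hypothetical and stand in for any proxy API endpoint:

```python
from urllib.parse import quote, urljoin

TARGET_PAGE = "http://quotes.toscrape.com/"

# Hypothetical proxy API endpoint, for illustration only.
API_ENDPOINT = "https://proxy-api.example.com/v1/"
api_url = f"{API_ENDPOINT}?api_key=APIKEY&url={quote(TARGET_PAGE, safe='')}"

# A relative link as it might appear in the page's HTML.
relative_link = "/page/2/"

# Loaded through the API endpoint, the link resolves against the proxy host.
via_api = urljoin(api_url, relative_link)

# Loaded through a proxy port, the browser still sees the real site URL.
direct = urljoin(TARGET_PAGE, relative_link)

print(via_api)  # https://proxy-api.example.com/page/2/
print(direct)   # http://quotes.toscrape.com/page/2/
```

In the first case the follow-up request goes to the proxy provider's host and loses the target site entirely, which is why pages with relative links fail to load.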


For this reason, when integrating your Selenium scrapers it is recommended that you use the provider's
proxy port integration rather than the API endpoint integration when one is available (not all providers
offer a proxy port integration).

For example, in the case of the ScrapeOps Proxy Aggregator we offer a proxy port integration for
situations like this.

The proxy port integration is a light front-end for the API. It has all the same functionality and
performance as sending requests to the API endpoint, but allows you to integrate our proxy aggregator
as you would any normal proxy.

The following is an example of how to integrate the ScrapeOps Proxy Aggregator into your Selenium
scraper:

from selenium.webdriver.chrome.service import Service
from seleniumwire import webdriver
from webdriver_manager.chrome import ChromeDriverManager

SCRAPEOPS_API_KEY = 'APIKEY'

## Define ScrapeOps Proxy Port Endpoint
proxy_options = {
    'proxy': {
        'http': f'http://scrapeops:{SCRAPEOPS_API_KEY}@proxy.scrapeops.io:5353',
        'https': f'http://scrapeops:{SCRAPEOPS_API_KEY}@proxy.scrapeops.io:5353',
        'no_proxy': 'localhost,127.0.0.1'
    }
}

## Set Up Selenium Chrome driver (Selenium 4 passes the driver path via Service)
driver = webdriver.Chrome(
    service=Service(ChromeDriverManager().install()),
    seleniumwire_options=proxy_options,
)

## Send Request Using ScrapeOps Proxy
driver.get('http://quotes.toscrape.com/')

Full integration docs for Python Selenium and the ScrapeOps Proxy Aggregator can be found here.

TIP

To use the ScrapeOps Proxy Aggregator, you first need an API key which you can get by signing up
for a free account here which gives you 1,000 free API credits.


Or check out one of our more in-depth guides:

Selenium Undetected Chromedriver Guide: Bypass Anti-Bots With Ease

How to Scrape The Web Without Getting Blocked Guide
The Ethics of Web Scraping
