Ruby Mechanize Cheat Sheet

The document summarizes how to use the Mechanize gem to programmatically control a web browser and interact with web pages and forms. It describes how to configure the agent, access and submit forms, navigate pages and select elements, and provides examples of common tasks like form filling, link clicking, and extracting page content.

Uploaded by

Attila Gáspár

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

467 views1 page

Ruby Mechanize Cheat Sheet

Uploaded by

Attila Gáspár

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

The Agent

require 'rubygems' require 'mechanize' agent = WWW::Mechanize.new # disable keep_alive, when running into # problems with session timeouts or getting an # EOFError agent.keep_alive = false # Setting user agent: agent.user_agent = 'Friendly Mechanize Script" # Using one of the predefined user agents: # 'Mechanize', 'Mac Mozilla', 'Linux Mozilla'. # 'Windows IE 6', 'iPhone', 'Linux Konqueror', # 'Windows IE 7', 'Mac FireFox', 'Mac Safari', # 'Windows Mozilla' agent.user_agent_alias = 'Mac Safari' # To verify server certificates: # (A collection of certificates is available # here: http://curl.haxx.se/ca/ ) agent.ca_file = 'cacert.pem' # Don't follow HTTP redirects agent.redirect_ok = false # Follow refresh in meta tags agent.follow_meta_refresh = true # Enable logging require 'logger' agent.log = Logger.new('mechanize.log')
# The current page agent.page

Accessing page elements

Forms
# Submitting a form without a button: form.submit # Submitting a form with the default button form.click_button() # Submitting a form with a specific button form.click_button(form.button_with(:name => 'OK') # Form elements form.fields, form.buttons, form.file_uploads, form.radio_buttons, form.checkboxes # Form elements can be selected just like page elements # form.element(s)_with(:criteria => value) # e.g.: form.field_with(:name => 'password') form.field_with('password') form.checkboxes(:value => /view_.*/) # Field values can also be selected directly by their name form.password = 'secret' # Setting field values # field : .value = 'something' # checkbox : .(un)check / .checked = true|false # radio_button: .(un)check / .checked = true|false # file_upload : .file_name = '/tmp/upload.dat' # e.g.: form.field_with('foo').value = 'something' form.checkbox_with(:value => 'blue').uncheck form.radio_buttons[3].check # Select lists / drop down fields: form.field_with('color').option[2].select form.field_with('color').options.find{|o| o.value == 'red'}.select form.field_with('color').select_none form.field_with('color').select_all

# The HTML page content page.body # forms, links, frames page.forms, page.links, frames # Selecting by criteria follows the pattern: # page.element(s)_with(:criteria => value) # The plural form (.elements) returns an # array, the singular form (.element) the # first matching element or nil. Criteria # is an attribute symbol and value may be # a string or a regular expression. If no # criteria attributr is given, :name will # be used. e.g.: page.form_with(:name => 'formName') page.form_with('formName') page.links_with(:text => /[0-9]*/

Ruby / Mechanize
http://mechanize.rubyforge.org/mechanize/

Nokogiri
http://nokogiri.org/

Parsing the page content Hello Mechanize! Navigation/History

# load a page agent.get('http://the.internet.net') # Go back to the last page: agent.back # Follow a link by its text agent.link_with(:text => 'click me').click # Backup history, execute block and # restore history agent.transact do ... end require 'rubygems' require 'mechanize' agent = WWW::Mechanize.new agent.get('http://rubyforge.org/') agent.page.forms.first.words = 'mechanize' agent.page.forms.first.click_button agent.page.link_with(:text => /WWW::Mechanize/).click agent.page.link_with(:text => 'Files').click links = agent.page / 'strong/a' version = links.find do |link| link['href'] =~ /shownotes.*release_id/ end.text puts "Hello Mechanize #{version}!"

# Selecting elements from the documents DOM nodes = agent.page.search('expression') nodes = agent.page / 'expression' # Selecting the first matching element or nil node = agent.page.at('expression') # 'expression' might be an XPath or CSS selector nodes = agent.page.search('//h2/a[@class="title"]') nodes = agent.page.search('.h2 a.title') # navigating the document tree: node.parent node.children # node content and attributes node.text node.inner_html node.attributes['width'] # found nodes, can be searched the same way rows = agent.page / 'table/tr' value = rows[0].at('td[@class="value"]').text

Version 2010-01-30 (c) 2010 Tobias Grimm

Creative Commons License http://creativecommons.org/licenses/by/3.0

Selenium Python PDF
100% (2)
Selenium Python PDF
80 pages
SEO Tools by Aaron Wall
100% (1)
SEO Tools by Aaron Wall
12 pages
A Complete Overview On: Web-Development
67% (3)
A Complete Overview On: Web-Development
105 pages
Advanced Web Scraping - Bypassing - 403 Forbidden, - Captchas, and More - Sangaline
No ratings yet
Advanced Web Scraping - Bypassing - 403 Forbidden, - Captchas, and More - Sangaline
12 pages
Software Requirement Specification On Web Browser
67% (3)
Software Requirement Specification On Web Browser
25 pages
Python Selenium Module Docs
No ratings yet
Python Selenium Module Docs
75 pages
How To Make OB Config Easy
100% (1)
How To Make OB Config Easy
2 pages
Project Report On Website Job Consultancy
60% (5)
Project Report On Website Job Consultancy
83 pages
Javascript Cheatsheet Pag2
100% (1)
Javascript Cheatsheet Pag2
2 pages
Selenium Python
50% (2)
Selenium Python
53 pages
Free SEO Audit Template
No ratings yet
Free SEO Audit Template
3 pages
Selenium Python Readthedocs - Selenium Python Bindings
100% (1)
Selenium Python Readthedocs - Selenium Python Bindings
116 pages
Scrapping The Web
100% (1)
Scrapping The Web
13 pages
C3SA Module 03 V1
No ratings yet
C3SA Module 03 V1
116 pages
Prompt Engineering
100% (3)
Prompt Engineering
37 pages
Basic of OB Configs
No ratings yet
Basic of OB Configs
2 pages
Final SRS
No ratings yet
Final SRS
7 pages
Landing Page Optimization
0% (1)
Landing Page Optimization
16 pages
Accessories: SEO Report For
No ratings yet
Accessories: SEO Report For
47 pages
Web Scraping and Data Collection CheatSheet 1731972399
No ratings yet
Web Scraping and Data Collection CheatSheet 1731972399
10 pages
(2022) The Browser Environment - A Systems Programmer's Perspective (Sinatra Edition)
No ratings yet
(2022) The Browser Environment - A Systems Programmer's Perspective (Sinatra Edition)
33 pages
Web Functions of Load Runner
No ratings yet
Web Functions of Load Runner
2 pages
Python Report
No ratings yet
Python Report
9 pages
RoR 8 Action View Form Helpers
No ratings yet
RoR 8 Action View Form Helpers
24 pages
Splinter Docs
No ratings yet
Splinter Docs
104 pages
Capybara Cheat Sheet
No ratings yet
Capybara Cheat Sheet
2 pages
Automation
No ratings yet
Automation
73 pages
Software Requirement Specification On Web Browser 1
No ratings yet
Software Requirement Specification On Web Browser 1
23 pages
WT QB Ans
No ratings yet
WT QB Ans
23 pages
Selenium Python Readthedocs Io en Latest
No ratings yet
Selenium Python Readthedocs Io en Latest
128 pages
Selenium Python Readthedocs Io en Latest
No ratings yet
Selenium Python Readthedocs Io en Latest
118 pages
Selenium Python Readthedocs Io en Latest
No ratings yet
Selenium Python Readthedocs Io en Latest
121 pages
Selenium Python Readthedocs Io en Latest
No ratings yet
Selenium Python Readthedocs Io en Latest
120 pages
Selenium Python Readthedocs Io en Latest
No ratings yet
Selenium Python Readthedocs Io en Latest
130 pages
Coding
No ratings yet
Coding
62 pages
Sinatra
No ratings yet
Sinatra
13 pages
MKTC - 605 (Digital Marketing)
100% (1)
MKTC - 605 (Digital Marketing)
53 pages
GET Head Post: Request Has Been Successfully Completed. Responses Are Grouped in Five Classes
No ratings yet
GET Head Post: Request Has Been Successfully Completed. Responses Are Grouped in Five Classes
3 pages
Aditya Polytechnic Beed: Microproject On
No ratings yet
Aditya Polytechnic Beed: Microproject On
29 pages
Ultimate Seo Checklist
100% (1)
Ultimate Seo Checklist
1 page
UNIT3
No ratings yet
UNIT3
7 pages
SEO Complete Guide by Surojit
No ratings yet
SEO Complete Guide by Surojit
55 pages
Active Ecommerce CMS Documentation
No ratings yet
Active Ecommerce CMS Documentation
118 pages
CAN - Digital Marketing Strategy With SOSTAC
No ratings yet
CAN - Digital Marketing Strategy With SOSTAC
63 pages
HTML and CSS Depth
No ratings yet
HTML and CSS Depth
23 pages
On Page SEO Report - Check SEO
No ratings yet
On Page SEO Report - Check SEO
5 pages
University of Salford: Ahmed Salah
No ratings yet
University of Salford: Ahmed Salah
18 pages
DM MCQ
No ratings yet
DM MCQ
11 pages
FSD Module1 HTML
No ratings yet
FSD Module1 HTML
104 pages
Pawan Prasad SEO
No ratings yet
Pawan Prasad SEO
5 pages
Ai Prompts Nova Project
No ratings yet
Ai Prompts Nova Project
118 pages
HTML
No ratings yet
HTML
68 pages
Course Material
No ratings yet
Course Material
50 pages
What Is Search Engine Optimization SEO
No ratings yet
What Is Search Engine Optimization SEO
8 pages
SEO Analyzer - Generate A Free SEO Report of Your Website
No ratings yet
SEO Analyzer - Generate A Free SEO Report of Your Website
11 pages
Survey On Search Engine Optimization Tools & Techniques
No ratings yet
Survey On Search Engine Optimization Tools & Techniques
5 pages
Tutorial Meta Tag Blogger
No ratings yet
Tutorial Meta Tag Blogger
5 pages
47 - 49-Pinterest-Marketing-101-9121
No ratings yet
47 - 49-Pinterest-Marketing-101-9121
66 pages
SEOFun Sheet
No ratings yet
SEOFun Sheet
36 pages
Exness SC Partnership Agreement
No ratings yet
Exness SC Partnership Agreement
22 pages
HTML Tag: Definition and Usage
No ratings yet
HTML Tag: Definition and Usage
5 pages
Amadeus Hotel SEO Checklist
No ratings yet
Amadeus Hotel SEO Checklist
3 pages
Essential n8n Playbook
From Everand
Essential n8n Playbook
Leandro Calado
No ratings yet
React Portfolio App Development: Increase your online presence and create your personal brand
From Everand
React Portfolio App Development: Increase your online presence and create your personal brand
Abdelfattah Ragab
No ratings yet
10 Lessons in Front-end
From Everand
10 Lessons in Front-end
Krasimir Tsonev
2/5 (1)
Angular Portfolio App Development: Create your personal brand
From Everand
Angular Portfolio App Development: Create your personal brand
Abdelfattah Ragab
No ratings yet
Angular Reactive Forms: Everything you need to know
From Everand
Angular Reactive Forms: Everything you need to know
Abdelfattah Ragab
No ratings yet
Angular for Beginners: Everything you need to know
From Everand
Angular for Beginners: Everything you need to know
Abdelfattah Ragab
No ratings yet
Angular Routing: Everything you need to know
From Everand
Angular Routing: Everything you need to know
Abdelfattah Ragab
No ratings yet
Angular Generative AI: Building an intelligent CV enhancer with Google Gemini
From Everand
Angular Generative AI: Building an intelligent CV enhancer with Google Gemini
Abdelfattah Ragab
No ratings yet
Introduction to PHP Web Services: PHP, JavaScript, MySQL, SOAP, RESTful, JSON, XML, WSDL
From Everand
Introduction to PHP Web Services: PHP, JavaScript, MySQL, SOAP, RESTful, JSON, XML, WSDL
Imran Ghani
No ratings yet
How to a Developers Guide in 4k: Developer edition, #2
From Everand
How to a Developers Guide in 4k: Developer edition, #2
Xinc Cyberwizard
No ratings yet
Quick JavaScript Learning In Just 3 Days: Fast-Track Learning Course
From Everand
Quick JavaScript Learning In Just 3 Days: Fast-Track Learning Course
Vijay K.R.
No ratings yet
Stripe Integration in Angular: A Step-by-Step Guide to Creating Payment Functionality
From Everand
Stripe Integration in Angular: A Step-by-Step Guide to Creating Payment Functionality
Abdelfattah Ragab
No ratings yet
Angular HTTP: Connecting to the REST API
From Everand
Angular HTTP: Connecting to the REST API
Abdelfattah Ragab
No ratings yet
Four Programming Languages Creating a Complete Website Scraper Application
From Everand
Four Programming Languages Creating a Complete Website Scraper Application
Stephen J Link
No ratings yet
50 Recipes for Programming Angular
From Everand
50 Recipes for Programming Angular
Jamie Munro
4/5 (1)
Introduction to PHP, Part 4, Second Edition
From Everand
Introduction to PHP, Part 4, Second Edition
Adam Majczak
No ratings yet
C# Interview Questions, Answers, and Explanations: C Sharp Certification Review
From Everand
C# Interview Questions, Answers, and Explanations: C Sharp Certification Review
equitypress
4.5/5 (3)
Intermediate Load Runner With Oracle/Apex Concepts.
From Everand
Intermediate Load Runner With Oracle/Apex Concepts.
Rohan Gordon
No ratings yet
Introduction to PHP, Part 5, Second Edition
From Everand
Introduction to PHP, Part 5, Second Edition
Adam Majczak
No ratings yet
JavaScript Essentials For Dummies
From Everand
JavaScript Essentials For Dummies
Paul McFedries
No ratings yet
How to a Developers Guide to 4k: Developer edition, #3
From Everand
How to a Developers Guide to 4k: Developer edition, #3
Xinc Cyberwizard
No ratings yet
Web Scraping for SEO with Python
From Everand
Web Scraping for SEO with Python
Enrique Vicente
No ratings yet
Take Your First Steps into Vue.JS
From Everand
Take Your First Steps into Vue.JS
Tom Henricksen
No ratings yet
JavaScript Interview Questions, Answers, and Explanations: JavaScript Certification Review
From Everand
JavaScript Interview Questions, Answers, and Explanations: JavaScript Certification Review
equitypress
No ratings yet
JavaScript Fundamentals: JavaScript Syntax, What JavaScript is Use for in Website Development, JavaScript Variable, Strings, Popup Boxes, JavaScript Objects, Function, and Event Handlers
From Everand
JavaScript Fundamentals: JavaScript Syntax, What JavaScript is Use for in Website Development, JavaScript Variable, Strings, Popup Boxes, JavaScript Objects, Function, and Event Handlers
Steven Bright
No ratings yet
Spring Boot Intermediate Microservices: Resilient Microservices with Spring Boot 2 and Spring Cloud
From Everand
Spring Boot Intermediate Microservices: Resilient Microservices with Spring Boot 2 and Spring Cloud
Jens Boje
No ratings yet
Inspiring Powershell Articles
From Everand
Inspiring Powershell Articles
Murat Yildirimoglu
No ratings yet
JSP-Servlet Interview Questions You'll Most Likely Be Asked
From Everand
JSP-Servlet Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet

Ruby Mechanize Cheat Sheet

Uploaded by

Ruby Mechanize Cheat Sheet

Uploaded by

The Agent

Accessing page elements

Parsing the page content Hello Mechanize! Navigation/History

Version 2010-01-30 (c) 2010 Tobias Grimm

Creative Commons License http://creativecommons.org/licenses/by/3.0

You might also like