Web Scraping
Collecting Data
From Websites
TRMG
April 16, 2012
Agenda
Part 1: Technology
How do web pages work?
Why would you want data from a web page?
What are your options for web page data?
Technology
Part 1
[Figure: a price on a web page ($549.95) with labels Font, Size, Color, Location, XML, JavaScript]
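To make the idea concrete, here is a minimal sketch of pulling a displayed price out of page markup with Python's standard library. The HTML fragment, the `price` class name, and the helper names are all assumptions made for illustration; real pages will be structured differently.

```python
from decimal import Decimal
from html.parser import HTMLParser

# Hypothetical page fragment; the markup and class names are assumptions.
SAMPLE_HTML = """
<div class="product">
  <span class="name">Widget</span>
  <span class="price" style="color:red;font-size:14px">$549.95</span>
</div>
"""


class PriceParser(HTMLParser):
    """Collects the text of every element whose class attribute is 'price'."""

    def __init__(self):
        super().__init__()
        self._in_price = False
        self.prices = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs
        if dict(attrs).get("class") == "price":
            self._in_price = True

    def handle_endtag(self, tag):
        # Simplification: any closing tag ends the current price element
        self._in_price = False

    def handle_data(self, data):
        if self._in_price and data.strip():
            self.prices.append(data.strip())


def extract_prices(html):
    """Return the price strings found in the given HTML text."""
    parser = PriceParser()
    parser.feed(html)
    return parser.prices


def to_amount(price_text):
    """Convert text such as '$549.95' to a Decimal amount."""
    return Decimal(price_text.replace("$", "").replace(",", ""))


if __name__ == "__main__":
    for text in extract_prices(SAMPLE_HTML):
        print(text, to_amount(text))
```

The point of the sketch is that the browser shows only the price, while the page source also carries the font, size, and color around it; a scraper has to pick the value out of that surrounding markup.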
Browser add-in
Defeated by modern websites (JavaScript, etc.)
Limited targets, can't integrate applications
Making a Decision?
How many sites? How much data?
Are updates frequent? Are new sites added often?
How complex are the sites?
Login
Navigation
Multiple pages
AJAX
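As one illustration of the "Multiple pages" case, a scraper has to keep following the navigation link until no further page exists. The sketch below assumes hypothetical markup (list items plus a `rel="next"` link) and made-up invoice numbers; the `PAGES` dictionary stands in for real HTTP requests, and login and AJAX handling are deliberately left out.

```python
from html.parser import HTMLParser

# Hypothetical in-memory "site" standing in for real HTTP responses.
PAGES = {
    "/invoices?page=1": (
        "<ul><li>INV-001</li><li>INV-002</li></ul>"
        '<a rel="next" href="/invoices?page=2">Next</a>'
    ),
    "/invoices?page=2": "<ul><li>INV-003</li></ul>",
}


class ListPageParser(HTMLParser):
    """Collects <li> text and the href of a rel="next" link, if present."""

    def __init__(self):
        super().__init__()
        self.items = []
        self.next_url = None
        self._in_item = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "li":
            self._in_item = True
        elif tag == "a" and attrs.get("rel") == "next":
            self.next_url = attrs.get("href")

    def handle_endtag(self, tag):
        if tag == "li":
            self._in_item = False

    def handle_data(self, data):
        if self._in_item and data.strip():
            self.items.append(data.strip())


def scrape_all_pages(start_url, fetch, max_pages=50):
    """Follow 'next' links from start_url, returning all items in order.

    fetch is any callable mapping a URL to HTML text. The seen set and
    max_pages guard against navigation loops.
    """
    items, url, seen = [], start_url, set()
    while url and url not in seen and len(seen) < max_pages:
        seen.add(url)
        parser = ListPageParser()
        parser.feed(fetch(url))
        items.extend(parser.items)
        url = parser.next_url
    return items


if __name__ == "__main__":
    print(scrape_all_pages("/invoices?page=1", PAGES.__getitem__))
```

Passing `fetch` in as a parameter keeps the navigation logic separate from how pages are retrieved, so the same loop could be pointed at a real HTTP client later.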
Web Extraction for Credit and Collections Operations
Part 2
Employees logging on and off of sites daily.
Manual entry.
[Figure label: My Employee]
Automation could reduce your labor associated with this by 85-100%
Check Numbers
Acknowledgement of Receipt
Amount Paid
Dispute Information
2. Acknowledgement of Receipt
Prime Point of Data Gathering
Approximately 27 Day Gain!!
A/P
A/R
Carrier Perspective:
Using Technology to Increase
Productivity and Avoid Costs
The Situation
Credit & Collections
Seven collectors handling over 1,500 accounts using 3rd party freight payment agencies
Over 70 different websites with different logins and navigation
Manual process to research disputes and transfer payment information to commercial C&C software
Customer Service
3 processes built with several in queue
Approximately $190,000 labor savings per year for these 3 accounts alone.
Overall Benefits
Credit & Collections (both solutions)
Improved productivity by reducing website inquiries
Shortened time for resolving rejects or disputes due to earlier notification
Streamlined processes by routing rejects to the appropriate resolvers
Allows for better analysis of payment patterns and disputes
Ability to establish robots to capture data for large customers not using freight payment agencies
Overall Benefits
Customer Service
Reduced labor by automating managed accounts processes
Reduced complexity by streamlining processes
Reduced errors and improved timeliness of updates through customization
Potential to increase revenue by taking on more managed accounts without adding staff
Other Uses
Completed
Credit & Collections Software Upgrade
State Tax Forms
Rate Web Probe
Future
Canadian Customs Form
ECM (TruckLoad) Customer Service
Carrier Perspective:
Web Scraping
CHAOS
[Diagram: A/R Analysts, Payer Websites, Customer Website]
[Diagram: Open Invoice, Data Capture, Data, Payer Websites]
Note: Today, analysts use VLOOKUP in Excel to update invoice status on master agings.
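The VLOOKUP step described in the note amounts to an exact-match lookup keyed on invoice number. A minimal sketch of the same idea in Python follows; the field names (`invoice`, `status`), the sample values, and the function name are assumptions for illustration, not the presenters' actual process.

```python
def update_aging(aging_rows, scraped_status):
    """Return copies of the aging rows with status filled in from scraped data.

    aging_rows: list of dicts, each with at least an "invoice" key.
    scraped_status: dict mapping invoice number to the status seen on the
    payer website. Invoices missing from the scrape keep their current
    status, or "UNKNOWN" if they have none.
    """
    updated = []
    for row in aging_rows:
        new_row = dict(row)  # copy so the master aging input is not modified
        fallback = row.get("status", "UNKNOWN")
        new_row["status"] = scraped_status.get(row["invoice"], fallback)
        updated.append(new_row)
    return updated


if __name__ == "__main__":
    # Hypothetical sample data
    master_aging = [
        {"invoice": "1001", "status": "OPEN"},
        {"invoice": "1002", "status": "OPEN"},
    ]
    scraped = {"1001": "PAID"}
    for row in update_aging(master_aging, scraped):
        print(row["invoice"], row["status"])
```

Returning new rows rather than editing in place makes it easy to compare before and after, which is useful when checking scrape results against the master aging.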
Payer Website
In development feed of
web scrape results directly
to Collection Software
[Diagram: Payer Websites and A/R Analysts]
Questions
Jeff Jones
Gallium Technologies
Diana Early
PITT OHIO
[email protected]
Cindy Douglass
Swift Transportation
[email protected]