SlideShare a Scribd company logo
Dig into the Deep Web Going on a treasure hunt… Wendy Sellors MLWGS, March 2008 By  cameronparkins
How do sources get buried? Requires log-on Accessed by query Protected from robots Public access expired Can’t be bookmarked (scripted, dynamic, etc.) More than 3 clicks deep (He, et al., 2007)
How big is the deep web? Conflicting answers… 91,000 terabytes  (Deep Web, n.d.) 500x larger than surface  (Sullivan as cited in He, et al., 2007) 63%  (He, et al., 2007)  - 95% of web (Dash, 2007)
= 1/3 Note: Alltheweb is powered by Yahoo!
What’s in the treasure chest? Images and multimedia files, including the content of many online exhibits Diaries, letters, historic maps Patents, laws, bills, and treaties Articles licensed/protected by copyright “ Old” newspaper articles (esp. pre-1980’s) Collections of papers, preprints (conference, etc.) Declassified documents, Congressional testimony Government reports, data sets And much more…
Strategies for finding an X, and then digging for… By  merfam
Why search for it? By  brittanyg
Treasure Hunt Mapping Planning Selecting the proper tools Digging
(Concept) mapping Broader concepts Browse down into categories of directories Browse down into archives, databases, etc. Check subject headings of databases using its internal Subject Search, Thesaurus search, etc. Related/narrower concepts Amend your map as you search Note subject headings of valuable finds Note key words in valuable finds
Preparing for the journey Reference Shelf  page of library site Reference articles in ebooks, books General and subject encyclopedias Reference books from Gale PowerSearch – Books tab U.S. and World History RC’s – Reference tab Biography, Science, and Literature RC’s Google Book/Worldcat
Databases Reference shelf Open sources for research Library research wiki – Model UN
 
 
The U.S. and World History RC’s include  American Decades, American Eras, American Journeys, Encyclopedia of World Cultures, History in Dispute, World Eras,  and many more reference sources…
Preparing for the journey Finding reference sources, overviews, etc. Recommended sources from Wikipedia/EB Literature reviews Reference lists Further/recommended reading lists Briefs/reports from Congressional Research Service, State Dept, think tanks, RAND, etc. Finding compilations of sources Search subject +  bibliography, research guide, resource guide, study guide
 
Search for promising book titles in Google Book…
View the table of contents and index, and/or search for term in book. 6. Ancient Fuel for a Modern Inferno: Time Collapse in Bosnia-Herzegovina
 
Tools for the dig MW databases Gale resources Gale PowerSearch Biography Resource Center Expanded Academic ASAP Opposing Viewpoints Resource Center U.S. and World History Resource Centers Declassified Documents (postWW2-1970s) JSTOR & Project MUSE LexisNexis Scholastic
Digging tips Gale Reference sources  Search by broad concept See Reference / Books result tabs Refine by document type Go beyond PowerSearch Follow leads within and between databases Journal/mag/news articles & primary sources Start broad, then narrow Use Advanced search to combine keywords and subject headings
Multimedia includes transcripts, podcasts, photographs, videos, etc.
Digging tips JSTOR and Project MUSE Use Advanced Search In general, limit to articles If looking for books, limit to reviews JSTOR Refine by limiting by discipline and/or date range Project MUSE Refine by combining key word and subject searching May also refine by limiting to  All Except Text
Digging tips LexisNexis Supreme court decisions Laws News General Regional Foreign news sources Wires Transcripts
Tools for the dig VCU databases Browse by topic Search by journal title or article citation Follow  to find in other databases Not in a database? Check online catalog for copies in print, and on microfilm or microfiche VCU is also a government repository VCU library’s resource guides
Tools for the dig Federated search tools e.g. Google Scholar, USAsearch.gov Finding tools =  portals  in library’s bookmarks Directories e.g. Infomine, Intute Finding tools =  directories  in library’s bookmarks Metasearch engines e.g. Clusty, Dogpile Finding tools =  metasearch  in library’s bookmarks

More Related Content

PPTX
PPTX
INTL 190 Libguide
PPTX
Poli102 guide
PPTX
Library Research for Legal Researchers at UCSD
PPTX
SLSguide
PPTX
Chls 335 i_14
PPTX
Honors English - Surface
PPTX
Library 101 82208
INTL 190 Libguide
Poli102 guide
Library Research for Legal Researchers at UCSD
SLSguide
Chls 335 i_14
Honors English - Surface
Library 101 82208

What's hot (20)

PPT
Economics History
PPTX
Anthro 561 2015
PPTX
Poli127 guide (2020)
PPTX
Sls guide2018
PPTX
Poli125 guide
PPTX
PIR advanced information skills 2018
PPTX
Poli151 guide
PPT
Chls 300 spring_2016_serrano-najera
PPT
Speech-Language Pathology, Research Methods 696
PPTX
POLI 122 Library Research Guide
PPTX
Poli153 guide
PPTX
Library Research for Human Rights Guide
PPSX
Types of Resources
PPT
Gradbeginningresearch Fall2008
PPTX
E-Research Strategies - NUF 2013
PPTX
Chls 104 moran_spring_15
PPT
Beginning Research
PPTX
Hank Coleman
PPTX
Types of resources
PPTX
Immigration guide
Economics History
Anthro 561 2015
Poli127 guide (2020)
Sls guide2018
Poli125 guide
PIR advanced information skills 2018
Poli151 guide
Chls 300 spring_2016_serrano-najera
Speech-Language Pathology, Research Methods 696
POLI 122 Library Research Guide
Poli153 guide
Library Research for Human Rights Guide
Types of Resources
Gradbeginningresearch Fall2008
E-Research Strategies - NUF 2013
Chls 104 moran_spring_15
Beginning Research
Hank Coleman
Types of resources
Immigration guide
Ad

Viewers also liked (20)

PPTX
Why do some people believe in myths?
PPTX
World Geography: Sample PowerPoint
PPTX
Internet servers
PPTX
Whats app
PPTX
Left, right or middle
PPS
Power Point Lesson 08 P1
PPT
World Without Wires 2007
PPT
Introduction of Computer Network
PDF
ICANN 51: Thick WHOIS Implementation (working session)
PPTX
E-COMMERCE: The Dark Web
PPT
Infobrokering And Searching The Deep Web
PPT
02 Network Models
PPS
Power Point Lesson 08 P2
PPSX
Making domain name and IP address policy at ICANN
PDF
ICANN 51: DNS Risk Framework
PPT
How the internet works
PDF
What is ICANN? (Russian)
PPS
Power Point Lesson 07 P1
PPT
W 10 introduction to network
Why do some people believe in myths?
World Geography: Sample PowerPoint
Internet servers
Whats app
Left, right or middle
Power Point Lesson 08 P1
World Without Wires 2007
Introduction of Computer Network
ICANN 51: Thick WHOIS Implementation (working session)
E-COMMERCE: The Dark Web
Infobrokering And Searching The Deep Web
02 Network Models
Power Point Lesson 08 P2
Making domain name and IP address policy at ICANN
ICANN 51: DNS Risk Framework
How the internet works
What is ICANN? (Russian)
Power Point Lesson 07 P1
W 10 introduction to network
Ad

Similar to Digging into the Deep Web (20)

PPTX
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
PDF
Internet content as research data
PDF
Digital methods for Social Sciences: origin and definitions
PPTX
Deep Web and Digital Investigations
PPT
Just keep clicking Till You Find It: Building a Library Digital Collection In...
PPT
History On Web Where is it Headed?
PPTX
Presentation Deep Web Technology.pptx
PPTX
Historical methods 2012
PDF
Slides anu talkwebarchivingaug2012
PPT
Rs detective afpl
PPT
Deep Web Presentation April 25
PDF
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
PPTX
Five star resources one star budget
PPT
Why Use The Library
PPTX
Finding Primary Sources and Digital Collections on the Web
PPTX
Searching the Deep Web
PPTX
Internet & Library Use 2022 .pptx
PPTX
MC3Lib-Research-4-FindWebsites
PPTX
Annotated bib and research strategies
PPTX
Writing esl
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ...
Internet content as research data
Digital methods for Social Sciences: origin and definitions
Deep Web and Digital Investigations
Just keep clicking Till You Find It: Building a Library Digital Collection In...
History On Web Where is it Headed?
Presentation Deep Web Technology.pptx
Historical methods 2012
Slides anu talkwebarchivingaug2012
Rs detective afpl
Deep Web Presentation April 25
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
Five star resources one star budget
Why Use The Library
Finding Primary Sources and Digital Collections on the Web
Searching the Deep Web
Internet & Library Use 2022 .pptx
MC3Lib-Research-4-FindWebsites
Annotated bib and research strategies
Writing esl

More from Wendy DeGroat (19)

PPTX
Finding and Citing Online Images & Sources
PPT
Taking Notes with Noodle Tools
PPTX
CMS: Sight Site Cite
PPTX
Welcome to your MW Library
PPTX
Collaboration Tools for UUs
PPTX
Socialstudiesresearch fall09
PPTX
Welcome to the Maggie Walker Library
PPTX
Welcome to the Maggie Walker Library
PPTX
Welcome to the Maggie Walker Library
PPT
Taking Notes With Noodle Tools Mwl
PPTX
MLA: Sight Site Cite
PPT
Conducting a Lit Review
PPT
Online Resources for Research
PPT
To blog or to wiki
PPT
Finding the right words
PPT
MW Library 2.0
PPT
Delicious Tutorial for Students
PPT
Collaboration 2.0
PPT
Are your students college-ready? Start by ensuring they're research ready.
Finding and Citing Online Images & Sources
Taking Notes with Noodle Tools
CMS: Sight Site Cite
Welcome to your MW Library
Collaboration Tools for UUs
Socialstudiesresearch fall09
Welcome to the Maggie Walker Library
Welcome to the Maggie Walker Library
Welcome to the Maggie Walker Library
Taking Notes With Noodle Tools Mwl
MLA: Sight Site Cite
Conducting a Lit Review
Online Resources for Research
To blog or to wiki
Finding the right words
MW Library 2.0
Delicious Tutorial for Students
Collaboration 2.0
Are your students college-ready? Start by ensuring they're research ready.

Recently uploaded (20)

PDF
Transforming Manufacturing operations through Intelligent Integrations
PDF
madgavkar20181017ppt McKinsey Presentation.pdf
PDF
AI And Its Effect On The Evolving IT Sector In Australia - Elevate
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Advanced Soft Computing BINUS July 2025.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
CIFDAQ's Market Wrap: Ethereum Leads, Bitcoin Lags, Institutions Shift
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Big Data Technologies - Introduction.pptx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Chapter 2 Digital Image Fundamentals.pdf
PPT
Teaching material agriculture food technology
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
MYSQL Presentation for SQL database connectivity
PDF
KodekX | Application Modernization Development
PDF
NewMind AI Monthly Chronicles - July 2025
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
Transforming Manufacturing operations through Intelligent Integrations
madgavkar20181017ppt McKinsey Presentation.pdf
AI And Its Effect On The Evolving IT Sector In Australia - Elevate
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Advanced Soft Computing BINUS July 2025.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
CIFDAQ's Market Wrap: Ethereum Leads, Bitcoin Lags, Institutions Shift
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Big Data Technologies - Introduction.pptx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Empathic Computing: Creating Shared Understanding
Chapter 2 Digital Image Fundamentals.pdf
Teaching material agriculture food technology
Understanding_Digital_Forensics_Presentation.pptx
Spectral efficient network and resource selection model in 5G networks
MYSQL Presentation for SQL database connectivity
KodekX | Application Modernization Development
NewMind AI Monthly Chronicles - July 2025
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
“AI and Expert System Decision Support & Business Intelligence Systems”

Digging into the Deep Web

  • 1. Dig into the Deep Web Going on a treasure hunt… Wendy Sellors MLWGS, March 2008 By cameronparkins
  • 2. How do sources get buried? Requires log-on Accessed by query Protected from robots Public access expired Can’t be bookmarked (scripted, dynamic, etc.) More than 3 clicks deep (He, et al., 2007)
  • 3. How big is the deep web? Conflicting answers… 91,000 terabytes (Deep Web, n.d.) 500x larger than surface (Sullivan as cited in He, et al., 2007) 63% (He, et al., 2007) - 95% of web (Dash, 2007)
  • 4. = 1/3 Note: Alltheweb is powered by Yahoo!
  • 5. What’s in the treasure chest? Images and multimedia files, including the content of many online exhibits Diaries, letters, historic maps Patents, laws, bills, and treaties Articles licensed/protected by copyright “ Old” newspaper articles (esp. pre-1980’s) Collections of papers, preprints (conference, etc.) Declassified documents, Congressional testimony Government reports, data sets And much more…
  • 6. Strategies for finding an X, and then digging for… By merfam
  • 7. Why search for it? By brittanyg
  • 8. Treasure Hunt Mapping Planning Selecting the proper tools Digging
  • 9. (Concept) mapping Broader concepts Browse down into categories of directories Browse down into archives, databases, etc. Check subject headings of databases using its internal Subject Search, Thesaurus search, etc. Related/narrower concepts Amend your map as you search Note subject headings of valuable finds Note key words in valuable finds
  • 10. Preparing for the journey Reference Shelf page of library site Reference articles in ebooks, books General and subject encyclopedias Reference books from Gale PowerSearch – Books tab U.S. and World History RC’s – Reference tab Biography, Science, and Literature RC’s Google Book/Worldcat
  • 11. Databases Reference shelf Open sources for research Library research wiki – Model UN
  • 12.  
  • 13.  
  • 14. The U.S. and World History RC’s include American Decades, American Eras, American Journeys, Encyclopedia of World Cultures, History in Dispute, World Eras, and many more reference sources…
  • 15. Preparing for the journey Finding reference sources, overviews, etc. Recommended sources from Wikipedia/EB Literature reviews Reference lists Further/recommended reading lists Briefs/reports from Congressional Research Service, State Dept, think tanks, RAND, etc. Finding compilations of sources Search subject + bibliography, research guide, resource guide, study guide
  • 16.  
  • 17. Search for promising book titles in Google Book…
  • 18. View the table of contents and index, and/or search for term in book. 6. Ancient Fuel for a Modern Inferno: Time Collapse in Bosnia-Herzegovina
  • 19.  
  • 20. Tools for the dig MW databases Gale resources Gale PowerSearch Biography Resource Center Expanded Academic ASAP Opposing Viewpoints Resource Center U.S. and World History Resource Centers Declassified Documents (postWW2-1970s) JSTOR & Project MUSE LexisNexis Scholastic
  • 21. Digging tips Gale Reference sources Search by broad concept See Reference / Books result tabs Refine by document type Go beyond PowerSearch Follow leads within and between databases Journal/mag/news articles & primary sources Start broad, then narrow Use Advanced search to combine keywords and subject headings
  • 22. Multimedia includes transcripts, podcasts, photographs, videos, etc.
  • 23. Digging tips JSTOR and Project MUSE Use Advanced Search In general, limit to articles If looking for books, limit to reviews JSTOR Refine by limiting by discipline and/or date range Project MUSE Refine by combining key word and subject searching May also refine by limiting to All Except Text
  • 24. Digging tips LexisNexis Supreme court decisions Laws News General Regional Foreign news sources Wires Transcripts
  • 25. Tools for the dig VCU databases Browse by topic Search by journal title or article citation Follow to find in other databases Not in a database? Check online catalog for copies in print, and on microfilm or microfiche VCU is also a government repository VCU library’s resource guides
  • 26. Tools for the dig Federated search tools e.g. Google Scholar, USAsearch.gov Finding tools = portals in library’s bookmarks Directories e.g. Infomine, Intute Finding tools = directories in library’s bookmarks Metasearch engines e.g. Clusty, Dogpile Finding tools = metasearch in library’s bookmarks
  • 27. Clusty searches Ask, Gigablast, Live, Open Directory, Wikipedia Dopgile searches Ask, MSN, Yahoo and Google (note: first result = ad)
  • 28. Bookmarks for MW Library - focus on the Finding Tools bundle (on right) or browse by source type or topic tag
  • 29. Tools for the dig Search for search tools Finding tools = portals, directories, metasearch Search subject + directory, portal , or search Social bookmarking sites del.icio.us digg stumbleupon
  • 30. Choose the right Google Google Book Google Scholar Google News Save customizations Specify news source location Available for other countries Set up news alerts Google by country Google custom search engine By shawnbot
  • 31. Surveying the landscape Who cares about this topic? Where do these people live/gather? Research centers Regulating/monitoring agencies Professional/stakeholder organizations Where would their work be collected? Key publications, conferences, databases Historical societies, museums, galleries
  • 32. Search subject + studies, center, research center, conference, library
  • 33.  
  • 34. The elusive X “ X” marks the spot Finding Tools = archives, databases Find Source by Type = statistics, govdocs, etc. Search subject + archive, database, collection, exhibit, statistics , library or desired source type Searching within expert / stakeholder sites e.g. Yale Genocide Program, Amnesty International Use site search feature or view site map Follow leads
  • 35. Even sites of less established, authoritative sources may offer clues…
  • 36.  
  • 37. Try searching Google Book directly (instead of with known title), and click Find this book in a library …
  • 38.  
  • 39. Staying on track Stop periodically See where you are Reflect on what you’ve found/learned Decide where to look next Follow leads Amend your (concept) map Take notes (interacting vs. pasting) Document your sources
  • 41. What questions do you have? MW Library web site http://mwlibrary.wordpress.com (or select Media Center on school’s home page) Navigation menus at top and on left MW Library bookmark account http://del.icio.us/dragonlibrary MW Library research wiki http://mwlibrary.wetpaint.com See Dig into the Deep Web
  • 43. References Dash, R. (2007, April 24). Exposing the invisible web to search engines. Search Engine Journal . Retrieved March 8, 2008, from http://www.searchenginejournal.com/exposing-the-invisible-web-to-search-engines/4771/ Deep web. (n.d.). Wikipedia . Retrieved March 8, 2008, from http://en.wikipedia.org/wiki/Deep_web He, B., Patel, M., Zhang, Z., & Chang, K. (2007). Accessing the deep web. Communications of the ACM 50 (5), 94-101. Retrieved March 8, 2008, from ACM Digital Library.