Smarter Internet Searching Guide Introduction To Internet Searching
The guide is currently made up of two parts. The different parts of each topic are listed on the home page of each guide. Use the arrows to go forwards and backwards through the different sub-sections of each topic. Within each topic there may be several individual screens, so there is never too much to read on one screen. Each topic lists its screens at the start. For this topic they are:
Search icons
Fast swapping between windows
Refer to the top of the page to keep track of where you are in the guide.
Try It! For example, if you look at the top of this page you will see:
Smarter Searching Guide: 1. Introduction to Internet searching
Topic: 2. How the Guides work
Screen: 1 of 1
Now try moving to the next page and you will see the header change.
>> Search icons
On each page you will see these icons:
Try it!:
This will usually be an instruction describing the activity and a link to click on such as: Visit Searchenginewatch.com, the best site to learn about how search engines work for both searchers and marketers. www.searchenginewatch.com Results:
This shows what you should see or gives an answer to a question. Activity:
These are to make the guides more interactive and to test learning. Search expert's tip
This summarises a handy tip for saving time or highlights a good information source. Here is one to get you started:
>> Swapping between different windows
You will currently use the Windows taskbar at the bottom of the screen to change between different applications, such as Microsoft Word and Internet Explorer, by clicking on them. With these Guides you will be swapping a lot between Internet Explorer windows. To save time, a useful keyboard shortcut known as fast Alt-tabbing is to press these two keys at the same time: the Alt key (usually to the left of the spacebar) and the Tab key (usually above the Shift key).
Try it!: Hold down the Alt key and press the Tab key, then release Tab. Keeping Alt held down, press Tab again to step through the programs or documents you have loaded until you find the one you want, then release Alt.
More detailed information for advanced users, and background information if you'd like to know more.
This can be broken down into:
http:// This tells the web browser to retrieve a web address using a data transfer standard known as the Hypertext Transfer Protocol (HTTP). You can save time by omitting this in most browsers (as long as the site address starts with www).
www. Most web sites have this prefix, but it is sometimes not present if content on a separate server is referred to (for example http://news.bbc.co.uk).
bbc This is the name of the site.
.co.uk This is known as the top level domain name (more on these in a moment).
index.html This is the name of the individual page. When the name of the page is index.html or index.htm it can be omitted, since the web server supplies this page automatically. Sub-folders or directories also contain a file index.html to display their content, for example www.bbc.co.uk/sport.
Search expert's tip: You don't need to type in http:// or index.html; they are supplied automatically. So in this case all that needs to be typed in is: www.bbc.co.uk
Try it! Type www.bbc.co.uk into the address bar of a browser:
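The breakdown of a web address described above can be tried out with a short Python sketch. It uses the standard library function urlsplit; the function name describe_url is made up for this illustration, and treating index.html as the default page name is an assumption taken from the text (real web servers may be configured differently).

```python
from urllib.parse import urlsplit

def describe_url(url):
    """Split a web address into the parts discussed in this topic."""
    # If the protocol was left out, add http:// as a browser would.
    if "://" not in url:
        url = "http://" + url
    parts = urlsplit(url)
    # An empty path or one ending in "/" refers to a folder's default page.
    # Assumption: the default page is called index.html, as in the text.
    path = parts.path or "/"
    if path.endswith("/"):
        path += "index.html"
    return {
        "protocol": parts.scheme,
        "host": parts.hostname,
        "page": path,
    }

if __name__ == "__main__":
    print(describe_url("www.bbc.co.uk"))
```

Typing just www.bbc.co.uk gives the same three parts as the full address with http:// and index.html written out.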
What you will see: The BBC site will be loaded. On other pages in the site, the address will be more complicated. This is because the BBC site, like many large sites, uses a Content Management System (CMS) to make it easy to edit and publish pages.
>> Top level domains: where in the world is that web site?
As you surf around the Internet by clicking on links, you will see the web address change. As well as the name of the site changing, such as from BBC to Google, you will also see the last part of the main name change; this is the global top level domain. It is useful to monitor the top level domain name since it helps assess the quality of the data. Common global top level domains are:
.com represents an international or American company such as www.3m.com
.co.uk represents a company based in the UK such as www.thomascook.co.uk
.edu or .ac.uk represents a US university such as www.mit.edu or a UK-based university (e.g. www.leeds.ac.uk)
.org.uk or .org are not-for-profit organizations or trade associations (e.g. www.foldoc.org, a UK web site with definitions of information technology terms)
.gov.uk or .gov are government sites such as www.statistics.gov.uk
.net represents a network provider such as www.virgin.net
.int represents an international site, e.g. the EC site www.europa.eu.int
.info is a new domain introduced in 2001 which is starting to be used for informational sites
Search expert's tip: Sites with domain names of .gov, .org or .ac.uk are usually relatively independent and are often good sources of detailed, unbiased information.
Activity: Place the elements of this web address for the National Institute for Clinical Excellence (NICE) in order.
<correct order>
http:// - web address identifier part 1
www. - web address identifier part 2
nice. - main domain name
org. - first part of top level domain
.uk/ - second part of top level domain
index.html - home page
>> History and Development of the web
Purpose: This Find Out More topic is a straightforward introduction to the Internet and World Wide Web. You may know this already, but there is also some trivia which may be useful to store up for a pub quiz.
Screens for this topic:
The History of the Internet and World Wide Web
How is the World Wide Web structured?
How are World Wide Web sites labeled?
Who uses the Internet in the UK? And what are they looking at?
>> The History of the Internet and World Wide Web
It's generally known that the Internet is a global communication network linking millions of separate computers on different networks. It can be traced back to 1958, in the days of the Cold War and space race, when Americans were concerned about defence after the Soviet Union launched its Sputnik satellite the previous year. They wanted a network to connect military bases and academic institutions which would still function if some parts of the network were destroyed. It has long been used for e-mail, with the first being sent in 1971 and the Queen sending one in 1976, making Her Majesty one of the most experienced UK Internet users. The Internet didn't really take off as a consumer and business tool until the early 1990s, after British scientist Tim Berners-Lee invented the World Wide Web (WWW) as a way of publishing and sharing information amongst scientists at the CERN lab in Geneva. To see a timeline of the development of the Internet that summarises its growth visit: http://www.zakon.org/robert/internet/timeline/
>> Who is using the Internet in the UK?
Let's now look at some examples of different sites. Say we want to know what percentage of the UK adult population use the Internet.
To answer this we can turn to the research agency MORI, whose e-MORI unit publishes a monthly update on who is using the Internet in the UK.
Try it! First go to the MORI web site by clicking on this link, http://www.mori.com/emori, and then select e-MORI Technology Tracker Survey. This hyperlink will take you to this page: http://www.mori.com/emori/tracker.shtml
You can see that the web page tracker.shtml is a combination of text and graphics summarizing current Internet usage.
>> ...and what are they looking at?
Nielsen is the best source for finding out the most popular online sites. Nielsen NetRatings uses a panel of volunteers whose PCs are monitored to see who is watching what. It publishes the top 10 sites and their visitors.
Try it! Find out the top 3 UK sites: Click on http://www.nielsen-netratings.com and choose Press Centre.
>> Moving backwards and forwards between pages This uses the prominent toolbar buttons showing left and right arrows.
The option is then available either to add the Favorite to an existing folder or to create a new folder to save Favorites to. To add a Favorite to an existing folder, select the folder by clicking on it and then click the OK button. To create a new folder to save Favorites to, click on the New Folder button, enter the name of the new folder and then click the OK button.
Range of services: search engines, directories, news recruitment, personal information management, shopping, etc.
A range of resources are available, but not as good as specialist vertical portals for specific information
3. Vertical portal
May cover a single function such as search or news or a particular industry sector
Google (www.google.com) Construction Weekly a vertical portal for engineers, based on a trade magazine: www.constructionweekly.com Silicon (www.silicon.com) A
Google is the best search engine since it focuses on excelling at search. Specialist industry sites are often the best place to start looking for sector-specific research and content.
Marketing Online (www.marketing-online.co.uk) a portal providing content about e-marketing for students and professionals.
Portals created for your own country may be best, but you may miss out on more detailed information in other countries, so these should be used with care.
vertical
Countyweb for UK regions (www.countyweb.com) OMNI a UK portal for health and medical resources (www.omni.ac.uk)
5. Meta portal
These sites can be useful to set to your home page since they list all the main search and news portals and can let you search several search engines from one place.
>> More portals for research (Find out more topic)
It also helps to think about these types of portal; they are often rich sources of specialist information.
Trade association sites
These often commission their own research for their members, which may be available free of charge.
International Telecommunication Union: data on Internet access in every country worldwide. www.ITU.int
Government sites
Governments commission detailed research which is freely available on a range of topics.
The Office for National Statistics focuses on social and economic statistics. www.statistics.gov.uk
Market research aggregators
Some sites have been created to sell data and services to market researchers. These can contain free research since these sites also make their money through advertising. Sometimes paying for the good stuff may be the best way.
MRWeb: www.mrweb.co.uk
Freepint, a community for UK search specialists: www.freepint.com
>> Meta portals
Meta portals are portals which link to other portals. They give a high-level view of the different portals, including specialist search engines.
SearchEngineGuide: http://www.searchengineguide.com/searchengines.html The most comprehensive list of specialist search engines, arranged in a Yahoo-like directory structure.
Wonderport: http://www.wonderport.com/ukindex.html A useful list of the main portals that are likely to be useful.
>> Meta search engines
Meta search engines pass keywords on to several different search engines (but not Google). Results are prioritised according to popularity and duplicates are removed. Ask Jeeves is the best known search engine that uses this approach. The main meta search engines are listed here: http://www.searchenginewatch.com/links/metacrawlers.html
Try it! Go to IXQuick, one of the best metasearch engines, and type healthcare portal: http://ixquick.com/do/metasearch.pl?cat=web&cat=web&cmd=process_search&language=english&query=healthcare+portal
Results: You will see that the top results are those sites which are highly ranked at several search engines, such as www.healthlinks.net.
Expert search tip: Some search professionals prefer downloadable software search tools, which are usually integrated with web browsers and also link to the main search engines.
Copernic.com (www.copernic.com)
WebFerret (www.ferretsoft.com)
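The way a meta search engine combines results, as described in this topic, can be illustrated with a small Python sketch. This is only one plausible reading of "prioritised according to popularity": pages listed by more engines come first, and ties are broken by the best position the page reached. The engine names and the function merge_results are hypothetical; real meta search engines use their own scoring.

```python
from collections import defaultdict

def merge_results(results_by_engine):
    """Combine ranked lists of addresses from several engines into one list."""
    engines_listing = defaultdict(int)  # how many engines returned each address
    best_position = {}                  # best (lowest) position seen per address
    for addresses in results_by_engine.values():
        seen = set()
        for position, address in enumerate(addresses):
            # Treat addresses differing only in case or a trailing "/" as duplicates.
            key = address.lower().rstrip("/")
            if key in seen:
                continue
            seen.add(key)
            engines_listing[key] += 1
            best_position[key] = min(position, best_position.get(key, position))
    # Most widely listed first, then best position, then alphabetical order.
    return sorted(
        engines_listing,
        key=lambda a: (-engines_listing[a], best_position[a], a),
    )

if __name__ == "__main__":
    sample = {
        "engine_a": ["www.healthlinks.net", "www.example.org"],
        "engine_b": ["www.example.com", "www.healthlinks.net/"],
    }
    print(merge_results(sample))
```

In the sample data the address listed by both engines rises to the top, which mirrors the IXQuick result described above.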
Search engines compile an index of keywords on web pages by regularly sending out automatic software tools known as spiders or robots to crawl around sites that are registered with that search engine. The spider compiles an index containing every word on every page against the page address. It weights the index according to different parameters and then stores the index as part of a database on a web server. This index, rather than the whole web, is what is searched when you type keywords into the search engine.
Find out more: For more information on how search engines work technically, see: http://computer.howstuffworks.com/search-engine1.htm
For more on why search engines rank some web sites higher and how site owners optimize their sites to appear higher than others, see: http://www.searchenginewatch.com/webmasters
>> What does the search results listing show?
After you type in your keywords, a list of matching pages (sometimes called hits) will be displayed in order of relevance. In Google, these list:
The title of the page (you can click on this to go through to the page)
An excerpt of the page containing the words you have typed in
The page URL (you can also click on this) and the date published to the web
Try it! Clicking on this link shows you the results of a search engine query.
1. This example shows how to find the BBC news web site using the keywords BBC and news: http://www.google.com/search?q=BBC+news
Results: You will see that the BBC news site is at the top of the list.
Search expert's tip: Adding the name or abbreviation of a country, such as UK, to your search keywords can help find UK-specific information or portals.
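The idea of an index that stores every word against the page address can be sketched in a few lines of Python. This is a minimal illustration only: the page addresses and text are invented, the function names are hypothetical, and the weighting and ranking that real search engines apply are left out.

```python
import re
from collections import defaultdict

def build_index(pages):
    """Map every word to the set of page addresses that contain it."""
    index = defaultdict(set)
    for address, text in pages.items():
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            index[word].add(address)
    return index

def search(index, query):
    """Return the addresses of pages containing all of the query keywords."""
    words = re.findall(r"[a-z0-9]+", query.lower())
    if not words:
        return set()
    matches = set(index.get(words[0], set()))
    for word in words[1:]:
        matches &= index.get(word, set())
    return matches

if __name__ == "__main__":
    pages = {
        "news.example/1": "BBC news headlines",
        "sport.example/2": "BBC sport results",
    }
    print(search(build_index(pages), "BBC news"))
```

Note that the search looks only at the index, never at the pages themselves, which is why a page that has not been crawled cannot be found.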
6. What is a directory?
What is it? Web directories or catalogues are constructed and presented differently to search engines. Directories are not constructed automatically by robots and spiders, but are human generated. A human being will place each reference to a site in a category. After you submit your URL to a site such as Yahoo!, it will be reviewed by someone and then included if it is thought to be of a suitable standard. A disadvantage of directories is that they do not give comprehensive access to all web pages. When you search a directory, you are not searching the entire web, but the list of company names, categories and, for Yahoo!, the 25 word description of the site.
Search expert's tip: Most search engines such as Yahoo! or Google include both a search engine facility and a directory component.
Try It: If you go to Yahoo! (www.yahoo.co.uk) you will see a search box at the top. Type in: BBC News.
Results: You will see a list titled:
Search expert's tip: Only use directories if you can't find what you want using a search engine. You may find useful information or portals in this way, but it is less likely.
The problem with directories is that most sites are in a single or limited number of categories, so that if you go to a specialist category there are very few sites listed there. Google also offers a directory at http://directory.google.com, but it suffers from a similar problem of a limited number of sites, which tend to be mainly US oriented. See for example: http://directory.google.com/Top/Business/Healthcare/Nursing/
Activity: True / False questions
1. A directory gives a more complete view of pages on the web than a search engine.
Ans: False. Search engines index every page of each registered site, whereas directories only list a site's name and a brief description.
2. Search engines rank sites found against keywords according to the quality of information.
Ans: False. It depends on how the site has been optimized for search engines. The main criterion is the number of times a keyword is found, which may or may not indicate quality information.
3. A portal is the same as a search engine.
Ans: False. A search engine is one service of a portal; others may include news and directory services.
To make the invisible web visible, use search engines to find the database. For example, use database in the search keyphrase. Searching for .NSF (the Lotus Notes database format) can also be used to find documents in this specific type of database.
First use Google's special Advanced search page. Second learn the special Google codes or syntax that produce the advanced search results. Once familiar with them, you can type the codes into the Google Search box.
Try it! Let's take an example: A student or information systems manager is researching information on outsourcing information systems activities. They want to know about best practice for this area. In the next topic, we will look at a structured method for building up the best keywords to answer this question, but for now let's say the information we want must:
Refer to outsourcing
Specifically refer to information systems outsourcing
Refer to best practice or benchmarking in outsourcing
Exclude reference to industry or trade associations
Go to Google Advanced search (www.google.com/advanced) and then follow the stages in the four screens that follow. Google gives these main options for advanced search:
>> 1. All the Words
>> 2. Exact phrase
>> 3. With at least one of the words
>> 4. Excluding the words
>> 1. All the Words
Meaning: The word MUST be present on the page.
Equivalent code: +, AND (this is not essential since this is the default in Google)
Example: +ulcers
Boolean: AND
Try It!: Just type outsourcing into the "with all of the words" box of www.google.com/advanced
Results: There will be hundreds of thousands of results.
>> 2. Exact phrase
Meaning: The exact phrase MUST be present on the page.
Equivalent code: Single or double quotes
Example: 'wound treatment', "wound treatment"
Try It!: Now add information systems into the "with the exact phrase" box of www.google.com/advanced
Results: There are still hundreds of thousands of results. The top ones combine outsourcing and information systems, and these keywords are in the titles of the pages.
>> 3. With at least one of the words
Meaning: It is not essential that the word is on the page, but if it is, the resulting page will be ranked higher.
Equivalent code: OR (this must be typed explicitly, since AND rather than OR is the default in Google)
Example: steps
Boolean: OR
Try It!: Now add the two words practice benchmarking into the "with at least one of the words" box of www.google.com/advanced
Results: There are now around 300 results. The top ones now refer to practice or benchmarking.
>> 4. Excluding the words
Meaning: Excludes the specified word(s).
Equivalent code: - (minus sign)
Example: -venous
Boolean: NOT
Try It!: Now add association into the "without the words" box of www.google.com/advanced
Results: The number of results is now reduced. All pages that refer to association are now excluded.
Expert searcher's tip: Note that in Google, parentheses or brackets are unnecessary to combine search items. Google also does not support truncation (sometimes called stemming), where you add a star to the start of a word to match different endings, so you would have to use wound OR wounds rather than woun*, which is supported on some search engines.
If you were to type this straight into the Google search box, you would type:
outsourcing practice OR benchmarking "information systems" -association
or for clarity:
+outsourcing practice OR benchmarking "information systems" -association
Searcher's tip: Quotes are an excellent way of narrowing down the search in complex searches. They are also needed to include what Google terms stop words, which are automatically excluded from the search but are sometimes useful to define a phrase, e.g. and, or, the.
Activity:
1. Match these features of Google advanced search to the codes that can be typed into Google to refine a search:
Advanced search option: Code
With all the words: +
With exact phrase: Quotes
With at least one of the words: OR
Without the words: -
2. Which is the best search syntax to find out about trends in childhood asthma in the UK from UK government sites?
a) child children asthma number of cases +site:gov.uk
b) child children asthma number of cases +site:gov.uk
c) child OR children asthma "number of cases" +site:gov.uk
d) childhood asthma "number of cases" +site:gov.uk
Correct answer: c) This returns an article from the Office for National Statistics titled: New episodes of asthma: by sex and age, 1986 to 2001: Social Trends 33
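The four advanced search options and their equivalent codes, as covered in this topic, can be put together in a short Python sketch. The function names build_query and search_url are made up for this illustration; the search address follows the form of the Google link used earlier in this guide, and urlencode from the standard library converts spaces and quotes into a form that is safe to use in a link.

```python
from urllib.parse import urlencode

def build_query(all_words=(), any_words=(), exact_phrase="", without_words=()):
    """Turn the four advanced search boxes into one typed query string."""
    terms = list(all_words)                 # words that must be present
    if any_words:
        terms.append(" OR ".join(any_words))  # at least one of these words
    if exact_phrase:
        terms.append('"' + exact_phrase + '"')  # exact phrase in quotes
    terms.extend("-" + word for word in without_words)  # excluded words
    return " ".join(terms)

def search_url(query):
    """Build a search link in the same form as the example links in this guide."""
    return "http://www.google.com/search?" + urlencode({"q": query})

if __name__ == "__main__":
    query = build_query(
        all_words=["outsourcing"],
        any_words=["practice", "benchmarking"],
        exact_phrase="information systems",
        without_words=["association"],
    )
    print(query)
    print(search_url(query))
```

Filling in the four boxes with the values from the outsourcing example gives the same query string as the one typed directly into the search box.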
included near the top of the ranking because of the number of words they contain.
Microsoft Word (.doc): again, these could be reports or articles that have been posted to the web in this format.
Microsoft PowerPoint (.ppt): often useful for summarizing approaches, or very useful if you have to prepare a presentation on a topic!
>> Specify occurrence of keywords on web page
The options include:
In the title of the page. For example, intitle:outsourcing will look for web pages that have outsourcing in the title.
In the URL of the page. For example, allinurl:outsourcing
Search expert's tip: Using these "specify occurrences" options is not normally necessary since Google uses the best match anyway. They can be useful for finding portals, for example allinurl:outsourcing portal.
>> Filter results to specific domain(s) (site:)
This is useful if you want to show results for sites with UK registered domains only, such as .co.uk, .gov.uk and .ac.uk. To do this add +site:.uk to your query. Domain filtering can also be useful if you want to display all the pages on a site which contain a key phrase. For example, we could search the archives of a site: "information systems" site:www.ismbc.org
Search expert's tip: 'Site' can also be used to limit your search to domains or specific sites. For example, if you know you want information from .gov or .org sites only, you can make your request more specific by adding the site: qualifier to your search words. For example, site:.ac.uk "information systems" outsourcing will only return information from UK academic (.ac.uk) web sites.
>> Date
This can be used to specify pages updated in the last 3, 6 or 12 months.
Search experts tip: Unfortunately this does not necessarily give up-to-date articles or research since older pages may have been refreshed recently even though the content stays the same.
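To make the effect of the site: qualifier concrete, here is a small Python sketch that decides whether a web address falls inside a given domain. It illustrates the idea only and is not how Google implements the filter; the function name matches_site is hypothetical.

```python
from urllib.parse import urlsplit

def matches_site(url, site):
    """Return True if the address belongs to the domain given after site:."""
    if "://" not in url:
        url = "http://" + url
    host = (urlsplit(url).hostname or "").lower()
    domain = site.lower().lstrip(".")
    # Match the domain itself or any host that ends with "." + domain,
    # so that .uk matches www.leeds.ac.uk but not www.uk.example.com.
    return host == domain or host.endswith("." + domain)

if __name__ == "__main__":
    for address in ["www.leeds.ac.uk", "www.mit.edu", "www.statistics.gov.uk"]:
        print(address, matches_site(address, ".uk"))
```

The same check works for a whole country domain such as .uk, a sector such as .ac.uk, or a single site such as www.ismbc.org.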
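Subject: Outsourcing
Specific type of subject: Information systems
Application: Contracts
Type of information needed: Best practice
Likely source or publication type: Academic site or organisation site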
Search experts tip: Think like the captioner. Think how the title of the page or caption on a figure or table would be labeled by its authors. Also think how the search terms might be referred to in a sentence.
For example:
A page title: Outsourcing IS agreements
Table caption: Summary of items to include in an IS outsourcing contract
Body text: In this article we review best practice for outsourcing information systems. Key items to include in the contract are:
>> Step 3. Think laterally: identify synonyms and alternative terms
Example:
Subject: Outsourcing
Specific type of subject: Information systems (alternative terms: IT, ICT)
Application: Contracts (alternative terms: Legal agreements)
Type of information needed: Best practice (alternative terms: Benchmarking guidelines)
Likely source or publication type: Academic site, teaching hospital or government (filter using +site:.ac.uk or +site:.edu or +site:.org)
>> Step 4. Combine different search concepts
The next step is to review the different keywords you have generated to identify the search query:
Which will commonly occur in a phrase, e.g. information systems
Which are essential, e.g. contracts
Which are alternatives, e.g. best practice OR benchmarking
A Google search query string can then be devised.
Try it: +outsourcing "information systems" contract OR legal site:.ac.uk (see also the previous topic on using Google Advanced Search)
Activity: Go to Google and try these out.
>> Information source strategies
Your searching strategy will naturally depend on the type of information you are looking for. So don't only think of the type of information; also think of the type of source you are using. Examples of information sources include:
- Published research data (academic or government)
- Published research report
- Market information
- Online news article
- Supplier info
- Company information (information about a particular supplier)
- Customer information finding
- Product information (finding the best product for purpose)
Activity: Put these stages of refining a search in order:
<correct answer>
Quick 2 or 3 keyword search
Break down search into themes
Find alternative words
Structure complex search using quotes and +
Try filtering to UK government sites using +site:.gov.uk
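Steps 3 and 4 can also be sketched in Python: each search concept is given as a list of alternative terms, multi-word terms are put in quotes, alternatives are joined with OR, and an optional site filter is added at the end. The function names are hypothetical and this is just one way of assembling the query string, not a definitive method.

```python
def quote_if_phrase(term):
    """Put quotes around a term made of more than one word."""
    return '"' + term + '"' if " " in term else term

def combine_concepts(concepts, site=None):
    """Build a query from concepts, each a list of alternative terms."""
    parts = [
        " OR ".join(quote_if_phrase(term) for term in alternatives)
        for alternatives in concepts
        if alternatives
    ]
    if site:
        parts.append("site:" + site)
    return " ".join(parts)

if __name__ == "__main__":
    concepts = [
        ["outsourcing"],            # subject
        ["information systems"],    # specific type of subject
        ["contract", "legal"],      # application and an alternative term
    ]
    print(combine_concepts(concepts, site=".ac.uk"))
```

With the concepts from the worked example this produces the same query as the Try it string, apart from the optional leading + sign.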
4. Google extras and must haves Additional tools to make you more productive
Google has several tools which can make you more productive. Of these the first two are most useful:
Google Toolbar
Google Glossary
Google Answers
Google Sets
Tools to assess value of web pages
The Google toolbar is a must-have add-in to Microsoft Internet Explorer which enables you to start searching using Google.com any time you have your browser open, without needing to go back to the Google site each time to type in keywords. The Google toolbar also records previous searches and includes advanced search facilities.
Expert searcher's tip: This tool will save you a lot of time. If you haven't already got it, go to http://toolbar.google.com and follow the instructions to download.
>> Google Glossary
The glossary is one of several tools being developed in the Google labs. It is great for learning about new topics. Type in a keyword and a range of definitions is provided from different sites.
Try It! http://labs.google.com/glossary?q=back-office
>> Google Answers
If you're really short of time and have ready cash, you can get a more expert searcher to find information for as little as $2 per question. You can browse previous questions and their answers which may, in fact, answer your question. http://answers.google.com
>> Google Sets
This intriguing technology finds related terms, which can be useful for identifying unknown members of a set. http://labs.google.com/sets
Try typing in the names of three supermarkets and then press the button to return a larger set; related supermarkets are then displayed as if by magic. Practical applications may be limited, but this can be used to find related information or keywords about a topic.
>> Tools to help assess value of pages
Google Webquotes gives ratings from other sites to help determine whether the items are worth clicking through to: http://labs.google.com/cgi-bin/webquotes
Google Viewer gives previews of all pages within the search engine: http://labs.google.com/gviewer.html