CH - 01 - Final How Google Works
CH - 01 - Final How Google Works
Google works.
• Trust is at the very core of Google’s major changes and updates the
past several years. Google wants to keep poor-quality, shoddy sites
out of the search results, and keep high-quality, legit sites at the top.
• If your site has high-quality content and backlinks from reputable
sources, your site is more likely to be considered a trustworthy
source, and more likely to rank higher in the search results.
• The "/robots.txt" file is a text file, with one or more records. Usually
contains a single record looking like this:
• User-agent: [Required, one or more per group] The name of a search
engine robot (web crawler software) that the rule applies to. This is
the first line for any rule. Most Google user-agent names are listed in
the Web Robots Database or in the Google list of user agents.
Supports the asterisk (*) wildcard for a path prefix, suffix, or entire
string.
• Disallow: [At least one or more Disallow or Allow entries per rule] A
directory or page, relative to the root domain, that should not be
crawled by the user agent.
• Allow: [At least one or more Disallow or Allow entries per rule] A
directory or page, relative to the root domain, that should be crawled
by the user agent just mentioned.
• A robots.txt file consists of one or more groups, each beginning with a User-agent line that specifies
the target of the groups.
• Here is a file with two group; inline comments explain each group:
• # Block googlebot from example.com/directory1/... and example.com/directory2/...
• # but allow access to directory2/subdirectory1/...
• # All other directories on the site are allowed by default.
• User-agent: googlebot
• Disallow: /directory1/
• Disallow: /directory2/
• Allow: /directory2/subdirectory1/
• # Block the entire site from anothercrawler.
• User-agent:*
• Disallow: /
Chapter one How Google Works 38
Useful robots.txt rules
Rule Sample
Disallow crawling of the entire website. Keep in mind
User-agent: *
that in some situations URLs from the website may still
Disallow: /
be indexed, even if they haven't been crawled.
Disallow crawling of a directory and its contents by User-agent: *
following the directory name with a forward slash. Disallow: /calendar/
Calender and junk folders Disallow: /junk/
User-agent: Googlebot-news
Allow access to a single crawler named (Googlebot- Allow: /
news) User-agent: *
Disallow: /
User-agent: Unnecessarybot
Allow access to all crawlers except the crawler named Disallow: /
(Unnecessarybot) User-agent: *
Allow: /
Disallow crawling of a single webpage named
User-agent: * Disallow: /private_file.html
(private_file.html)
Disallow google crawling of files of a specific file
User-agent: Googlebot Disallow: /*.gif$
type (for example, .gif):
• X.com
• User-agent:Googlebot-news
Allow:
User-agent:*
Disallow:/
Disallow crawling of a single webpage named (private_file.html)
User-agent:*
Disallow:/private_file.html
Disallow crawling of files of a specific file type such as pdf
User-agent:*
Disallow:/*.pdf$
Chapter one How Google Works 40
2. Page Speed (Including Mobile
Page Speed)
• Page speed has been cited as one of the main SEO ranking factors for years.
Google wants to improve users’ experience of the web, and fast-loading
web pages will definitely do that.
• Google announced a search engine algorithm update focused on mobile
page speed that will start to affect sites from July 2018. If your site doesn’t
load fast on mobile devices, then it could be penalized.
• Accelerated Mobile Pages (AMP). AMP gives web pages a lightning speed to
load on users mobile devices. Faster the speed, higher the rank and the
more chances that users will get to see content in less time. Apart from
improving SERP, fast loading speed will also reduce Bouncing Rate.
• Use Google’s mobile testing tool to see how your site stacks up.
Chapter one How Google Works 41
3. Mobile Friendliness
• While we’re on the subject of mobile, mobile-friendliness is another
major SEO ranking factor. More people use mobile devices than
desktops to access the web, and that’s one reason there’ve been
changes in how Google ranks search results.
• Things to look at include:
• Whether you have a responsive site that automatically resizes to fit the device
Eg. www.m.example.com
• Whether you’re using large fonts for easy readability on a small screen
• Accessibility and navigability, including making it easy to tap menus
• Ensuring that essential content isn’t hidden by ads
• As we said at the start, the web is built on links, so naturally, links are
a crucial SEO ranking signal. There are three kinds of links to think
about:
• Inbound links
• Outbound links
• Internal links