<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>WebReach, Cavan, Ireland &#187; search engine spiders</title>
	<atom:link href="http://webreach.ie/blog/tag/search-engine-spiders/feed/" rel="self" type="application/rss+xml" />
	<link>http://webreach.ie</link>
	<description>SEO and Internet Marketing, Ireland</description>
	<lastBuildDate>Tue, 26 Jan 2010 22:51:55 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Google Analytics Disadvantages &#8211; It Can&#8217;t Track Website Crawlers</title>
		<link>http://webreach.ie/blog/google-analytics-disadvantages-you-cant-track-your-website-crawlers/</link>
		<comments>http://webreach.ie/blog/google-analytics-disadvantages-you-cant-track-your-website-crawlers/#comments</comments>
		<pubDate>Tue, 23 Jun 2009 12:31:49 +0000</pubDate>
		<dc:creator>ollie</dc:creator>
				<category><![CDATA[Google Analytics]]></category>
		<category><![CDATA[Search Engine Marketing]]></category>
		<category><![CDATA[Search Engine Optimisation]]></category>
		<category><![CDATA[crawltrack]]></category>
		<category><![CDATA[googlebot]]></category>
		<category><![CDATA[search engine spiders]]></category>
		<category><![CDATA[Seo]]></category>
		<category><![CDATA[slurp inktomi]]></category>
		<category><![CDATA[website crawler statistics]]></category>

		<guid isPermaLink="false">http://webreach.ie/?p=510</guid>
		<description><![CDATA[Ok first up I have to say that we are all fans of Google Analytics here, with good reason: it&#8217;s zero-cost, its rich array of features: brilliant reporting capabilities, the ability to slice and dice the data in almost every conceivable way, the slick ajax-ified interface, and the support for advanced stuff like advanced segmentation [...]]]></description>
			<content:encoded><![CDATA[<p>Ok first up I have to say that we are all fans of Google Analytics here, with good reason: it&#8217;s zero-cost, its rich array of features: brilliant reporting capabilities, the ability to slice and dice the data in almost every conceivable way, the slick ajax-ified interface, and the support for advanced stuff like advanced segmentation and something that I really love using on e-commerce sites &#8211; goals and funnels, and for the nerds among us &#8211; filters.</p>
<ol>
<li>The analytics data is not available in real-time, you have to wait until midnight passes the next day to see yesterday&#8217;s data</li>
<li>The data is not yours &#8211; it&#8217;s Google&#8217;s &#8211; and for all of the clean image that Google has (&#8221;don&#8217;t be evil&#8221;), forgive me when I say that I prefer to have my own copy of my website trend and visitor data.</li>
<li>Google Analytics works by including a JavaScript snippet in your pages, but search engine spiders (&#8221;crawlers&#8221;) don&#8217;t execute JavaScript when they load your site&#8217;s pages, so their visits aren&#8217;t logged by Google Analytics. So when spiders from GoogleBot, Inktomi (Yahoo), Bing (Microsoft&#8217;s new search engine), Ask and Baidu crawl your site, you know absolutely zero about it from looking at your GA reports.</li>
</ol>
<p><span id="more-510"></span><br />
To track these visitors that in the vast majority of cases go totally unnoticed, you need a server-side scripting package that logs all visits to a database, including search engine spiders. There are a few free ones out there, and one of the best of these is CrawlTrack &#8211; freely available to download at <a href="http://www.crawltrack.net/" target="_blank">http://www.crawltrack.net/</a></p>
<p>The install for CrawlTrack is reasonably straightforward, and can be completed in a few minutes. All you need to do is upload the CrawlTrack files to a new directory on your hosting account, create a new database on your hosting account&#8217;s MySQL server, include a piece of CrawlTrack PHP code in your website code (in a header file, or another appropriate file, as indicated by the CrawlTrack documentation) and go through the automated installation procedure. </p>
<p>CrawlTrack does not interfere with Google Analytics, you can run them side-by-side no problem, and enables you to see which spiders are crawling your site and how often. Here&#8217;s a screenshot of the &#8220;Crawlers&#8221; activity view from one of my new websites just taken today.</p>
<p><img src="http://webreach.ie/wp-content/uploads/crawltrack-crawlers-statistics-2.png" alt="crawltrack-crawlers-statistics-2" title="crawltrack-crawlers-statistics-2" width="700" height="800" class="aligncenter size-full wp-image-519" /></p>
<p>This data shows the percentage of visits from certain spiders. A table at the bottom shows how often each spider is crawling your site, broken down by each individual spider.  Note: a &#8220;Visit&#8221; is one individual page load by the spider. </p>
<p>You can click on the name of the spider (e.g. GoogleBot) to drill down into detailed crawl stats for that specific robot. This view will give you individual crawl stats for each page of the site crawled, as well as stats into what percentage of your site pages were not crawled.</p>
<p>This is great data for SEO as it shows the regard by which the search engines are giving to your site. This is very useful if you are publishing new content on your site, or updating it, and you want to see when the new content is indexed by the search engines, therefore enabling you to monitor the effectiveness of site changes, e.g. you update your site with a special offer, and wait patiently for enquiries to come in. But the thing is that you don&#8217;t know when the search engines will index the new content. With CrawlTrack, you can see exactly when this new content is indexed, therefore giving you the ability to correlate the offer with the response more accurately.  </p>
<p>With CrawlTrack, you will also be able to see trends in your website crawl data that indicates interest from the search engines because of SEO activities like inbound link-building. </p>
<p>Then comparing your CrawlTrack data with your Google Analytics data may give you interesting correlates.   </p>
<p>All comments welcome!</p>
]]></content:encoded>
			<wfw:commentRss>http://webreach.ie/blog/google-analytics-disadvantages-you-cant-track-your-website-crawlers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
