<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>RevaHealth.com Blog &#187; data mining</title>
	<atom:link href="http://blog.revahealth.com/tag/data-mining/feed" rel="self" type="application/rss+xml" />
	<link>http://blog.revahealth.com</link>
	<description>Tech, Marketing, Health 2.0 and stuff from the RevaHealth.com team</description>
	<lastBuildDate>Mon, 16 Nov 2009 13:22:56 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>How To Spot Trends In The Long Tail &#8211; May 2009</title>
		<link>http://blog.revahealth.com/2009/06/what-our-visitors-want-revahealth-com-trends-may-2009.html</link>
		<comments>http://blog.revahealth.com/2009/06/what-our-visitors-want-revahealth-com-trends-may-2009.html#comments</comments>
		<pubDate>Thu, 18 Jun 2009 11:56:07 +0000</pubDate>
		<dc:creator>Philip Boyle</dc:creator>
				<category><![CDATA[Stuff we've learned]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[keyword]]></category>
		<category><![CDATA[long tail]]></category>
		<category><![CDATA[seo]]></category>
		<category><![CDATA[trends]]></category>

		<guid isPermaLink="false">http://blog.revahealth.com/?p=452</guid>
		<description><![CDATA[We&#8217;ve talked before about the advantages of using a long tail strategy to bring traffic to our site, but in digging into what&#8217;s been happening in the last month, some of the disadvantages of the strategy became apparent too. The pros far outweigh the cons, but it&#8217;s worth taking a little time to discuss them nonetheless.
With [...]]]></description>
			<content:encoded><![CDATA[<p>We&#8217;ve talked before about the advantages of using a long tail strategy to bring traffic to our site, but in digging into what&#8217;s been happening in the last month, some of the disadvantages of the strategy became apparent too. The pros far outweigh the cons, but it&#8217;s worth taking a little time to discuss them nonetheless.</p>
<p>With just short of 80,000 pages from RevaHealth.com in Google&#8217;s active index there are an awful lot of ways that visitors can find pages on our site. Looking at the keywords they searched for before arriving on the site should be an easy way to gain an insight into what our visitors want, but the sheer volume of unique keywords arising from a long tail strategy makes this much harder.</p>
<p>For example, in May 2009 there were 83,153 unique keywords that brought people to RevaHealth.com, and the average number of words (fragments) in each keyword was 3.88. Compare this to the 2008 global average of 2.2 words per query and you start to see how different our traffic is. The graph below helps illustrate the problem &#8211; there were 13,097 fragments that were searched for just once in May, while there was one word that was searched for 17,130 times!</p>
<div id="attachment_454" class="wp-caption alignnone" style="width: 326px"><img class="size-full wp-image-454" title="Keyword fragment usage rates May 2009" src="http://blog.revahealth.com/wp-content/uploads/2009/06/keyword-fragment-usage-rates-may-2009.jpg" alt="Keyword fragment usage rates May 2009" width="316" height="190" /><p class="wp-caption-text">Keyword fragment usage rates May 2009</p></div>
<p>So what was this most searched for fragment? Not surprisingly for a site organised by locations, it was &#8220;in&#8221;. Also unsurprisingly for RevaHealth.com, numbers 2 and 3 were &#8220;dentist&#8221; (12,837 times) and &#8220;dental&#8221; (10,041 times) respectively. The difficulty however comes with how quickly these numbers drop off. For instance, our hundredth most popular keyword fragment was &#8220;leicester&#8221;, with just 379 searches in May. In fact, you may have noticed that the above graph&#8217;s vertical axis uses a logarithmic scale just so you can see any detail!</p>
<p>The point of all this is that using a long tail strategy to attract traffic does make it harder to spot the underlying trends by mining your keywords. Some tips I would suggest are:</p>
<ol>
<li>splitting keywords into fragments and aggregating &#8211; this at least helps to point out the most repeated individual words</li>
<li>create groups of keyword fragments under different themes or meanings and measure their collective popularity &#8211; this can help to identify broader trends i.e. which is more important to your visitors &#8211; price or quality?</li>
<li>watch out for rising trends by looking for unexpected keyword fragments appearing high on your list &#8211; these can help to identify timely sales opportunities</li>
</ol>
<p>I&#8217;d also be interested in hearing how you go about analysing your keywords to identify important SEO terms, visitor trends and sales opportunities, so leave a comment and let us know.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.revahealth.com/2009/06/what-our-visitors-want-revahealth-com-trends-may-2009.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
