<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Java. Internet. Algorithms. Ideas. &#187; google trends</title>
	<atom:link href="http://philippeadjiman.com/blog/category/google-trends/feed/" rel="self" type="application/rss+xml" />
	<link>http://philippeadjiman.com/blog</link>
	<description>Just Another Blog About Geek Stuff, by Philippe Adjiman</description>
	<lastBuildDate>Tue, 25 May 2010 06:58:19 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>What Are The 10 Most Cited Websites On Twitter When Tweeting About Hot Trends?</title>
		<link>http://philippeadjiman.com/blog/2010/02/06/what-are-the-10-most-cited-websites-on-twitter-when-tweeting-about-hot-trends/</link>
		<comments>http://philippeadjiman.com/blog/2010/02/06/what-are-the-10-most-cited-websites-on-twitter-when-tweeting-about-hot-trends/#comments</comments>
		<pubDate>Sat, 06 Feb 2010 18:15:09 +0000</pubDate>
		<dc:creator>padjiman</dc:creator>
				<category><![CDATA[experiments]]></category>
		<category><![CDATA[google trends]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://philippeadjiman.com/blog/?p=907</guid>
		<description><![CDATA[Lately I wrote a post on how to build a relevant real time search engine prototype in few hundreds lines of code.  Using a tailored ranking algorithm based on link popularity in twitter,  I showed that the prototype was able to return very relevant answers in response to very hot queries like the ones that can [...]]]></description>
			<content:encoded><![CDATA[<p>Lately I wrote a post on<a href="http://philippeadjiman.com/blog/2010/01/06/how-to-build-a-relevant-real-time-search-engine-prototype-in-few-hundred-lines-of-code/" target="_blank"> how to build a relevant real time search engine prototype in few hundreds lines of code</a>.  Using a tailored ranking algorithm based on link popularity in twitter,  I showed that the prototype was able to return very relevant answers in response to very hot queries like the ones that can be found in the hourly updated list of <a href="http://www.google.com/trends/hottrends?sa=X" target="_blank">google hot trends</a>.</p>
<p>I wrote a small program on top of this prototype to run an experiment: each hour, the program crawl the new list of hot queries from google hot trends, then it runs the prototype on each of those queries and keep the hottest link found in twitter for the corresponding hot query. I wanted to see which websites were mostly cited in those tweets talking about hot trends.</p>
<p>So I let ran the program for a week, collected the  links (more than a thousand), expanded all those into their long URLs version (using an improved version of my <a href="http://philippeadjiman.com/blog/2009/09/07/the-trick-to-write-a-fast-universal-java-url-expander/" target="_blank">java universal URL expander</a>),  extracted the domain names and compiled the whole into a top 10 list of the most cited websites. Here it is (click to enlarge):</p>
<div id="attachment_966" class="wp-caption aligncenter" style="width: 244px"><a href="http://philippeadjiman.com/blog/wp-content/uploads/2010/02/top10twitterBuzzWebsites.jpg" target="_blank"><img class="size-medium wp-image-966 " title="top10twitterBuzzWebsites" src="http://philippeadjiman.com/blog/wp-content/uploads/2010/02/top10twitterBuzzWebsites-234x300.jpg" alt="top10twitterBuzzWebsites" width="234" height="300" /></a><p class="wp-caption-text">The Most Cited Websites When Tweeting About Hot Trends. Click to enlarge.</p></div>
<p>I was surprised to see some websites that I&#8217;ve never heard about before (like wpparty.com or actionnewsblast.com).</p>
<p>To have a better idea for which kind of hot queries/topics those websites are most cited in twitter, find below, for each of those top website, a sample of 5 <a href="http://www.google.com/trends/hottrends?sa=X" target="_blank">google hot trends</a> query they covered last week.</p>
<div>
<table class="pretty" border="0" align="center">
<tbody>
<tr>
<th>Website</th>
<th>Sample of 5 covered google hot trends of this past week</th>
</tr>
<tr>
<td style="text-align: center; "><a href="http://edition.cnn.com/" target="_blank">www.cnn.com</a></td>
<td style="text-align: center; "><a href="http://www.cnn.com/2010/POLITICS/02/01/obama.budget.explainer/index.html?eref=rss_topstories&amp;utm_source=twitterfeed&amp;utm_medium=twitter&amp;utm_campaign=Feed%3A+rss%2Fcnn_topstories+%28RSS%3A+Top+Stories%29" target="_blank">2011 budget</a><br />
<a href="http://www.cnn.com/video/?utm_source=twitterfeed&amp;utm_medium=twitter&amp;utm_campaign=Feed%3A+rss%2Fcnn_freevideo+%28RSS%3A+Video%29#/video/tech/2010/01/27/barnett.ipad.specs.strategy.cnn" target="_blank">ipad tablet</a><br />
<a href="http://www.cnn.com/interactive/2010/01/world/haiti.360/index.html?video=haiti.flv" target="_blank">cnn.com/haiti360</a><br />
<a href="http://www.cnn.com/2010/WORLD/europe/02/02/france.concorde.trial/index.html?eref=rss_topstories&amp;utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed%3A+rss%2Fcnn_topstories+%28RSS%3A+Top+Stories%29" target="_blank">concorde crash</a><br />
<a href="http://www.cnn.com/2010/SPORT/01/31/tennis.australia.open.final.federer.murray/index.html" target="_blank">federer murray</a></td>
</tr>
<tr>
<td style="text-align: center;"><a href="http://sports.espn.go.com/" target="_blank">sports.espn.go.com</a></td>
<td style="text-align: center; "><a href="http://sports.espn.go.com/sports/tennis/aus10/news/story?id=4867619&amp;campaign=rss&amp;source=ESPNHeadlines&amp;utm_source=twitterfeed&amp;utm_medium=twitter" target="_blank">federer tsonga australian open</a><br />
<a href="http://sports.espn.go.com/mlb/news/story?id=4877065&amp;campaign=rss&amp;source=MLBHeadlines" target="_blank">aaron miles</a><br />
<a href="http://sports.espn.go.com/nfl/news/story?id=4872278&amp;campaign=rss&amp;source=twitter&amp;ex_cid=Twitter_espn_4872278" target="_blank">tom brookshier</a><br />
<a href="http://sports.espn.go.com/dallas/news/story?id=4869265&amp;campaign=rss&amp;source=ESPNHeadlines" target="_blank">jackson jeffcoat</a><br />
<a href="http://sports.espn.go.com/boston/nba/news/story?id=4881306&amp;campaign=rss&amp;source=ESPNHeadlines&amp;utm_source=twitterfeed&amp;utm_medium=twitter" target="_blank">paul pierce</a></td>
</tr>
<tr>
<td style="text-align: center;"><a href="http://wpparty.com/" target="_blank">wpparty.com</a></td>
<td style="text-align: center; "><a href="http://wpparty.com/2010/01/29/espn-football-henderson-jeffcoat-and-more-battles-usa-news/" target="_blank">jackson jeffcoat</a><br />
<a href="http://wpparty.com/2010/02/01/foghat-and-leon-russell-coming-to-spotlight-29-casino/" target="_blank">leon russell</a><br />
<a href="http://wpparty.com/2010/01/30/lagat-wins-6th-wanamaker-mile/" target="_blank">wanamaker mile</a><br />
<a href="http://wpparty.com/2010/01/26/hey-its-that-studded-blazer-again/" target="_blank">buffalo exchange</a><br />
<a href="http://wpparty.com/2010/02/03/lahood-tells-owners-of-recalled-toyotas-to-stop-driving-vehicles/" target="_blank">recalled toyotas</a></td>
</tr>
<tr>
<td style="text-align: center;"><a href="http://www.huffingtonpost.com" target="_blank">www.huffingtonpost.com</a></td>
<td style="text-align: center; "><a href="http://www.huffingtonpost.com/2010/01/27/bob-mcdonnell-speech-full_n_439508.html" target="_blank">governor of virginia</a><br />
<a href="http://www.huffingtonpost.com/thenewswire/archive/../../2010/01/29/transcript-of-president-o_n_442423.html" target="_blank">obama republican retreat</a><br />
<a href="http://www.huffingtonpost.com/thenewswire/archive/../../2010/01/29/obama-goes-to-the-gop-lio_n_442331.html" target="_blank">obama gop</a><br />
<a href="http://www.huffingtonpost.com/2010/01/26/apple-tablet-announcement_n_436859.html" target="_blank">apple tablet announcement</a><br />
<a href="http://www.huffingtonpost.com/2010/02/02/groundhog-day-prediction-_n_445601.html" target="_blank">groundhog prediction</a></td>
</tr>
<tr>
<td style="text-align: center;"><a href="http://twitpic.com/" target="_blank">twitpic.com</a></td>
<td style="text-align: center; "><a href="http://twitpic.com/10mo6p" target="_blank">miss america 2010 winner</a><br />
<a href="http://twitpic.com/10wusc" target="_blank">what celeb do i look like</a><br />
<a href="http://twitpic.com/10szff" target="_blank">footprints in the sand</a><br />
<a href="http://twitpic.com/zzq98" target="_blank">apple itablet</a><br />
<a href="http://twitpic.com/zz5by" target="_blank">itablet</a></td>
</tr>
<tr>
<td style="text-align: center;"><a href="http://www.youtube.com/" target="_blank">www.youtube.com</a></td>
<td style="text-align: center;"><a href="http://www.youtube.com/watch?v=YFNQE_TzQNI&amp;feature=youtu.be" target="_blank">i pad</a><br />
<a href="http://www.youtube.com/watch?v=XDCeXrZgbjs&amp;feature=youtu.be" target="_blank">grammy awards 2010</a><br />
<a href="http://www.youtube.com/watch?v=mfZ60a1QbCY" target="_blank">bob kellar</a><br />
<a href="http://www.youtube.com/watch?v=KQmtKOOBO2I" target="_blank">lakers celtics</a><br />
<a href="http://www.youtube.com/watch?v=lQnT0zp8Ya4" target="_blank">ipad a disappointment</a></td>
</tr>
<tr>
<td style="text-align: center;"><a href="http://www.facebook.com/" target="_blank">www.facebook.com</a></td>
<td style="text-align: center; "><a href="http://www.facebook.com/AtlantaHistoryCenter/posts/306004651553" target="_blank">general beauregard lee</a><br />
<a href="http://www.facebook.com/photo.php?pid=3865141&amp;l=4a527701cc&amp;id=93944052260" target="_blank">roberta flack</a><br />
<a href="http://www.facebook.com/GRANDAMRoadRacing/posts/312377461577" target="_blank">action express racing</a><br />
<a href="http://www.facebook.com/JimmyKimmelLive/posts/272447853002" target="_blank">slightly stoopid</a><br />
<a href="http://www.facebook.com/permalink.php?story_fbid=273782412571&amp;id=218464326195" target="_blank">rolex 24 hours daytona</a></td>
</tr>
<tr>
<td style="text-align: center;"><a href="http://www.actionnewsblast.com/" target="_blank">www.actionnewsblast.com</a></td>
<td style="text-align: center; "><a href="http://www.actionnewsblast.com/codswallop-codswallop-meaning" target="_blank">codswallop meaning</a><br />
<a href="http://www.actionnewsblast.com/blow-out-star-antin-joins-bravos-shear-genius-as-judge" target="_blank">jonathan antin</a><br />
<a href="http://www.actionnewsblast.com/ex-edwards-aide-money-was-no-object" target="_blank">fred baron</a><br />
<a href="http://www.actionnewsblast.com/codswallop-codswallop-meaning" target="_blank">codswallop definition</a><br />
<a href="http://www.actionnewsblast.com/taylor-swift-grammy-fascinating-fact" target="_blank">stevie nicks</a></td>
</tr>
<tr>
<td style="text-align: center;"><a href="http://www.netnewsticker.com/" target="_blank">www.netnewsticker.com</a></td>
<td style="text-align: center; "><a href="http://www.netnewsticker.com/how-do-i-get-more-energy" target="_blank">arc energy</a><br />
<a href="http://www.netnewsticker.com/jr-ego-ferguson-goes-for-lsu-the-composed-gentleman" target="_blank">ego ferguson</a><br />
<a href="http://www.netnewsticker.com/schoolcraft-womens-team-keeps-rolling" target="_blank">kim burrell</a><br />
<a href="http://www.netnewsticker.com/how-do-i-know-if-i-reserved-a-camping-spot-correctly-online" target="_blank">reserveamerica</a><br />
<a href="http://www.netnewsticker.com/andy-staples-running-analysis-for-2010-signing-day" target="_blank">ivan mccartney</a></td>
</tr>
<tr>
<td style="text-align: center;"><a href="http://mashable.com/" target="_blank">mashable.com</a></td>
<td style="text-align: center; "><a href="http://mashable.com/2010/01/29/national-lady-gaga-day/" target="_blank">national lady gaga day</a><br />
<a href="http://mashable.com/apple-tablet/" target="_blank">ipad tablet</a><br />
<a href="http://mashable.com/2010/01/27/ipad-whats-missing/#comment-31633181" target="_blank">ipad thoughts</a><br />
<a href="http://mashable.com/2010/01/29/doppelganger-week-facebook/" target="_blank">doppelganger week facebook</a><br />
<a href="http://mashable.com/2010/01/26/tim-tebow-super-bowl-ad/" target="_blank">tebow super bowl ad</a></td>
</tr>
</tbody>
</table>
</div>
<p>Few remarks:</p>
<ul>
<li>All the links spotted by <a href="http://philippeadjiman.com/blog/2010/01/06/how-to-build-a-relevant-real-time-search-engine-prototype-in-few-hundred-lines-of-code/" target="_blank">my prototype</a> and that appear in the table are coming from real tweets around those google hot trends queries.</li>
<li>You&#8217;ll notice that apple iPad announcement is a theme that was covered by 4 of those top 10 websites!</li>
<li>I recommend you to have a look on the youtube video in the table around the google hot trend &#8220;ipad a disappointment&#8221; <img src='http://philippeadjiman.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> .</li>
<li>I also recommend you to have a look at the haiti 360 view covered by cnn.</li>
<li>For <a href="http://twitpic.com/" target="_blank">twitpic</a>, it is only pics, so what you&#8217;ll find there is a sample of &#8220;trendy pics&#8221; (see below for more on that&#8230;)</li>
<li>Sometimes the hot query seems to be not connected with the related article at first view (like with <a style="color: #114477; text-decoration: underline;" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.actionnewsblast.com');" href="http://www.actionnewsblast.com/ex-edwards-aide-money-was-no-object" target="_blank">fred baron</a>). But when you take a closer look, there is always a connection! This is not for nothing that people tweet about a link with the text of the hot query in the tweet&#8230;</li>
</ul>
<p>To finish, find below a picasa collage that I built using the most cited twitpic pictures in twitter for this past week of hot trends (not only the 5 cited in the table). You&#8217;ll identify easily some sarcastic pictures before the iPad announcement or pics around the election of Miss USA. Click the picture to enlarge.</p>
<div id="attachment_984" class="wp-caption aligncenter" style="width: 310px"><a href="http://philippeadjiman.com/blog/wp-content/uploads/2010/02/picasaCollageTopPics.jpg" target="_blank"><img class="size-medium wp-image-984" title="picasaCollageTopPics" src="http://philippeadjiman.com/blog/wp-content/uploads/2010/02/picasaCollageTopPics-300x225.jpg" alt="picasaCollageTopPics" width="300" height="225" /></a><p class="wp-caption-text">Collage of the most cited twitpic links in twitter for a week of google hot trends (Click to enlarge) </p></div>
<p>If you&#8217;re curious to map some pictures with its related hot topic, click the collage to enlarge it and try to guess which pics correspond to which google hot query below <img src='http://philippeadjiman.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> .</p>
<p><a href="http://twitpic.com/10mo6p" target="_blank">miss america 2010 winner</a>, <a href="http://twitpic.com/10wusc" target="_blank">what celeb do i look like</a>, <a href="http://twitpic.com/10llra" target="_blank">miss america 2010</a>, <a href="http://twitpic.com/10drx8" target="_blank">roberta flack</a>, <a href="http://twitpic.com/10seq4" target="_blank">lady gaga and elton john</a>, <a href="http://twitpic.com/108d4s" target="_blank">addicted to love</a>, <a href="http://twitpic.com/zuomi" target="_blank">jim florentine</a>, <a href="http://twitpic.com/zzq98" target="_blank">apple itablet</a>, <a href="http://twitpic.com/111q7d" target="_blank">lost season 6 premiere</a>, <a href="http://twitpic.com/10pd73" target="_blank">candy crowley</a>, <a href="http://twitpic.com/zrbar" target="_blank">to make you feel my love</a>, <a href="http://twitpic.com/101scz" target="_blank">swagger crew</a>, <a href="http://twitpic.com/10szff" target="_blank">footprints in the sand</a>, <a href="http://twitpic.com/10j2ao" target="_blank">gasparilla</a>, <a href="http://twitpic.com/10m77i" target="_blank">miss virginia</a>, <a href="http://twitpic.com/10ja2k" target="_blank">duke georgetown</a>, <a href="http://twitpic.com/107dml" target="_blank">celebrity look alike</a>, <a href="http://twitpic.com/10ly58" target="_blank">katherine putnam</a>, <a href="http://twitpic.com/zz5by" target="_blank">itablet</a>, <a href="http://twitpic.com/10swcm" target="_blank">andrea bocelli</a>, <a href="http://twitpic.com/wygnz" target="_blank">monster diesel</a>, <a href="http://twitpic.com/z2wv5" target="_blank">peta ad</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://philippeadjiman.com/blog/2010/02/06/what-are-the-10-most-cited-websites-on-twitter-when-tweeting-about-hot-trends/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Hot Trends Clustering: The 100 Hottest Queries Tell You About 67.76 Stories In Average</title>
		<link>http://philippeadjiman.com/blog/2009/09/27/google-hot-trends-clustering-the-100-hottest-queries-tell-you-about-67-76-stories-in-average/</link>
		<comments>http://philippeadjiman.com/blog/2009/09/27/google-hot-trends-clustering-the-100-hottest-queries-tell-you-about-67-76-stories-in-average/#comments</comments>
		<pubDate>Sun, 27 Sep 2009 09:05:37 +0000</pubDate>
		<dc:creator>padjiman</dc:creator>
				<category><![CDATA[experiments]]></category>
		<category><![CDATA[google trends]]></category>
		<category><![CDATA[algorithm]]></category>
		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://philippeadjiman.com/blog/?p=199</guid>
		<description><![CDATA[Did you noticed that among the 100 (hourly updated) Google Hot Trends, there are always several hot queries that are related one to the other?
Let&#8217;s take  a look at the Hot Trends of the current hour by the time I&#8217;m writing this post: Hot Trends of  September 24 at 11PM PST Time (clicking on the [...]]]></description>
			<content:encoded><![CDATA[<p>Did you noticed that among the 100 (hourly updated) <a href="http://www.google.com/trends/hottrends?sa=X" target="_blank">Google Hot Trends</a>, there are always several hot queries that are related one to the other?</p>
<p>Let&#8217;s take  a look at the Hot Trends of the current hour by the time I&#8217;m writing this post: <a href="http://philippeadjiman.com/blog/wp-content/uploads/2009/09/2009-9-24.html" target="_blank">Hot Trends of  September 24 at 11PM PST Time</a> (clicking on the keywords won&#8217;t work, it is just a local copy of the file at that time). In few seconds, we can spot some similar queries, for instance Hot Trend #5 &#8220;sean salisbury&#8221; is clearly related to Hot Trend #45 &#8220;sean salisbury internet postings&#8221; and also to Hot Trend #57 &#8220;sean salisbury cell phone incident&#8221; (click the picture to enlarge).</p>
<p><a href="http://philippeadjiman.com/blog/wp-content/uploads/2009/09/ScreenShot0561.jpg" target="_blank"><img class="aligncenter size-medium wp-image-292" title="SeanClust3" src="http://philippeadjiman.com/blog/wp-content/uploads/2009/09/ScreenShot0561-300x158.jpg" alt="SeanClust3" width="300" height="158" /></a></p>
<p>Now, a small quizz: is there a link between Hot Trend #48 &#8220;julia grovenburg&#8221; and Hot Trend #8 &#8220;superfetation&#8221;, and what the hell is &#8220;superfetation&#8221;??.</p>
<p>So first, yes, there is a link between those two queries, and you can discover it if you click on &#8220;superfetation&#8221; which will give you its related searches:</p>
<p><a href="http://philippeadjiman.com/blog/wp-content/uploads/2009/09/ScreenShot0461.jpg" target="_blank"><img class="aligncenter size-medium wp-image-206" title="superfetationDetails" src="http://philippeadjiman.com/blog/wp-content/uploads/2009/09/ScreenShot0461-300x75.jpg" alt="superfetationDetails" width="300" height="75" /></a></p>
<p>So if you had time to loose, you would be able to click on the 100 queries and use this method to eventually build this cluster of 8 queries:</p>
<p><a href="http://philippeadjiman.com/blog/wp-content/uploads/2009/09/ScreenShot057.jpg"><img class="aligncenter size-medium wp-image-306" title="superfetationClust8" src="http://philippeadjiman.com/blog/wp-content/uploads/2009/09/ScreenShot057-300x160.jpg" alt="superfetationClust8" width="300" height="160" /></a></p>
<ul>
<li>The words in the cluster can give more insights of what this story is all about: Julia Grovenburg was pregnant and was pregnant again (apparently during the same pregnancy) which is a phenomenon called superfetation. You can verify it on a news article of the same day:</li>
</ul>
<p style="text-align: center;"><a href="http://www.nydailynews.com/lifestyle/health/2009/09/24/2009-09-24_woman_is_pregnant_with_two_babies_not_twins_rare_case_of_superfetation_say_docs.html" target="_blank"><img class="alignnone size-full wp-image-227" title="newsPregnancy" src="http://philippeadjiman.com/blog/wp-content/uploads/2009/09/superfetation.jpg" alt="newsPregnancy" width="351" height="386" /></a></p>
<ul>
<li>Looking at the cluster, you can also think that the baby after birth was a &#8220;19 pound baby&#8221; but actually this a completely different breaking news, not linked at all with the previous one. This misleading link shows that <strong>related searches</strong> is a great feature but not an exact science and sometimes (not often however) some errors can arise in related searches:</li>
</ul>
<p style="text-align: center;"><a href="http://philippeadjiman.com/blog/wp-content/uploads/2009/09/ScreenShot050.jpg" target="_blank"><img class="size-full wp-image-233 aligncenter" title="wrongRelatedSearches" src="http://philippeadjiman.com/blog/wp-content/uploads/2009/09/ScreenShot050.jpg" alt="wrongRelatedSearches" width="476" height="87" /></a></p>
<p>I have some intuitions about how those related searches are detected and how those errors happens. It&#8217;s beyond the scope of this post but if you are interested about it, shoot me an email.</p>
<p>So I implemented a link-based clustering algorithm that knows how to plug to google hot trends data ant that build all that stuff automatically. Two queries are in the same cluster if one of the 3 following conditions is true:</p>
<ul>
<li> the queries themselves are similar</li>
<li>one of the query is similar to one of the related searches of the other</li>
<li>one of the query related searches is similar to one of the related searches of the other</li>
</ul>
<p>I used a similarity measure that works well for short text like queries, along with a black list of words to not disturb the similarity with words like &#8220;the&#8221; or &#8220;a&#8221;, etc&#8230; . I also empirically determined different thresholds for the three different cases described above. If you have more questions about that stuff, feel free to shoot a comment or to contact me.</p>
<p><strong>So How Many Clusters Can I Build Out Of The 100 Google Hot Trends Queries?</strong></p>
<p>You got it from this post title: 67.76 clusters in average (based on crawled data that represents few months of hot trends). Each cluster is supposed to represent a same &#8220;story&#8221; or breaking news. Note that this number is also dependent of my thresholds and that other algorithms and/or thresholds (more or less strict) can obtain slightly different numbers.</p>
<p>Of course, some errors can also arise, either because of some misleading related searches (like showed above) or because is some cases two queries look very similar but in reality they are speaking about two different things.</p>
<p>As an example of output, see the <a href="http://philippeadjiman.com/blog/wp-content/uploads/2009/09/clusters.txt" target="_blank">file generated for the 100 keywords studied in this post</a>.</p>
<p><strong>What It Is Useful For?</strong></p>
<p>First of all it is fun <img src='http://philippeadjiman.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> . Second, in information retrieval, order is always better than the opposite. But much more than that: if you are a breaking news website or blog, you&#8217;d better use in your article all the keywords of the same cluster since they represent the hottest searched queries of that particular story represented in its cluster! From an <strong>SEO </strong>point of view, I think the interest is pretty clear.</p>
<p><strong>BONUS</strong></p>
<p>If you read the post up to here, I&#8217;d like to offer you a small bonus <img src='http://philippeadjiman.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> . It is the <strong>HUGEST </strong>cluster that I was able to observe running my program on the last few years of google hot trends data. I think you already guessed to which breaking news it is related.  <a href="http://philippeadjiman.com/blog/wp-content/uploads/2009/09/sortedHugestClustersComments.txt" target="_blank"><strong>C</strong><strong>heck it out!</strong></a></p>
<p><strong>Update</strong>: Coincidence, the day after I wrote this post the hot trends list <a href="http://googleblog.blogspot.com/2009/09/keep-up-with-latest-trends-using-google.html" target="_blank">was reduced from 100 to 40</a>, so the screenshots and data above are in souvenir of the older version <img src='http://philippeadjiman.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> .</p>
]]></content:encoded>
			<wfw:commentRss>http://philippeadjiman.com/blog/2009/09/27/google-hot-trends-clustering-the-100-hottest-queries-tell-you-about-67-76-stories-in-average/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Can You Guess What Is The Hottest Trend Of Google Hot Trends ?</title>
		<link>http://philippeadjiman.com/blog/2009/09/01/can-you-guess-what-is-the-hottest-trend-of-google-hot-trends/</link>
		<comments>http://philippeadjiman.com/blog/2009/09/01/can-you-guess-what-is-the-hottest-trend-of-google-hot-trends/#comments</comments>
		<pubDate>Tue, 01 Sep 2009 21:19:50 +0000</pubDate>
		<dc:creator>padjiman</dc:creator>
				<category><![CDATA[experiments]]></category>
		<category><![CDATA[google trends]]></category>

		<guid isPermaLink="false">http://philippeadjiman.com/blog/?p=10</guid>
		<description><![CDATA[Either if you are working in SEO, or if you are a  &#8220;trends hacker&#8221;, or if you love like me doing useless comparisons like hanukkah vs passover, you obviously know the fantastic google trends tool.
I&#8217;m even more fascinated by the google hot trends functionality that shows the 100 hottest English queries typed in the world [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.google.com/trends"><img class="alignnone size-full wp-image-17" title="GoogleTrends" src="http://philippeadjiman.com/blog/wp-content/uploads/2009/08/screenshot019.jpg" alt="screenshot019" width="299" height="139" align="left" /></a>Either if you are working in SEO, or if you are a  &#8220;<a href="http://mashable.com/2008/10/03/google-trends-malicious-hackers/" target="_blank">trends hacker&#8221;</a>, or if you love like me doing useless comparisons like <a href="http://www.google.com/trends?q=hannukah%2Cpassover&amp;ctab=0&amp;geo=all&amp;date=all&amp;sort=0" target="_blank">hanukkah vs passover</a>, you obviously know the fantastic google trends tool.</p>
<p>I&#8217;m even more fascinated by the google <a href="http://www.google.com/trends/hottrends?sa=X" target="_blank">hot trends</a> functionality that shows the 100 hottest English queries typed in the world right now (actually the 100 fastest-rising ones in the current hour, else you would always see generic terms like &#8216;weather&#8217;).</p>
<p>I asked myself a simple question: is there some queries that always appearing over and over in this top 100 list? Can we discover patterns of queries? To answer it, I write for fun a simple crawler to crawl the daily list since the service exists (May 15, 2007) and I generated a list of the hottest phrases (meaning the hottest n-grams of words, not queries).</p>
<p><strong>Can you guess if there is a clear winner?</strong></p>
<p>Actually there is one. The phrase &#8220;lyrics&#8221;.  As of today (August 31 2009), it always appears to be the most frequent hottest keyword in different settings:</p>
<ul>
<li>759 occurrences if you consider the whole daily top 100 list. Think about it: since May 15, 2007,  it&#8217;s been 809 days (thanks <a href="http://www.jeffpalm.com/dayssince/" target="_blank">Jeffrey</a>). Even if it appears sometimes several times in a single day, it means that <strong>almost everyday</strong>, the word lyrics appears in the 100 hottest English queries in the world!!!</li>
<li>207 occurrences if you consider only the daily top 10 list.</li>
<li>124 occurrences if you consider only the daily top 5 list.</li>
<li>34 occurrences if you consider only <span style="text-decoration: underline;">the</span> daily hottest keyword.</li>
</ul>
<p>But again, &#8216;lyrics&#8217; is always the top ranked phrase of all the lists  I generated. Seems however like a <a href="http://philippeadjiman.com/blog/wp-content/uploads/2009/09/screenshot001.jpg" target="_blank">decreasing trend</a>.</p>
<p>What about other phrases?  Here are <a href="http://philippeadjiman.com/blog/wp-content/uploads/2009/09/topexamples.txt" target="_blank">few other examples</a> of the top phrases appearing over and over in all day top world queries. Note that you don&#8217;t necessarily want to  build a business around one of those hot topics since all of them are in general already overcrowded niches.</p>
<p>What about patterns? If you perform some entity extraction  you can observe some recurring patterns  like <em>&#8216;XXX <strong>death</strong>&#8216;</em> or <em>&#8216;XXX <strong>divorce</strong>&#8216;</em> where <em>XXX </em>is the name of a celebrity. I also noticed that users are much more interested in celebrities divorces than marriages <img src='http://philippeadjiman.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> .</p>
<p>In summary, Google hot trends is fun. In the new <a href="http://blogs.alianzo.com/socialnetworks/2009/01/21/the-real-time-web-the-new-buzz-word-for-2009/" target="_blank">real time web buzz</a>, this service is not really meant to be a competitor, but it is still my favorite way of feeling the pulse of the web.</p>
]]></content:encoded>
			<wfw:commentRss>http://philippeadjiman.com/blog/2009/09/01/can-you-guess-what-is-the-hottest-trend-of-google-hot-trends/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
	</channel>
</rss>
