<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>StanleyWong.org &#187; SEO</title>
	<atom:link href="http://stanleywong.org/category/seo/feed/" rel="self" type="application/rss+xml" />
	<link>http://stanleywong.org</link>
	<description>Musings from the Interactive Adman</description>
	<lastBuildDate>Fri, 11 Sep 2009 22:06:35 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Real Time Streams becoming more relevant</title>
		<link>http://stanleywong.org/2009/05/18/23/</link>
		<comments>http://stanleywong.org/2009/05/18/23/#comments</comments>
		<pubDate>Mon, 18 May 2009 00:24:46 +0000</pubDate>
		<dc:creator>SWong</dc:creator>
				<category><![CDATA[Interactive Advertising]]></category>
		<category><![CDATA[Marketing]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Startups]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://stanleywong.org/?p=23</guid>
		<description><![CDATA[Great article on TechCrunch today about &#8220;Jump Into the Stream&#8221; and how we are in an interesting shift to real time streamed information.  The biggest challenge is to make sense of this deluge of information and to figure out what is meaningful and what is meaningless. With Google making some rumbling about real time search [...]]]></description>
			<content:encoded><![CDATA[<p>Great article on TechCrunch today about &#8220;<a title="TechCrunch Jump Into the Stream" href="http://www.techcrunch.com/2009/05/17/jump-into-the-stream/" target="_blank">Jump Into the Stream</a>&#8221; and how we are in an interesting shift to real time streamed information.  The biggest challenge is to make sense of this deluge of information and to figure out what is meaningful and what is meaningless.</p>
<p>With Google making some rumbling about real time search this is going to get really interesting.  Much like SEO is used to drive traffic to web 1.0 websites, social bookmarking and social discovery was used to drive traffic to websites in Web 2.0.  Web 3.0 may be where we see another source of traffic generation through different real time feeds and streams.  An indication of this is <a title="How Twitter &amp; Facebook Now Compete with Google" href="http://blogmaverick.com/2009/05/15/how-twitter-and-facebook-now-compete-with-google/" target="_blank">Mark Cuban&#8217;s recent blog post</a> about how he is getting more and more traffic from twitter and facebook than Google.  In his case it is definitely Streams (Facebook + Twitter) &gt; SEO (Google).</p>
<p>Very intent to see where this leads us.</p>
]]></content:encoded>
			<wfw:commentRss>http://stanleywong.org/2009/05/18/23/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Price of Internet Content SPAM</title>
		<link>http://stanleywong.org/2007/04/03/the-price-of-internet-content-spam/</link>
		<comments>http://stanleywong.org/2007/04/03/the-price-of-internet-content-spam/#comments</comments>
		<pubDate>Tue, 03 Apr 2007 06:25:16 +0000</pubDate>
		<dc:creator>SWong</dc:creator>
				<category><![CDATA[Interactive Advertising]]></category>
		<category><![CDATA[SEM]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[Search]]></category>

		<guid isPermaLink="false">http://stanleywong.org/2007/04/03/the-price-of-internet-content-spam/</guid>
		<description><![CDATA[A recent research report from Microsoft (NY Times article) claimed that approximately 11% of search results and as much as 30% of some competitive search results contain Internet Content SPAM. Spammers create these pages for the sole purpose of profiting from advertisements by fooling the search engines (i.e. Google, Yahoo!,etc.) to rank their pages well. [...]]]></description>
			<content:encoded><![CDATA[<p>A <a target="_blank" title="Microsoft Research" href="http://research.microsoft.com/SearchRanger/">recent research report from Microsoft</a> (<a target="_blank" href="http://www.nytimes.com/2007/03/19/technology/19spam.html?ex=1331956800&#038;en=44a8402e53db4153&#038;ei=5090&#038;partner=rssuserland&#038;emc=rss">NY Times article</a>) claimed that approximately 11% of search results and as much as 30% of some competitive search results contain Internet Content SPAM.  Spammers create these pages for the sole purpose of profiting from advertisements by fooling the search engines  (i.e. Google, Yahoo!,etc.) to rank their pages well.  Typically these SPAM sites are computer generated to create huge numbers of pages each targeted to rank well with a specific keyword.</p>
<p>There are many strategies, but typically these SPAM sites are computer generated to produce a huge numbers of pages, each targeted to rank well with a specific keyword.  I&#8217;ve found several SPAM content generator scripts (<a target="_blank" title="YACH" href="http://getyacg.com/">YACG</a>, <a target="_blank" title="RSSGM" href="http://www.rssgm.com/">RSSGM</a>, <a target="_blank" title="WP Autoblog" href="http://elliottback.com/wp/archives/2006/06/06/wp-autoblog-a-syndication-plugin/">WP AutoBlog</a>, &#038; <a title="MyGen" target="_blank" href="http://www.boogybonbon.com/2006/08/17/mygen-v102-now-available/">MyGen</a>) on the web that take content snippets from a variety of articles cribbed from across the web.  These scripts auto generate content by jumbling content snippets into search engine readable pages using statistical algorithms such as Markov Chains.  Add a custom template to make the site unique and create an internal link network between pages within the site.</p>
<p>Once the site is built, Spammers create huge numbers of bogus blogs (Blogger) and webpages (from free hosting sites such as Yahoo!â€™s GeoCities) to build links directed towards the Spam site.  This confirmed by the Microsoft study where most free web hosting and free blog sites have over 70% Internet Content SPAM.</p>
<p>Internet Content SPAM is a huge problem for Search Engines because it degrades the quality their search results.  As Albert Einstein famously said &#8220;<span class="body">We can&#8217;t solve problems by using the same kind of thinking we used when we created them.&#8221;, it is tough for Search Engines to identify Internet Content SPAM.  Typical approaches entail usage of filters and/or blacklists when a SPAM signature or domain is discovered.  This makes Internet Content SPAM one of the most vexing issues for this generation of Search Engines.</span></p>
<blockquote><p><span class="body" /></p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://stanleywong.org/2007/04/03/the-price-of-internet-content-spam/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
