<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Help Needed for Web Scraping</title>
	<atom:link href="http://itredux.com/2007/02/20/help-needed-for-web-scraping/feed/" rel="self" type="application/rss+xml" />
	<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/</link>
	<description>New Rules for a New IT World</description>
	<lastBuildDate>Wed, 25 Aug 2010 01:35:21 -0400</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Ismael Ghalimi</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-58951</link>
		<dc:creator>Ismael Ghalimi</dc:creator>
		<pubDate>Wed, 21 Mar 2007 21:09:26 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-58951</guid>
		<description>Mike,

I will give it a shot!

-Ismael</description>
		<content:encoded><![CDATA[<p>Mike,</p>
<p>I will give it a&nbsp;shot!</p>
<p>-Ismael</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike Parsons</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-58931</link>
		<dc:creator>Mike Parsons</dc:creator>
		<pubDate>Wed, 21 Mar 2007 20:14:15 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-58931</guid>
		<description>Hi Ismael,

As a result of reading this blog entry and trying out some of the services mentioned in the comments, I decided to go ahead and create my own &quot;simple&quot; solution... You can read about it &lt;a href=&quot;http://geekswithblogs.net/mparsons/archive/2007/03/21/109421.aspx&quot;&gt;here&lt;/a&gt;.</description>
		<content:encoded><![CDATA[<p>Hi&nbsp;Ismael,</p>
<p>As a result of reading this blog entry and trying out some of the services mentioned in the comments, I decided to go ahead and create my own &#8220;simple&#8221; solution&#8230; You can read about it&nbsp;<a href="http://geekswithblogs.net/mparsons/archive/2007/03/21/109421.aspx">here</a>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jack Pipe</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-53885</link>
		<dc:creator>Jack Pipe</dc:creator>
		<pubDate>Wed, 07 Mar 2007 19:45:35 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-53885</guid>
		<description>Need web scraping? You name the website, &lt;a href=&quot;http://www.existonline.com/webscraping.php&quot;&gt;we&lt;/a&gt; scrape the data for you.</description>
		<content:encoded><![CDATA[<p>Need web scraping? You name the website, <a href="http://www.existonline.com/webscraping.php">we</a> scrape the data for&nbsp;you.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ismael Ghalimi</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-51819</link>
		<dc:creator>Ismael Ghalimi</dc:creator>
		<pubDate>Wed, 28 Feb 2007 23:44:28 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-51819</guid>
		<description>Mike,

Thanks for sharing. I&#039;ll take a look.

Best regards
-Ismael</description>
		<content:encoded><![CDATA[<p>Mike,</p>
<p>Thanks for sharing. I&#8217;ll take a&nbsp;look.</p>
<p>Best regards<br />&nbsp;-Ismael</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike Parsons</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-51297</link>
		<dc:creator>Mike Parsons</dc:creator>
		<pubDate>Tue, 27 Feb 2007 14:37:17 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-51297</guid>
		<description>&lt;a href=&quot;http://www.clipmarks.com/&quot;&gt;Clipmarks&lt;/a&gt; looks interesting as well.</description>
		<content:encoded><![CDATA[<p><a href="http://www.clipmarks.com/">Clipmarks</a> looks interesting as&nbsp;well.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ismael Ghalimi</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-49828</link>
		<dc:creator>Ismael Ghalimi</dc:creator>
		<pubDate>Thu, 22 Feb 2007 23:32:29 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-49828</guid>
		<description>Stefan,

Thanks for the clarification. I&#039;ll give it a try then.

Any plans to develop a version that would not require any software at all?

Best regards
-Ismael</description>
		<content:encoded><![CDATA[<p>Stefan,</p>
<p>Thanks for the clarification. I&#8217;ll give it a try&nbsp;then.</p>
<p>Any plans to develop a version that would not require any software at&nbsp;all?</p>
<p>Best regards<br />&nbsp;-Ismael</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Stefan Andreasen</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-49819</link>
		<dc:creator>Stefan Andreasen</dc:creator>
		<pubDate>Thu, 22 Feb 2007 23:16:49 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-49819</guid>
		<description>OpenKapow only requires software to build the API&#039;s. When built, everyone on the web can access them from the OpenKapow service, without any plugin or software. Thus, only you building it need to download the software, nobody else needs it. What you download from OpenKapow is basically a power web browser, with built-in visual scripting that allows you to build REST, JSON, RSS, and ATOM services, and deploy them on OpenKapow. Doing what you need should only take 15 minutes, after you ran over the OpenKapow tutorial and downloaded the software.

-Stefan</description>
		<content:encoded><![CDATA[<p>OpenKapow only requires software to build the <span class="caps">API</span>&#8217;s. When built, everyone on the web can access them from the OpenKapow service, without any plugin or software. Thus, only you building it need to download the software, nobody else needs it. What you download from OpenKapow is basically a power web browser, with built-in visual scripting that allows you to build <span class="caps">REST</span>, <span class="caps">JSON</span>, <span class="caps">RSS</span>, and <span class="caps">ATOM</span> services, and deploy them on OpenKapow. Doing what you need should only take 15 minutes, after you ran over the OpenKapow tutorial and downloaded the&nbsp;software.</p>
<p>-Stefan</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ismael Ghalimi</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-49809</link>
		<dc:creator>Ismael Ghalimi</dc:creator>
		<pubDate>Thu, 22 Feb 2007 22:22:07 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-49809</guid>
		<description>Cory,

That&#039;s a great idea! I&#039;ll give it a shot.

Best regards
-Ismael</description>
		<content:encoded><![CDATA[<p>Cory,</p>
<p>That&#8217;s a great idea! I&#8217;ll give it a&nbsp;shot.</p>
<p>Best regards<br />&nbsp;-Ismael</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ismael Ghalimi</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-49807</link>
		<dc:creator>Ismael Ghalimi</dc:creator>
		<pubDate>Thu, 22 Feb 2007 22:21:37 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-49807</guid>
		<description>Mike,

OpenKapow and screen-scraper are software based. Not an option.

I will give Ponyfish a shot though.

Best regards
-Ismael</description>
		<content:encoded><![CDATA[<p>Mike,</p>
<p>OpenKapow and screen-scraper are software based. Not an&nbsp;option.</p>
<p>I will give Ponyfish a shot&nbsp;though.</p>
<p>Best regards<br />&nbsp;-Ismael</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ismael Ghalimi</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-49806</link>
		<dc:creator>Ismael Ghalimi</dc:creator>
		<pubDate>Thu, 22 Feb 2007 22:20:47 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-49806</guid>
		<description>Assaf,

Thanks for the tip. Looks good indeed.

Best regards
-Ismael</description>
		<content:encoded><![CDATA[<p>Assaf,</p>
<p>Thanks for the tip. Looks good&nbsp;indeed.</p>
<p>Best regards<br />&nbsp;-Ismael</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ismael Ghalimi</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-49804</link>
		<dc:creator>Ismael Ghalimi</dc:creator>
		<pubDate>Thu, 22 Feb 2007 22:20:19 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-49804</guid>
		<description>Nanek,

I could use Alexa&#039;s web service indeed, but I need one for Google too.

Dapper looks pretty cool indeed. I&#039;ll give it a shot.

OpenKapow seems to require the installation of software, so it&#039;s not an option.

Best regards
-Ismael</description>
		<content:encoded><![CDATA[<p>Nanek,</p>
<p>I could use Alexa&#8217;s web service indeed, but I need one for Google&nbsp;too.</p>
<p>Dapper looks pretty cool indeed. I&#8217;ll give it a&nbsp;shot.</p>
<p>OpenKapow seems to require the installation of software, so it&#8217;s not an&nbsp;option.</p>
<p>Best regards<br />&nbsp;-Ismael</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Cory</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-49655</link>
		<dc:creator>Cory</dc:creator>
		<pubDate>Thu, 22 Feb 2007 14:07:50 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-49655</guid>
		<description>Check out the &#039;Content Analysis&#039; functionality in Yahoo! Pipes. This &lt;a href=&quot;http://pipes.yahoo.com/pipes/vvW1cD212xGMiR9aqu5lkA/edit&quot;&gt;pipe&lt;/a&gt; that mashes up the New York Times front page with Flickr provides a good example.</description>
		<content:encoded><![CDATA[<p>Check out the &#8216;Content Analysis&#8217; functionality in Yahoo! Pipes. This <a href="http://pipes.yahoo.com/pipes/vvW1cD212xGMiR9aqu5lkA/edit">pipe</a> that mashes up the New York Times front page with Flickr provides a good&nbsp;example.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike Parsons</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-49613</link>
		<dc:creator>Mike Parsons</dc:creator>
		<pubDate>Thu, 22 Feb 2007 11:25:05 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-49613</guid>
		<description>Take a peek at one of these services: &lt;a href=&quot;http://www.openkapow.com/&quot;&gt;OpenKapow&lt;/a&gt;, &lt;a href=&quot;http://www.ponyfish.com/&quot;&gt;Ponyfish&lt;/a&gt;, &lt;a href=&quot;http://www.screen-scraper.com/&quot;&gt;screen-scraper&lt;/a&gt;.
</description>
		<content:encoded><![CDATA[<p>Take a peek at one of these services: <a href="http://www.openkapow.com/">OpenKapow</a>, <a href="http://www.ponyfish.com/">Ponyfish</a>,&nbsp;<a href="http://www.screen-scraper.com/">screen-scraper</a>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ryan Armasu</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-49544</link>
		<dc:creator>Ryan Armasu</dc:creator>
		<pubDate>Thu, 22 Feb 2007 07:04:35 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-49544</guid>
		<description>Take a look at &lt;a href=&quot;http://www.scrubyt.org/&quot;&gt;scRUBYt!&lt;/a&gt; This is not necessarily the tool you may be looking for, but I found it pretty interesting nonetheless.

Congratulation on the new addition to the family.

-Ryan</description>
		<content:encoded><![CDATA[<p>Take a look at <a href="http://www.scrubyt.org/">scRUBYt!</a> This is not necessarily the tool you may be looking for, but I found it pretty interesting&nbsp;nonetheless.</p>
<p>Congratulation on the new addition to the&nbsp;family.</p>
<p>-Ryan</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Assaf</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-49539</link>
		<dc:creator>Assaf</dc:creator>
		<pubDate>Thu, 22 Feb 2007 06:18:37 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-49539</guid>
		<description>I&#039;m guessing you want something like &lt;a href=&quot;http://www.dappit.com/&quot;&gt;Dapper&lt;/a&gt;, although it seems to be down now.</description>
		<content:encoded><![CDATA[<p>I&#8217;m guessing you want something like <a href="http://www.dappit.com/">Dapper</a>, although it seems to be down&nbsp;now.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nanek</title>
		<link>http://itredux.com/2007/02/20/help-needed-for-web-scraping/comment-page-1/#comment-49502</link>
		<dc:creator>Nanek</dc:creator>
		<pubDate>Thu, 22 Feb 2007 03:31:12 +0000</pubDate>
		<guid isPermaLink="false">http://itredux.com/blog/2007/02/20/help-needed-for-web-scraping/#comment-49502</guid>
		<description>Well, for the Alexa information, most of it is available through the &lt;a href=&quot;http://docs.amazonwebservices.com/AlexaWebInfoService/2005-07-11/&quot;&gt;Alexa Web Information Service&lt;/a&gt; provided by Amazon. You would have to write a simple script that would pull this, and then output as JSON or RSS.

Alternatively, and more generically, you could use a service like &lt;a href=&quot;http://www.dappit.com/&quot;&gt;Dapper&lt;/a&gt; that will walk you through a wizard and scrape information off of pages, then output it in many different formats. Its definitely worth a try, but I didn&#039;t have any luck with the Alexa pages.

Another tool that is new is &lt;a href=&quot;http://www.openkapow.com&quot;&gt;OpenKapow&lt;/a&gt;. I have not tried this one, but it looks promising based on the sample content out there.</description>
		<content:encoded><![CDATA[<p>Well, for the Alexa information, most of it is available through the <a href="http://docs.amazonwebservices.com/AlexaWebInfoService/2005-07-11/">Alexa Web Information Service</a> provided by Amazon. You would have to write a simple script that would pull this, and then output as <span class="caps">JSON</span> or&nbsp;<span class="caps">RSS</span>.</p>
<p>Alternatively, and more generically, you could use a service like <a href="http://www.dappit.com/">Dapper</a> that will walk you through a wizard and scrape information off of pages, then output it in many different formats. Its definitely worth a try, but I didn&#8217;t have any luck with the Alexa&nbsp;pages.</p>
<p>Another tool that is new is <a href="http://www.openkapow.com">OpenKapow</a>. I have not tried this one, but it looks promising based on the sample content out&nbsp;there.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
