<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel> 

	<title>Comments on: Bots, spiders and crawlers, oh my!</title>
	<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my/</link>
	<description>Comments on MetaFilter post Bots, spiders and crawlers, oh my!</description>
	<pubDate>Thu, 16 Mar 2006 07:14:33 -0800</pubDate>
	<lastBuildDate>Thu, 16 Mar 2006 07:14:33 -0800</lastBuildDate>
	<language>en-us</language>
	<docs>http://blogs.law.harvard.edu/tech/rss</docs>
	<ttl>60</ttl>

	<item>
		<title>Bots, spiders and crawlers, oh my!</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my</link>	
		<description>&quot;Imagine this: a &lt;a href=&quot;http://www.crt.net.au/etopics/webbots.htm&quot;&gt;digital butler&lt;/a&gt; that roams the Internet, intuitively knowing your likes and dislikes, retrieving perfect strands of news and information that you never would have discovered through old-fashioned surfing.&quot; There&apos;s &lt;a href=&quot;http://www.geek.com/news/geeknews/2001jan/gee20010201004094.htm&quot;&gt;RumorBot&lt;/a&gt; and the &lt;a href=&quot;http://prime.jsc.nasa.gov/iliad/&quot;&gt;Iliad fetchbot (perfect for bot newbies)&lt;/a&gt;, or you can try your hand at writing your own in php using &lt;a href=&quot;http://thiefsystems.org/ccs/phpregexspider&quot;&gt;this&lt;/a&gt; as a tutorial, or if you prefer, &lt;a href=&quot;http://www.codeguru.com/Cpp/I-N/internet/generalinternet/article.php/c3413&quot;&gt;C++&lt;/a&gt; or &lt;a href=&quot;http://search.cpan.org/dist/WWW-Robot/lib/WWW/Robot.pm#VERSION&quot;&gt;Perl&lt;/a&gt;.</description>
		<guid isPermaLink="false">post:www.metafilter.com,2006:site.50113</guid>
		<pubDate>Thu, 16 Mar 2006 07:08:08 -0800</pubDate>
		<dc:creator>sluglicker</dc:creator>		<category>bots</category>		<category>digitalbutler</category>		<category>rumorbot</category>		<category>iliadfetchbot</category>		<category>fetchbot</category>
	</item>	<item>
		<title>By: Yer-Ol-Pal</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my#1247185</link>	
		<description>Ithought this was the purpose of Metafilter?</description>
		<guid isPermaLink="false">comment:www.metafilter.com,2006:site.50113-1247185</guid>
		<pubDate>Thu, 16 Mar 2006 07:14:33 -0800</pubDate>
		<dc:creator>Yer-Ol-Pal</dc:creator>
	</item>	<item>
		<title>By: Faint of Butt</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my#1247193</link>	
		<description>Y-O-P beat me to it. All of you guys are my content-aggregation bots.</description>
		<guid isPermaLink="false">comment:www.metafilter.com,2006:site.50113-1247193</guid>
		<pubDate>Thu, 16 Mar 2006 07:17:26 -0800</pubDate>
		<dc:creator>Faint of Butt</dc:creator>
	</item>	<item>
		<title>By: sluglicker</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my#1247199</link>	
		<description>@Yer-Ol-Pal 
Sure, if you only want to leech. If you want to contribute, try a bot!

Correction: the Perl link should be &lt;a href=&quot;http://search.cpan.org/dist/WWW-Robot/lib/WWW/Robot.pm#&quot;&gt;Perl&lt;/a&gt;.</description>
		<guid isPermaLink="false">comment:www.metafilter.com,2006:site.50113-1247199</guid>
		<pubDate>Thu, 16 Mar 2006 07:23:22 -0800</pubDate>
		<dc:creator>sluglicker</dc:creator>
	</item>	<item>
		<title>By: Brian James</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my#1247223</link>	
		<description>Haven&apos;t personalized content-aggregation tools such as this one been the Next Big Thing for the last, like, 15 years?</description>
		<guid isPermaLink="false">comment:www.metafilter.com,2006:site.50113-1247223</guid>
		<pubDate>Thu, 16 Mar 2006 07:44:15 -0800</pubDate>
		<dc:creator>Brian James</dc:creator>
	</item>	<item>
		<title>By: mekanic</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my#1247226</link>	
		<description>Speaking as a php, c++, etc. illiterate, it seems that &lt;a href=&quot;http://www.php-for-beginners.co.uk/article/item/61/Basic_PHP_Page_Crawler/&quot;&gt;this page&lt;/a&gt; might be a little more helpful as a tutorial for building your own bot. It has lots of other php beginner tutorials as well.</description>
		<guid isPermaLink="false">comment:www.metafilter.com,2006:site.50113-1247226</guid>
		<pubDate>Thu, 16 Mar 2006 07:46:59 -0800</pubDate>
		<dc:creator>mekanic</dc:creator>
	</item>	<item>
		<title>By: mikepop</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my#1247236</link>	
		<description>There is also &lt;a href=&quot;http://www.crummy.com/software/UltraGleeper/&quot;&gt;The Ultra Gleeper&lt;/a&gt;</description>
		<guid isPermaLink="false">comment:www.metafilter.com,2006:site.50113-1247236</guid>
		<pubDate>Thu, 16 Mar 2006 07:55:06 -0800</pubDate>
		<dc:creator>mikepop</dc:creator>
	</item>	<item>
		<title>By: killdevil</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my#1247269</link>	
		<description>I begin to imagine a digital butler, thinking back to 1997 or so when such concepts were aired regularly, but then I start getting angry.</description>
		<guid isPermaLink="false">comment:www.metafilter.com,2006:site.50113-1247269</guid>
		<pubDate>Thu, 16 Mar 2006 08:21:32 -0800</pubDate>
		<dc:creator>killdevil</dc:creator>
	</item>	<item>
		<title>By: dammitjim</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my#1247276</link>	
		<description>The main link, the one that the quote is taken from, reads as though it was written in 1996, in a breathless issue of Wired (&quot;Push! The next big thing!&quot;). There&apos;s a reason that nothing has really happened with this idea: nobody knows quite how to do it yet. At least, nobody knows what to build that will be:&lt;ul&gt;&lt;li&gt;broadly useful&lt;li&gt;simple to instruct and direct&lt;li&gt;more than just a script that periodically performs searches that the user tells it to&lt;/li&gt;&lt;/li&gt;&lt;/li&gt;&lt;/ul&gt;How do you make this &quot;butler&quot; universally available to the user? From where does it execute its actions? On the user&apos;s home computer? That&apos;s not very useful, not unless the user is home all the time. On the user&apos;s phone service, and talking to them through their mobile? This idea is not ready for prime time yet. Somebody will make a lot of money if it does come true, though.</description>
		<guid isPermaLink="false">comment:www.metafilter.com,2006:site.50113-1247276</guid>
		<pubDate>Thu, 16 Mar 2006 08:26:48 -0800</pubDate>
		<dc:creator>dammitjim</dc:creator>
	</item>	<item>
		<title>By: ernie</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my#1247299</link>	
		<description>Just ask &lt;a href=&quot;www.ask.com&quot;&gt;Jeeves&lt;/a&gt;</description>
		<guid isPermaLink="false">comment:www.metafilter.com,2006:site.50113-1247299</guid>
		<pubDate>Thu, 16 Mar 2006 08:45:01 -0800</pubDate>
		<dc:creator>ernie</dc:creator>
	</item>	<item>
		<title>By: Ynoxas</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my#1247401</link>	
		<description>Didn&apos;t I have something just like this back in like 1997 or 1998?  And didn&apos;t it completely NOT work?  And wasn&apos;t it pretty well decided that web viewing is an ACTIVE activity, not a passive one?

Am I the only one that thinks push is unnecessary?  

I just fail to see much difference between having a story from CNN &quot;pushed&quot; to me so I click the link to view it, or viewing the link directly because I navigated to CNN.com.

Agents have been highly overrated for a very long time.</description>
		<guid isPermaLink="false">comment:www.metafilter.com,2006:site.50113-1247401</guid>
		<pubDate>Thu, 16 Mar 2006 09:46:05 -0800</pubDate>
		<dc:creator>Ynoxas</dc:creator>
	</item>	<item>
		<title>By: Dr. Twist</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my#1247675</link>	
		<description>does anyone else remember &lt;a href=&quot;http://en.wikipedia.org/wiki/Knowledge_navigator&quot;&gt;knowledge navigator&lt;/a&gt;?</description>
		<guid isPermaLink="false">comment:www.metafilter.com,2006:site.50113-1247675</guid>
		<pubDate>Thu, 16 Mar 2006 12:05:55 -0800</pubDate>
		<dc:creator>Dr. Twist</dc:creator>
	</item>	<item>
		<title>By: Artw</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my#1247830</link>	
		<description>Imagine a digital paperclip, appearing as you compose a document to give you help and advice...</description>
		<guid isPermaLink="false">comment:www.metafilter.com,2006:site.50113-1247830</guid>
		<pubDate>Thu, 16 Mar 2006 13:33:38 -0800</pubDate>
		<dc:creator>Artw</dc:creator>
	</item>	<item>
		<title>By: reklaw</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my#1247931</link>	
		<description>ernie: &lt;a href=&quot;http://sp.askforkids.com/en/docs/askforkids/help/where_is_jeeves.htm&quot;&gt;Jeeves is gone.&lt;/a&gt;</description>
		<guid isPermaLink="false">comment:www.metafilter.com,2006:site.50113-1247931</guid>
		<pubDate>Thu, 16 Mar 2006 14:27:18 -0800</pubDate>
		<dc:creator>reklaw</dc:creator>
	</item>	<item>
		<title>By: Pinback</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my#1248364</link>	
		<description>&lt;em&gt;Haven&apos;t personalized content-aggregation tools such as this one been the Next Big Thing for the last, like, 15 years?&lt;/em&gt;

Yes. And, if it ever comes to pass, it&apos;ll be like all those people who listen religiously to &lt;a href=&quot;http://en.wikipedia.org/wiki/Alan_Jones_%28radio%29&quot;&gt;Alan Jones&lt;/a&gt;, &lt;a href=&quot;http://en.wikipedia.org/wiki/John_Laws&quot;&gt;John Laws&lt;/a&gt;, or &lt;a href=&quot;http://en.wikipedia.org/wiki/Bill_O%27Reilly_%28commentator%29&quot;&gt;Bill O&apos;Reilly&lt;/a&gt; -  a self-selected audience who only see what they want to see, who never have their beliefs challenged or opinions confronted (except to foment outrage), and who believe everything they&apos;re told.

Imagine a million individual LGFs - left wing, right wing, and every shade in between - consisting of one person plus their &quot;intelligent agent&quot; / &quot;digital butler&quot;. Just as the porn industry can provide for your every fantasy no matter how twisted, so will the content industry come to supply information/opinion tailor-made for every prejudice and belief.

The only winners will be those who provide this content. And those who choose not to play.</description>
		<guid isPermaLink="false">comment:www.metafilter.com,2006:site.50113-1248364</guid>
		<pubDate>Fri, 17 Mar 2006 00:17:30 -0800</pubDate>
		<dc:creator>Pinback</dc:creator>
	</item>	<item>
		<title>By: sluglicker</title>
		<link>http://www.metafilter.com/50113/Bots-spiders-and-crawlers-oh-my#1248603</link>	
		<description>With the exception of dammitjim, did any of you actually read the linked texts? Your comments have nothing to do with the content of this post. And Pinback: WTF are you talking about? Bah!...Perl before swine.</description>
		<guid isPermaLink="false">comment:www.metafilter.com,2006:site.50113-1248603</guid>
		<pubDate>Fri, 17 Mar 2006 08:48:30 -0800</pubDate>
		<dc:creator>sluglicker</dc:creator>
	</item>
	</channel>
</rss>
