<?xml version="1.0" encoding="utf-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Internal links, and search engine crawlers</title>
	<atom:link href="http://www.mysociety.org/2008/07/17/internal-links-and-search-engine-crawlers/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mysociety.org/2008/07/17/internal-links-and-search-engine-crawlers/</link>
	<description>Relentless user-focus on civic websites</description>
	<lastBuildDate>Fri, 10 Feb 2012 01:00:32 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.5</generator>
	<item>
		<title>By: Matthew</title>
		<link>http://www.mysociety.org/2008/07/17/internal-links-and-search-engine-crawlers/comment-page-1/#comment-10754</link>
		<dc:creator>Matthew</dc:creator>
		<pubDate>Mon, 08 Sep 2008 13:13:43 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysociety.org/2008/07/17/internal-links-and-search-engine-crawlers/#comment-10754</guid>
		<description>The problem is there is no quote - they simply say &quot;I refer the hon. Gentleman to the previous answer I gave [Official Report, 29 February 2008, column 1425]&quot; - so there&#039;s no way to know automatically which speech/answer within that column they mean (the parser could try to match e.g. speaker name, or even subject, but that&#039;s hard graft for not much gain).</description>
		<content:encoded><![CDATA[<p>The problem is there is no quote &#8211; they simply say &#8220;I refer the hon. Gentleman to the previous answer I gave [Official Report, 29 February 2008, column 1425]&#8221; &#8211; so there&#8217;s no way to know automatically which speech/answer within that column they mean (the parser could try to match e.g. speaker name, or even subject, but that&#8217;s hard graft for not much gain).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark Longair</title>
		<link>http://www.mysociety.org/2008/07/17/internal-links-and-search-engine-crawlers/comment-page-1/#comment-10286</link>
		<dc:creator>Mark Longair</dc:creator>
		<pubDate>Sun, 07 Sep 2008 15:47:26 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysociety.org/2008/07/17/internal-links-and-search-engine-crawlers/#comment-10286</guid>
		<description>For what it&#039;s worth, quotations in the Scottish Parliament text on TheyWorkForYou should mostly link to the quoted text by using a different mechanism - the parser tries to match substrings of the quotation in the speeches from the particular day that&#039;s referenced.  This isn&#039;t generally useful in the same way, of course, but I thought it worked surprisingly well, and is perhaps worth mentioning in this context.

&lt;i&gt;&quot;Perhaps in future we’ll be able to add some crowd-sourcing game to match the reference to the exact speech&quot;&lt;/i&gt;

I don&#039;t think you need to do this (in most cases, anyway) based on the experience of automatic matching of quotations in the SP parser.</description>
		<content:encoded><![CDATA[<p>For what it&#8217;s worth, quotations in the Scottish Parliament text on TheyWorkForYou should mostly link to the quoted text by using a different mechanism &#8211; the parser tries to match substrings of the quotation in the speeches from the particular day that&#8217;s referenced.  This isn&#8217;t generally useful in the same way, of course, but I thought it worked surprisingly well, and is perhaps worth mentioning in this context.</p>
<p><i>&#8220;Perhaps in future we’ll be able to add some crowd-sourcing game to match the reference to the exact speech&#8221;</i></p>
<p>I don&#8217;t think you need to do this (in most cases, anyway) based on the experience of automatic matching of quotations in the SP parser.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Francis Irving</title>
		<link>http://www.mysociety.org/2008/07/17/internal-links-and-search-engine-crawlers/comment-page-1/#comment-1988</link>
		<dc:creator>Francis Irving</dc:creator>
		<pubDate>Thu, 17 Jul 2008 18:55:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysociety.org/2008/07/17/internal-links-and-search-engine-crawlers/#comment-1988</guid>
		<description>It seems Yahoo parses the robots.txt file in an interesting way. Firstly, our file was wrong because it did not include the Disallow lines for Yahoo as well, which explains why Yahoo was crawling /user. It does not explain why it was still crawling so quickly, though.

Matthew&#039;s patch to our robots.txt:
https://secure.mysociety.org/cvstrac/chngview?cn=12284</description>
		<content:encoded><![CDATA[<p>It seems Yahoo parses the robots.txt file in an interesting way. Firstly, our file was wrong because it did not include the Disallow lines for Yahoo as well, which explains why Yahoo was crawling /user. It does not explain why it was still crawling so quickly, though.</p>
<p>Matthew&#8217;s patch to our robots.txt:<br />
<a href="https://secure.mysociety.org/cvstrac/chngview?cn=12284" rel="nofollow">https://secure.mysociety.org/cvstrac/chngview?cn=12284</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>

