<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: We have an API</title>
	<atom:link href="http://nat.org/blog/2010/03/we-have-an-api/feed/" rel="self" type="application/rss+xml" />
	<link>http://nat.org/blog/2010/03/we-have-an-api/</link>
	<description></description>
	<lastBuildDate>Mon, 07 May 2012 20:05:57 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Nat Friedman</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6534</link>
		<dc:creator>Nat Friedman</dc:creator>
		<pubDate>Fri, 02 Apr 2010 23:04:56 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6534</guid>
		<description>Thanks for responding again. Time/resources - that&#039;s what I figured.

I felt like your web signup form was a good thing because it did ask for a fair amount of info about the app and why I am using your API. So I would expect that you have decent info there.</description>
		<content:encoded><![CDATA[<p>Thanks for responding again. Time/resources &#8211; that&#8217;s what I figured.</p>
<p>I felt like your web signup form was a good thing because it did ask for a fair amount of info about the app and why I am using your API. So I would expect that you have decent info there.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Derek Gottfrid</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6532</link>
		<dc:creator>Derek Gottfrid</dc:creator>
		<pubDate>Fri, 02 Apr 2010 12:51:28 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6532</guid>
		<description>It is a two fold problem one is simply finding the time. It is isn&#039;t much work - and I do love the idea of a sqllite db (we have used it here internally for a very very long time) but setting aside the time amidst all the other things we are trying to do. 

One of the things we want to do and haven&#039;t entirely solved is tracking where our data/apis are being used. It is important for the business to know where the data is flowing so we can identify new opportunities and be an actually useful partner w/ developers.</description>
		<content:encoded><![CDATA[<p>It is a two fold problem one is simply finding the time. It is isn&#8217;t much work &#8211; and I do love the idea of a sqllite db (we have used it here internally for a very very long time) but setting aside the time amidst all the other things we are trying to do. </p>
<p>One of the things we want to do and haven&#8217;t entirely solved is tracking where our data/apis are being used. It is important for the business to know where the data is flowing so we can identify new opportunities and be an actually useful partner w/ developers.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mattmole</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6530</link>
		<dc:creator>mattmole</dc:creator>
		<pubDate>Thu, 01 Apr 2010 10:40:51 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6530</guid>
		<description>Why not provide the API in couchDB format. That way the built in replication feature could be used to pull down new data at specified times. This could also count as the full allocation of requests for a day</description>
		<content:encoded><![CDATA[<p>Why not provide the API in couchDB format. That way the built in replication feature could be used to pull down new data at specified times. This could also count as the full allocation of requests for a day</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Niel</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6529</link>
		<dc:creator>Niel</dc:creator>
		<pubDate>Wed, 31 Mar 2010 14:58:23 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6529</guid>
		<description>Just in the last couple of days I&#039;ve been researching some claims by our local school board member that 70% of the voters in our county are 62 or older. Trying to find the data has been an interesting experiment in open data.  It turns out that the elections commission only offers the data in a PDF of a printed report, and that comes from the state anyway. I would much prefer to have been able to download raw data, I don&#039;t want an API at all.

Oh, and it turns out that the number is 17%, not 70%.  So now I&#039;m not sure if my source misheard or if the board member misspoke, but either way it&#039;s good to be able to dig into the raw data myself.</description>
		<content:encoded><![CDATA[<p>Just in the last couple of days I&#8217;ve been researching some claims by our local school board member that 70% of the voters in our county are 62 or older. Trying to find the data has been an interesting experiment in open data.  It turns out that the elections commission only offers the data in a PDF of a printed report, and that comes from the state anyway. I would much prefer to have been able to download raw data, I don&#8217;t want an API at all.</p>
<p>Oh, and it turns out that the number is 17%, not 70%.  So now I&#8217;m not sure if my source misheard or if the board member misspoke, but either way it&#8217;s good to be able to dig into the raw data myself.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nat Torkington</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6528</link>
		<dc:creator>Nat Torkington</dc:creator>
		<pubDate>Wed, 31 Mar 2010 02:37:56 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6528</guid>
		<description>@Clay if &quot;API&quot; is sexy and &quot;bulk data download&quot; is not, perhaps we need to talk about offering &quot;BD2 Compatibility&quot;. (half-serious)</description>
		<content:encoded><![CDATA[<p>@Clay if &#8220;API&#8221; is sexy and &#8220;bulk data download&#8221; is not, perhaps we need to talk about offering &#8220;BD2 Compatibility&#8221;. (half-serious)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nat Friedman</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6527</link>
		<dc:creator>Nat Friedman</dc:creator>
		<pubDate>Wed, 31 Mar 2010 02:32:16 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6527</guid>
		<description>&gt; It’s remarkable how many gov agencies want to jump into API land without providing bulk access *first*, but generally I don’t think it’s because of “information control” issues, rather, it’s because API is a three letter acronym– and the government thinks three letter acronyms are really sexy.

That&#039;s what I figured too. But it&#039;s so much HARDER in many cases to provide an API than to just put up a database dump. So it&#039;s funny that they get it backwards.

(Thanks for commenting, I&#039;m a big fan of Sunlight Labs!)</description>
		<content:encoded><![CDATA[<p>> It’s remarkable how many gov agencies want to jump into API land without providing bulk access *first*, but generally I don’t think it’s because of “information control” issues, rather, it’s because API is a three letter acronym– and the government thinks three letter acronyms are really sexy.</p>
<p>That&#8217;s what I figured too. But it&#8217;s so much HARDER in many cases to provide an API than to just put up a database dump. So it&#8217;s funny that they get it backwards.</p>
<p>(Thanks for commenting, I&#8217;m a big fan of Sunlight Labs!)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nat Friedman</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6526</link>
		<dc:creator>Nat Friedman</dc:creator>
		<pubDate>Wed, 31 Mar 2010 02:27:32 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6526</guid>
		<description>Thanks for writing Derek! For the record, I am very impressed and happy that NYTimes provides this data in a computer-readable form at all. 

If you do add a downloadable list in the future, that would be great - and I&#039;d be curious if you do have any business reasons not to do that, or if it&#039;s just a matter of time and priorities.</description>
		<content:encoded><![CDATA[<p>Thanks for writing Derek! For the record, I am very impressed and happy that NYTimes provides this data in a computer-readable form at all. </p>
<p>If you do add a downloadable list in the future, that would be great &#8211; and I&#8217;d be curious if you do have any business reasons not to do that, or if it&#8217;s just a matter of time and priorities.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nat Friedman</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6525</link>
		<dc:creator>Nat Friedman</dc:creator>
		<pubDate>Wed, 31 Mar 2010 02:25:05 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6525</guid>
		<description>Well put Francis. I&#039;m not against API, I just want an additional entry point that gives me a dump of all the data.</description>
		<content:encoded><![CDATA[<p>Well put Francis. I&#8217;m not against API, I just want an additional entry point that gives me a dump of all the data.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6524</link>
		<dc:creator>Michael</dc:creator>
		<pubDate>Wed, 31 Mar 2010 00:48:48 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6524</guid>
		<description>Hard to say that they are or not - they definitely make money on the list itself. I think rate limits are fine for most people. If you need to have it raised just send an email.</description>
		<content:encoded><![CDATA[<p>Hard to say that they are or not &#8211; they definitely make money on the list itself. I think rate limits are fine for most people. If you need to have it raised just send an email.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Oliver Charles</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6523</link>
		<dc:creator>Oliver Charles</dc:creator>
		<pubDate>Tue, 30 Mar 2010 21:27:58 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6523</guid>
		<description>Amen! At MusicBrainz, we offer a public database dump that is regenerated weekly I think. It does require some PostgreSQL specific extensions, but it&#039;s the best we can do. We also offer a live replication stream (paid for) with hourly replication packets. And then on top of that we also offer a web service with a 1 query per second limit. They all get heavily used, and we certainly wouldn&#039;t ever think about removing one of the options now.</description>
		<content:encoded><![CDATA[<p>Amen! At MusicBrainz, we offer a public database dump that is regenerated weekly I think. It does require some PostgreSQL specific extensions, but it&#8217;s the best we can do. We also offer a live replication stream (paid for) with hourly replication packets. And then on top of that we also offer a web service with a 1 query per second limit. They all get heavily used, and we certainly wouldn&#8217;t ever think about removing one of the options now.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Clay Johnson</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6522</link>
		<dc:creator>Clay Johnson</dc:creator>
		<pubDate>Tue, 30 Mar 2010 20:51:07 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6522</guid>
		<description>At Sunlight Labs (we advocate for Gov open data), the concept of we&#039;re advocating for bulk access to data. Generally, what we think government ought to be doing is the following-- for just about every data-driven website it creates:

1. Provide bulk access to the data that powers the site in some kind of machine readable format. CSV, SQLite, JSON, XML-- just use something that makes some kind of sense. Not a .tiff file or a .pdf (unless it has the source documents appended)

2. If the datasets are significantly large-- it&#039;s cool to provide a functional, registration-less API on top of that dataset, but once you go into registration-land and I have to give you my email address to get a key to that dataset-- well, no. I already paid for the data. It&#039;s public domain. Make it free with no strings attached.

3. Make your website powered by the same bulk data or API you&#039;re providing. Don&#039;t provide bulk data, then build your own website with some better, cleaner copy of that data. That&#039;ll make it so you take care of the bulk data like you take care of the graphics on your website. 

It&#039;s remarkable how many gov agencies want to jump into API land without providing bulk access *first*, but generally I don&#039;t think it&#039;s because of &quot;information control&quot; issues, rather, it&#039;s because API is a three letter acronym-- and the government thinks three letter acronyms are really sexy.</description>
		<content:encoded><![CDATA[<p>At Sunlight Labs (we advocate for Gov open data), the concept of we&#8217;re advocating for bulk access to data. Generally, what we think government ought to be doing is the following&#8211; for just about every data-driven website it creates:</p>
<p>1. Provide bulk access to the data that powers the site in some kind of machine readable format. CSV, SQLite, JSON, XML&#8211; just use something that makes some kind of sense. Not a .tiff file or a .pdf (unless it has the source documents appended)</p>
<p>2. If the datasets are significantly large&#8211; it&#8217;s cool to provide a functional, registration-less API on top of that dataset, but once you go into registration-land and I have to give you my email address to get a key to that dataset&#8211; well, no. I already paid for the data. It&#8217;s public domain. Make it free with no strings attached.</p>
<p>3. Make your website powered by the same bulk data or API you&#8217;re providing. Don&#8217;t provide bulk data, then build your own website with some better, cleaner copy of that data. That&#8217;ll make it so you take care of the bulk data like you take care of the graphics on your website. </p>
<p>It&#8217;s remarkable how many gov agencies want to jump into API land without providing bulk access *first*, but generally I don&#8217;t think it&#8217;s because of &#8220;information control&#8221; issues, rather, it&#8217;s because API is a three letter acronym&#8211; and the government thinks three letter acronyms are really sexy.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Chris Anderson</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6519</link>
		<dc:creator>Chris Anderson</dc:creator>
		<pubDate>Tue, 30 Mar 2010 16:53:49 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6519</guid>
		<description>You might be interested in looking at CouchDB&#039;s replication support. It was designed with exactly this use case in mind.</description>
		<content:encoded><![CDATA[<p>You might be interested in looking at CouchDB&#8217;s replication support. It was designed with exactly this use case in mind.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nader</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6518</link>
		<dc:creator>Nader</dc:creator>
		<pubDate>Tue, 30 Mar 2010 15:04:51 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6518</guid>
		<description>For our Open Policitics API &quot;Deutschland API&quot; (http://deutschland-api.de) we&#039;re not limiting requests and provide all kinds of Query Languages (for ex. YQL). 

Publishing the full data set as well is a good suggestion though!</description>
		<content:encoded><![CDATA[<p>For our Open Policitics API &#8220;Deutschland API&#8221; (<a href="http://deutschland-api.de" rel="nofollow">http://deutschland-api.de</a>) we&#8217;re not limiting requests and provide all kinds of Query Languages (for ex. YQL). </p>
<p>Publishing the full data set as well is a good suggestion though!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Derek Gottfrid</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6517</link>
		<dc:creator>Derek Gottfrid</dc:creator>
		<pubDate>Tue, 30 Mar 2010 14:56:24 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6517</guid>
		<description>Great stuff here. I like Nat&#039;s point and the discussion that follows. Ultimately, we want to provide data in a variety of different means.  It is still early on for nytimes and apis/data - stay tuned and keep the thoughtful discussion going because we are listening.</description>
		<content:encoded><![CDATA[<p>Great stuff here. I like Nat&#8217;s point and the discussion that follows. Ultimately, we want to provide data in a variety of different means.  It is still early on for nytimes and apis/data &#8211; stay tuned and keep the thoughtful discussion going because we are listening.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jay Woods</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6516</link>
		<dc:creator>Jay Woods</dc:creator>
		<pubDate>Tue, 30 Mar 2010 14:46:35 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6516</guid>
		<description>To me this looks like the reason that RSS came into existance. It is not necessary that the copy of the database needs to be available from the original database. One person can take it upon themselves to have a sync&#039;d copy and makes available a downloadable copy when called for and updates as requested. 

We seldom ask why a person would subscribe to an RSS feed rather than an email list nor is it fruitful here to ask why someone would want a sync&#039;d copy of the database. They just do. Overall it is easier to handle the results of an RSS feed than the multiple formats of the received emails. Likewise, it is easier for the recipient to squirrel away the results of a sync&#039;d copy than the multiple formats of the received queries against a lot of different databases.</description>
		<content:encoded><![CDATA[<p>To me this looks like the reason that RSS came into existance. It is not necessary that the copy of the database needs to be available from the original database. One person can take it upon themselves to have a sync&#8217;d copy and makes available a downloadable copy when called for and updates as requested. </p>
<p>We seldom ask why a person would subscribe to an RSS feed rather than an email list nor is it fruitful here to ask why someone would want a sync&#8217;d copy of the database. They just do. Overall it is easier to handle the results of an RSS feed than the multiple formats of the received emails. Likewise, it is easier for the recipient to squirrel away the results of a sync&#8217;d copy than the multiple formats of the received queries against a lot of different databases.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pete Warden</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6515</link>
		<dc:creator>Pete Warden</dc:creator>
		<pubDate>Tue, 30 Mar 2010 14:43:04 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6515</guid>
		<description>I&#039;m on a crusade to create analyzable data sets too, but to a lot of companies it&#039;s quite a threatening act. You can see the result of my planned release of crawled public profile data from Facebook here:
http://petewarden.typepad.com/searchbrowser/2010/03/facebook-data-destruction.html

You should also check out Infochimps, they&#039;re trying to build a business around distributing these sort of data sets:
http://infochimps.com/</description>
		<content:encoded><![CDATA[<p>I&#8217;m on a crusade to create analyzable data sets too, but to a lot of companies it&#8217;s quite a threatening act. You can see the result of my planned release of crawled public profile data from Facebook here:<br />
<a href="http://petewarden.typepad.com/searchbrowser/2010/03/facebook-data-destruction.html" rel="nofollow">http://petewarden.typepad.com/searchbrowser/2010/03/facebook-data-destruction.html</a></p>
<p>You should also check out Infochimps, they&#8217;re trying to build a business around distributing these sort of data sets:<br />
<a href="http://infochimps.com/" rel="nofollow">http://infochimps.com/</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6514</link>
		<dc:creator>Mike</dc:creator>
		<pubDate>Tue, 30 Mar 2010 14:10:02 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6514</guid>
		<description>You could have just used Tor to make your connections originate from a large number of different IPs - http://www.torproject.org/</description>
		<content:encoded><![CDATA[<p>You could have just used Tor to make your connections originate from a large number of different IPs &#8211; <a href="http://www.torproject.org/" rel="nofollow">http://www.torproject.org/</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Hans Granqvist</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6513</link>
		<dc:creator>Hans Granqvist</dc:creator>
		<pubDate>Tue, 30 Mar 2010 13:57:51 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6513</guid>
		<description>Good APIs let you download the main bulk and let you do the deltas. It makes sense for all. Here&#039;s how we do it for Netflix: http://developer.netflix.com/docs/REST_API_Reference#0_42335</description>
		<content:encoded><![CDATA[<p>Good APIs let you download the main bulk and let you do the deltas. It makes sense for all. Here&#8217;s how we do it for Netflix: <a href="http://developer.netflix.com/docs/REST_API_Reference#0_42335" rel="nofollow">http://developer.netflix.com/docs/REST_API_Reference#0_42335</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jack</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6512</link>
		<dc:creator>Jack</dc:creator>
		<pubDate>Mon, 29 Mar 2010 22:26:55 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6512</guid>
		<description>@Miguel de Icaza

&quot;There is also the Dallas market place that allows people to publish their oData sources and decide how they want to charge for it:

https://www.sqlazureservices.com/Catalog.aspx&quot;

One has to admire Microsoft for doing again something ambiguous. That page is filled with *free* things that in fact are *TRIAL OFFERS*; even one from a government.</description>
		<content:encoded><![CDATA[<p>@Miguel de Icaza</p>
<p>&#8220;There is also the Dallas market place that allows people to publish their oData sources and decide how they want to charge for it:</p>
<p><a href="https://www.sqlazureservices.com/Catalog.aspx" rel="nofollow">https://www.sqlazureservices.com/Catalog.aspx</a>&#8221;</p>
<p>One has to admire Microsoft for doing again something ambiguous. That page is filled with *free* things that in fact are *TRIAL OFFERS*; even one from a government.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nat Friedman</title>
		<link>http://nat.org/blog/2010/03/we-have-an-api/comment-page-1/#comment-6511</link>
		<dc:creator>Nat Friedman</dc:creator>
		<pubDate>Mon, 29 Mar 2010 19:15:01 +0000</pubDate>
		<guid isPermaLink="false">http://nat.org/blog/?p=1565#comment-6511</guid>
		<description>Doesn&#039;t have to be sqlite. I just find it to be a really handy way to pass around large datasets. But CSV or JSON or XML is fine too. Actually CSV is not that great since no one implements it properly - the others are fine though.

I&#039;m curious why you dislike sqlite though...</description>
		<content:encoded><![CDATA[<p>Doesn&#8217;t have to be sqlite. I just find it to be a really handy way to pass around large datasets. But CSV or JSON or XML is fine too. Actually CSV is not that great since no one implements it properly &#8211; the others are fine though.</p>
<p>I&#8217;m curious why you dislike sqlite though&#8230;</p>
]]></content:encoded>
	</item>
</channel>
</rss>

