<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>dataist</title>
	<atom:link href="http://dataist.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://dataist.wordpress.com</link>
	<description>a blog about data exploration</description>
	<lastBuildDate>Fri, 17 Feb 2012 08:09:53 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='dataist.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>dataist</title>
		<link>http://dataist.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://dataist.wordpress.com/osd.xml" title="dataist" />
	<atom:link rel='hub' href='http://dataist.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Interactive: Haavisto&#8217;s great challenge</title>
		<link>http://dataist.wordpress.com/2012/01/25/interactive-haavistos-great-challange/</link>
		<comments>http://dataist.wordpress.com/2012/01/25/interactive-haavistos-great-challange/#comments</comments>
		<pubDate>Tue, 24 Jan 2012 22:44:42 +0000</pubDate>
		<dc:creator>Jens Finnäs</dc:creator>
				<category><![CDATA[Own projects]]></category>
		<category><![CDATA[elections]]></category>
		<category><![CDATA[interactive]]></category>
		<category><![CDATA[pekka haavisto]]></category>
		<category><![CDATA[raphael]]></category>
		<category><![CDATA[sauli niinistö]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://dataist.wordpress.com/?p=251</guid>
		<description><![CDATA[The first round of the presidential election in Finland was held on Sunday. It was a super-exciting race with Sauli Niinistö coming out on top, but with Pekka Haavisto of the Green Party as the great surprise. Haavisto finished second, just before the grand ol&#8217; man of the Center Party, Paavo Väyrynen. Haavisto is having [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=251&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The first round of the presidential election in Finland was held on Sunday. It was a super-exciting race with Sauli Niinistö coming out on top, but with Pekka Haavisto of the Green Party as the great surprise. Haavisto finished second, just before the grand ol&#8217; man of the Center Party, Paavo Väyrynen.</p>
<p>Haavisto is having great momentum and is quickly rising in the polls. But can he really take on Sauli Niinistö who has been the favorite for years already?</p>
<p>With this interactive visualization I will show that Pekka Haavisto face a great challenge in the second round. In the first round he got 570.000 votes. Niinistö got twice as much. 1,4 million voters will have to find a new candidate in the second round. Haavisto will have to get about 70 percent of those votes, which won&#8217;t be easy considering he is the liberal alternative of the two finalists and a lot of the undecided voters are conservatives.</p>
<p>Anyway here is the visualization. It lets you drag and drop the votes of the candidates that didn&#8217;t make it to the second round. Hopefully it gives you an idea of the effort that Haavisto will have to go through to stand a chance. But who knows? He has surprised us once already.</p>
<div id="attachment_252" class="wp-caption aligncenter" style="width: 600px"><a href="http://jensfinnas.com/dataist/presidential_puzzle/"><img class="size-full wp-image-252 " title="open viz." src="http://dataist.files.wordpress.com/2012/01/skacc88rmavbild-2012-01-24-kl-23-24-40.png?w=590&#038;h=358" alt="" width="590" height="358" /></a><p class="wp-caption-text">Open interactive visualization in new window.</p></div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dataist.wordpress.com/251/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dataist.wordpress.com/251/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/dataist.wordpress.com/251/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dataist.wordpress.com/251/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/dataist.wordpress.com/251/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/dataist.wordpress.com/251/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/dataist.wordpress.com/251/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/dataist.wordpress.com/251/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/dataist.wordpress.com/251/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dataist.wordpress.com/251/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/dataist.wordpress.com/251/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dataist.wordpress.com/251/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/dataist.wordpress.com/251/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dataist.wordpress.com/251/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=251&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://dataist.wordpress.com/2012/01/25/interactive-haavistos-great-challange/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/6c46b5ccdf07a2e9d5b6c49dd31e6620?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">dataist</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2012/01/skacc88rmavbild-2012-01-24-kl-23-24-40.png" medium="image">
			<media:title type="html">open viz.</media:title>
		</media:content>
	</item>
		<item>
		<title>Pimp your Google Map</title>
		<link>http://dataist.wordpress.com/2011/12/29/pimp-your-google-map/</link>
		<comments>http://dataist.wordpress.com/2011/12/29/pimp-your-google-map/#comments</comments>
		<pubDate>Thu, 29 Dec 2011 09:14:48 +0000</pubDate>
		<dc:creator>Jens Finnäs</dc:creator>
				<category><![CDATA[Own projects]]></category>
		<category><![CDATA[crime]]></category>
		<category><![CDATA[google maps]]></category>
		<category><![CDATA[helsingin sanomat]]></category>
		<category><![CDATA[maps]]></category>
		<category><![CDATA[yahoo place finder]]></category>

		<guid isPermaLink="false">http://dataist.wordpress.com/?p=243</guid>
		<description><![CDATA[Today I&#8217;m having my first visualization published at Helsingin Sanomat. A map of all the discharges in Helsinki during one night (21-22.12) showing reported crimes and accidents. Check it out: It&#8217;s a pretty basic visualizations. I got a bunch of addresses that I geocoded using the Yahoos place finder API and projected on Google Maps. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=243&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Today I&#8217;m having my first visualization published at Helsingin Sanomat. A map of all the discharges in Helsinki during one night (21-22.12) showing reported crimes and accidents. Check it out:</p>
<div id="attachment_244" class="wp-caption aligncenter" style="width: 563px"><a href="http://www.hs.fi/kotimaa/Arkinen+y%C3%B6+Helsingiss%C3%A4+on+tyyni+-+katso+rikokset+ja+onnettomuudet+kartalta+/a1305552184997" target="_blank"><img class="size-full wp-image-244" title="Crimes and accidents mapped" src="http://dataist.files.wordpress.com/2011/12/bild-25.png?w=590" alt=""   /></a><p class="wp-caption-text">Go to HS.fi.</p></div>
<p>It&#8217;s a pretty basic visualizations. I got a bunch of addresses that I geocoded using the Yahoos place finder API and projected on Google Maps. I&#8217;m a huge fan of a lot of the Google tools out there. Most notably the Docs platform, which step by step is phasing out my decency on MS Office (at least MS Word).</p>
<p>However, I have been quite sceptical towards Google Maps. Mostly because of its – in lack of a better word – ugliness. We&#8217;ve seen enough of that pale blue-green-yellow layout.</p>
<p><a href="http://dataist.files.wordpress.com/2011/12/bild-26.png"><img class="aligncenter size-full wp-image-245" title="Google Maps as we know it." src="http://dataist.files.wordpress.com/2011/12/bild-26.png?w=590" alt=""   /></a></p>
<p>The good news is that Google has made it possible to easily style your maps using a simple online interface. In this example I&#8217;ve just inverted the lightness, added a pink hue and reduced the saturation of the water.</p>
<p><a href="http://gmaps-samples-v3.googlecode.com/svn/trunk/styledmaps/wizard/index.html"><img class="aligncenter size-full wp-image-246" title="Go crazy." src="http://dataist.files.wordpress.com/2011/12/bild-27.png?w=590&#038;h=258" alt="" width="590" height="258" /></a></p>
<p>The options are endless. You can easily spend hours playing around with the different settings.</p>
<p>Not too many developers and designers seem to have found this tool yet, but my prediction for 2012 is that there will be a lot more styled Google Maps. And why not a portal with open map skins for anyone to use? I can&#8217;t find anything like that at the moment.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dataist.wordpress.com/243/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dataist.wordpress.com/243/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/dataist.wordpress.com/243/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dataist.wordpress.com/243/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/dataist.wordpress.com/243/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/dataist.wordpress.com/243/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/dataist.wordpress.com/243/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/dataist.wordpress.com/243/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/dataist.wordpress.com/243/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dataist.wordpress.com/243/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/dataist.wordpress.com/243/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dataist.wordpress.com/243/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/dataist.wordpress.com/243/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dataist.wordpress.com/243/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=243&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://dataist.wordpress.com/2011/12/29/pimp-your-google-map/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/6c46b5ccdf07a2e9d5b6c49dd31e6620?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">dataist</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/12/bild-25.png" medium="image">
			<media:title type="html">Crimes and accidents mapped</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/12/bild-26.png" medium="image">
			<media:title type="html">Google Maps as we know it.</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/12/bild-27.png" medium="image">
			<media:title type="html">Go crazy.</media:title>
		</media:content>
	</item>
		<item>
		<title>Interactive: The 100 richest people in Finland</title>
		<link>http://dataist.wordpress.com/2011/11/29/interactive-the-100-richest-people-in-finland/</link>
		<comments>http://dataist.wordpress.com/2011/11/29/interactive-the-100-richest-people-in-finland/#comments</comments>
		<pubDate>Tue, 29 Nov 2011 14:19:11 +0000</pubDate>
		<dc:creator>Jens Finnäs</dc:creator>
				<category><![CDATA[Own projects]]></category>
		<category><![CDATA[income]]></category>
		<category><![CDATA[interactive]]></category>
		<category><![CDATA[raphael]]></category>
		<category><![CDATA[slideshow]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://dataist.wordpress.com/?p=234</guid>
		<description><![CDATA[November is the big gossip fest in Finland. Every year in the beginning of the month the tax records from last year are published. In other words: you get to know who made the most money. Every year the Finnish media outlets do a very conventional presentation of this material. Page after page of lists [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=234&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>November is the big gossip fest in Finland. Every year in the beginning of the month the tax records from last year are published. In other words: you get to know who made the most money.</p>
<p>Every year the Finnish media outlets do a very conventional presentation of this material. Page after page of lists of top-earners. Rarely does anyone do anything more creative with the data.</p>
<p>I gave it a shot. This is what came out:</p>
<div id="attachment_235" class="wp-caption aligncenter" style="width: 364px"><a href="http://jensfinnas.com/dataist/taxtop/"><img class=" wp-image-235 " title="Bild 20" src="http://dataist.files.wordpress.com/2011/11/bild-20.png?w=354&#038;h=216" alt="" width="354" height="216" /></a><p class="wp-caption-text">Open the interactive visualization in new window.</p></div>
<h2>How?</h2>
<p>This is my first visualization in <a href="http://raphaeljs.com/" target="_blank">Raphael.js</a>. Previously I have been working with <a href="http://dataist.wordpress.com/tag/d3-js/" target="_blank">D3</a> and <a href="http://dataist.wordpress.com/tag/protovis/" target="_blank">Protovis</a>, but the weak browser support of these two libraries is becoming a growing concern. Especially when one tries to do sell the work. However, I have found Raphael to be very useful and somehow more intuitive than D3.</p>
<p>The idea for this presentation came from the super-visualization, <a href="http://sexperienceuk.channel4.com/the-sexperience-1000" target="_blank">The Sexperience</a>, by British Channel 4, a survey about the sex life of ordinary Brits (don&#8217;t worry, you can open it at work as well). I think the geniality behind this setup is that you can follow the respondents in the quiz from question to question, which gives the user the possibility to explore the relation between different questions instead of just looking at one question at a time. What are for example the sexual preferences of the people who lost their virginity late?</p>
<p>To some extent my presentation of the 100 top earners let you do the same thing. You can select the persons you are interested in and follow them through the presentation. This is a potential of the modern web that I think we will see much more of in the future.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dataist.wordpress.com/234/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dataist.wordpress.com/234/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/dataist.wordpress.com/234/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dataist.wordpress.com/234/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/dataist.wordpress.com/234/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/dataist.wordpress.com/234/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/dataist.wordpress.com/234/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/dataist.wordpress.com/234/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/dataist.wordpress.com/234/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dataist.wordpress.com/234/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/dataist.wordpress.com/234/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dataist.wordpress.com/234/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/dataist.wordpress.com/234/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dataist.wordpress.com/234/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=234&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://dataist.wordpress.com/2011/11/29/interactive-the-100-richest-people-in-finland/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/6c46b5ccdf07a2e9d5b6c49dd31e6620?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">dataist</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/11/bild-20.png" medium="image">
			<media:title type="html">Bild 20</media:title>
		</media:content>
	</item>
		<item>
		<title>Tutorial: How to extract street coordinates from Open Street Map geodata</title>
		<link>http://dataist.wordpress.com/2011/11/25/tutorial-how-to-extract-street-coordinates-from-open-street-map-geodata/</link>
		<comments>http://dataist.wordpress.com/2011/11/25/tutorial-how-to-extract-street-coordinates-from-open-street-map-geodata/#comments</comments>
		<pubDate>Fri, 25 Nov 2011 14:15:03 +0000</pubDate>
		<dc:creator>Jens Finnäs</dc:creator>
				<category><![CDATA[Tutorials]]></category>
		<category><![CDATA[geojson]]></category>
		<category><![CDATA[housing prices]]></category>
		<category><![CDATA[maps]]></category>
		<category><![CDATA[open street map]]></category>
		<category><![CDATA[polymaps]]></category>
		<category><![CDATA[ruby]]></category>
		<category><![CDATA[tutorial]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://dataist.wordpress.com/?p=222</guid>
		<description><![CDATA[I&#8217;ve spent almost a year learning about data-driven journalism and tools for analysis and visualization of data. I have now become confident enough to think that I might even be able to teach someone else something. So here it goes: my first tutorial. The task Earlier this fall Helsingin Sanomat published a huge dump of [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=222&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve spent almost a year learning about data-driven journalism and tools for analysis and visualization of data. I have now become confident enough to think that I might even be able to teach someone else something. So here it goes: my first tutorial.</p>
<h2>The task</h2>
<p>Earlier this fall Helsingin Sanomat published <a href="http://blogit.hs.fi/hsnext/asuntojen-vuokrat-ja-hintapyynnot-oikotiella-2010-2011">a huge dump of price data</a> from <a href="http://www.oikotie.fi">Oikotie</a>, a Finnish market place for apartments. I had an idea to build a kind of heat map where every street would be colored based on the average price of the apartments.</p>
<p>With the JavaScript library <a href="http://polymaps.org/">Polymaps</a> you can easily make stylish web maps. The problem is that you need an overlay GeoJSON layer with the colored streets. Finnish authorities do not – <a href="http://www.maanmittauslaitos.fi/tiedotteet/2010/11/maanmittauslaitos-avaa-kartta-aineistoa-vapaaseen-kayttoon">yet!</a> – provide open street-level geodata. Fortunately Open Street Map does.</p>
<h2>From .shp to .geojson</h2>
<p>The raw data from Open Street Map is <a href="http://downloads.cloudmade.com/">downloadable in shape-format</a>. So in my case I download the shapefile package of Finland and opened it in Quantum GIS (<em>Layer &gt; Add vector layer</em>). This is what the finland_highway.shp file looks like.</p>
<p><a href="http://dataist.files.wordpress.com/2011/11/bild-14.png"><img class="aligncenter size-full wp-image-223" title="Bild 14" src="http://dataist.files.wordpress.com/2011/11/bild-14.png?w=590&#038;h=339" alt="" width="590" height="339" /></a>This is A LOT of geodata, but in this case I&#8217;m only interested in the Helsinnki region. So I zoom in Helsinki an and select, roughly, the streets that I&#8217;m interested in using the lasso tool (<em>select object tool </em><img class="alignnone  wp-image-224" title="Bild 16" src="http://dataist.files.wordpress.com/2011/11/bild-16.png?w=14&#038;h=13" alt="" width="14" height="13" />).</p>
<p><a href="http://dataist.files.wordpress.com/2011/11/bild-17.png"><img class="aligncenter size-full wp-image-225" title="Bild 17" src="http://dataist.files.wordpress.com/2011/11/bild-17.png?w=590&#038;h=295" alt="" width="590" height="295" /></a>To export the selected part of the map to the GeoJSON format that Polymaps can read, chose <em>Layer &gt; Save Selection as vector file</em> and GeoJSON as your format. Save! Done!</p>
<h2>Filtering the GeoJSON file</h2>
<p>We got a our GeoJSON-file. Now there is just one problem: it is huge, 18 MB! But there are a lot of streets here that we don&#8217;t need. We want to filter these streets. This will require some programming skills. I turn to Ruby.</p>
<p>This is the structure of an object in the GeoJSON file:</p>
<p style="padding-left:30px;"><code>{ "type": "Feature", "properties": { "TYPE": "cycleway", "NAME": "", "ONEWAY": "", "LANES": 0.000000 }, "geometry": { "type": "LineString", "coordinates": [ [ 24.773350, 60.203288 ], [ 24.774540, 60.203008 ], [ 24.777840, 60.202300 ], [ 24.781013, 60.201565 ], [ 24.781098, 60.201546 ], [ 24.782735, 60.201199 ], [ 24.784300, 60.201045 ], [ 24.785846, 60.201085 ], [ 24.787381, 60.201133 ], [ 24.787812, 60.201169 ], [ 24.788101, 60.201207 ], [ 24.797454, 60.201623 ], [ 24.797636, 60.201620 ], [ 24.799625, 60.201405 ], [ 24.801848, 60.201089 ] ] } }</code></p>
<p>This street does apparently not have a name, but the others do, which means I can extract that streets that I&#8217;m interested in based on their name.</p>
<p>In another array I list the streets that I want to be included in the visualization. Like this:</p>
<p style="padding-left:30px;"><code>streets = [<br />
'Mannerheimintie',<br />
'Hämeentie',<br />
'Ulvilantie'<br />
# and so on...<br />
</code></p>
<p>I now want to tell the computer to iterate through the GeoJSON file and extract the streets that are included in the streets array. Or in practice I approach it the other way around: I check what streets in the GeoJSON file that are not included in the array and remove them.</p>
<p>This is is the code:</p>
<p><code>def process(data)<br />
json = JSON.parse(data)</code></p>
<p><code><span style="color:#ff6600;">#-- STEP 1. Go through the geojson file and add the index numbers ("i") of the street names that are not found in the array "streets" to a new array ("del") </span><br />
i = 0<br />
del = []</code></p>
<p style="padding-left:30px;"><code>json["features"].each do |a|</code></p>
<p style="padding-left:60px;"><code>unless $streets.include? a["properties"]["NAME"]</code></p>
<p style="padding-left:30px;"><code>del.push(i)</code></p>
<p><code>end<br />
i += 1</code><br />
<code>end</code></p>
<p><code><span style="color:#ff6600;"><code>#--</code> STEP 2: Iterate through the del array from the back and remove the streets with the corresponding index numbers in the geojson data ---</span><br />
del.reverse.each do |d|</code></p>
<p style="padding-left:30px;"><code>json["features"].delete_at(d)</code></p>
<p><code>end</code></p>
<p><span style="color:#ff6600;"><code><code>#--</code> Open a new json file and save the filtered geojson ---</code></span></p>
<p><code>File.open("hki.json", 'a'){ |f| f.write(JSON.generate(json))}</code><br />
<code>end</code></p>
<p>In this case <code>data</code> is the GeoJSON file and <code>$streets</code> the array of the selected streets. And <em>voilà: </em>you got yourself a new GeoJSON file. In my case I managed to shrink it down to 1.6 MB.</p>
<h2>The visualization</h2>
<p>I now got what I wanted in the beginning: the geographical coordinates for the streets that I want to plot, which means I&#8217;m halfway to making my visualization.</p>
<p>I won&#8217;t go in to details on how the actual visualization was put together. The short version is that I used this <a href="http://polymaps.org/ex/streets.html">pavement quality example</a> as base script and made some small modifications. The price data is then picked from a separate file. This is the result, the housing prices in Helsinki, street by street:</p>
<div id="attachment_230" class="wp-caption aligncenter" style="width: 600px"><a href="http://jensfinnas.com/dataist/housing-hki/"><img class="size-full wp-image-230" title="Bild 18" src="http://dataist.files.wordpress.com/2011/11/bild-18.png?w=590&#038;h=278" alt="" width="590" height="278" /></a><p class="wp-caption-text">Open the full map in new window.</p></div>
<p style="text-align:left;">Not too shabby, right? I managed to sell this visualization to Hufvudstadsbladet which now <a href="http://hbl.fi/livsstil/boende" target="_blank">runs it on their website</a>.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dataist.wordpress.com/222/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dataist.wordpress.com/222/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/dataist.wordpress.com/222/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dataist.wordpress.com/222/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/dataist.wordpress.com/222/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/dataist.wordpress.com/222/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/dataist.wordpress.com/222/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/dataist.wordpress.com/222/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/dataist.wordpress.com/222/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dataist.wordpress.com/222/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/dataist.wordpress.com/222/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dataist.wordpress.com/222/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/dataist.wordpress.com/222/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dataist.wordpress.com/222/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=222&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://dataist.wordpress.com/2011/11/25/tutorial-how-to-extract-street-coordinates-from-open-street-map-geodata/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/6c46b5ccdf07a2e9d5b6c49dd31e6620?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">dataist</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/11/bild-14.png" medium="image">
			<media:title type="html">Bild 14</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/11/bild-16.png" medium="image">
			<media:title type="html">Bild 16</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/11/bild-17.png" medium="image">
			<media:title type="html">Bild 17</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/11/bild-18.png" medium="image">
			<media:title type="html">Bild 18</media:title>
		</media:content>
	</item>
		<item>
		<title>One month Wall Street occupation mapped</title>
		<link>http://dataist.wordpress.com/2011/11/06/one-month-wall-street-occupation-mapped/</link>
		<comments>http://dataist.wordpress.com/2011/11/06/one-month-wall-street-occupation-mapped/#comments</comments>
		<pubDate>Sun, 06 Nov 2011 18:50:14 +0000</pubDate>
		<dc:creator>Jens Finnäs</dc:creator>
				<category><![CDATA[Own projects]]></category>
		<category><![CDATA[d3.js]]></category>
		<category><![CDATA[interactive]]></category>
		<category><![CDATA[occupy wall street]]></category>
		<category><![CDATA[ruby]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://dataist.wordpress.com/?p=211</guid>
		<description><![CDATA[For a month now we have been getting news about the Occupy movement that started on Wall Street in the beginning of October. There has been some arguing about the size of this movement. Guardian has made and interesting attempt to answer the question using crowdsourcing. I took a different approach. The protest are coordinated [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=211&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>For a month now we have been getting news about the Occupy movement that started on Wall Street in the beginning of October. There has been some arguing about the size of this movement. <a href="http://www.guardian.co.uk/news/datablog/2011/oct/17/occupy-protests-world-list-map">Guardian has made and interesting attempt to answer the question</a> using crowdsourcing. I took a different approach.</p>
<p>The protest are coordinated at the site meetup.com. Here you find a <a href="www.meetup.com/occupytogether/?see_all=1">complete list of the 2 506 occupy communities</a>. I wrote a Ruby scraper that goes through this list and gathers information about all the meetups that has been arranged so far (more than 4 000 in a month).</p>
<p>I used the D3.js library to visualize the the list of meetups. This is the result (opens in new window):</p>
<p style="text-align:left;"><a href="http://jensfinnas.com/dataist/ows/"><img class="size-full wp-image-213 aligncenter" title="Open interactive visualization in new window." src="http://dataist.files.wordpress.com/2011/11/bild-131.png?w=590&#038;h=195" alt="" width="590" height="195" /></a></p>
<p style="text-align:left;">The movement clearly peaked on Octboer 15th with meetups in around 600 different locations around the world. Protestors have continued to rally on Saturdays, but not with the same intensity.</p>
<p>Note that there is a number of protests that are missing here. I had some technical difficulties geocoding special characters (using the Yahoo Place Finder API), but that should not distort the picture of how the movement has developed. I didn&#8217;t have time to resolve the problem at the moment, but if someone knows how to get the API to understand odd characters such as ä, é and ü I&#8217;d appreciate the assistance.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dataist.wordpress.com/211/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dataist.wordpress.com/211/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/dataist.wordpress.com/211/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dataist.wordpress.com/211/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/dataist.wordpress.com/211/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/dataist.wordpress.com/211/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/dataist.wordpress.com/211/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/dataist.wordpress.com/211/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/dataist.wordpress.com/211/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dataist.wordpress.com/211/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/dataist.wordpress.com/211/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dataist.wordpress.com/211/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/dataist.wordpress.com/211/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dataist.wordpress.com/211/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=211&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://dataist.wordpress.com/2011/11/06/one-month-wall-street-occupation-mapped/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/6c46b5ccdf07a2e9d5b6c49dd31e6620?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">dataist</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/11/bild-131.png" medium="image">
			<media:title type="html">Open interactive visualization in new window.</media:title>
		</media:content>
	</item>
		<item>
		<title>SVT launch Guardian inspired data blog</title>
		<link>http://dataist.wordpress.com/2011/10/29/svt-launch-guardian-inspired-data-blog/</link>
		<comments>http://dataist.wordpress.com/2011/10/29/svt-launch-guardian-inspired-data-blog/#comments</comments>
		<pubDate>Sat, 29 Oct 2011 15:01:27 +0000</pubDate>
		<dc:creator>Jens Finnäs</dc:creator>
				<category><![CDATA[Links]]></category>

		<guid isPermaLink="false">http://dataist.wordpress.com/?p=204</guid>
		<description><![CDATA[On Thursday the Swedish public broadcaster SVT launched a new exciting platform called SVT Pejl. It describes itself as a news blog producing journalism based on stats, facts and numbers. &#8220;Our ambition is to explain current events and make numbers and facts available in an accessible way&#8221;, writes Kristofer Sjöholm who is the leader of [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=204&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>On Thursday the Swedish public broadcaster SVT launched a new exciting platform called <a href="http://blogg.svt.se/svtpejl/">SVT Pejl</a>. It describes itself as a news blog producing journalism based on stats, facts and numbers. &#8220;Our ambition is to explain current events and make numbers and facts available in an accessible way&#8221;, writes Kristofer Sjöholm who is the leader of the project.</p>
<p>The presentation of the blog features an <a href="http://blogg.svt.se/svtpejl/kategori/datajournalistik/the-guardians-simon-rogers-%E2%80%9Dvi-trodde-bara-geeks-var-intresserade%E2%80%9D">interview with Simon Rogers</a> of Guardian&#8217;s <a href="http://www.guardian.co.uk/news/datablog">Data blog</a>. And this is clearly where the inspiration comes from. This is the Data blog of Sweden.</p>
<p style="text-align:center;"><a href="http://blogg.svt.se/svtpejl/kategori/datajournalistik/vi-jobbar-med-data"><img class="aligncenter size-full wp-image-205" title="Pejl" src="http://dataist.files.wordpress.com/2011/10/bild-10.png?w=590&#038;h=346" alt="" width="590" height="346" /></a></p>
<p style="text-align:left;">If you know some Swedish it is well worth taking a look at this introductory video explaining what data-driven journalism and SVT Pejl is.</p>
<p style="text-align:left;">For a person like me with one foot in Sweden and one in Finland it is interesting to follow (and be part of) the development in this field right now. Helsingin Sanomat has taken a lead role in Finland publishing big open data sets and arranging several hacks and hackers style workshops. It feels like Sweden has been loosing some ground to its little brother here, but maybe this new site will narrow the gap.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dataist.wordpress.com/204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dataist.wordpress.com/204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/dataist.wordpress.com/204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dataist.wordpress.com/204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/dataist.wordpress.com/204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/dataist.wordpress.com/204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/dataist.wordpress.com/204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/dataist.wordpress.com/204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/dataist.wordpress.com/204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dataist.wordpress.com/204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/dataist.wordpress.com/204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dataist.wordpress.com/204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/dataist.wordpress.com/204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dataist.wordpress.com/204/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=204&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://dataist.wordpress.com/2011/10/29/svt-launch-guardian-inspired-data-blog/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/6c46b5ccdf07a2e9d5b6c49dd31e6620?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">dataist</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/10/bild-10.png" medium="image">
			<media:title type="html">Pejl</media:title>
		</media:content>
	</item>
		<item>
		<title>Animation: World terrorism 2004-2011</title>
		<link>http://dataist.wordpress.com/2011/09/23/animation-world-terrorism-2004-2011/</link>
		<comments>http://dataist.wordpress.com/2011/09/23/animation-world-terrorism-2004-2011/#comments</comments>
		<pubDate>Thu, 22 Sep 2011 22:03:05 +0000</pubDate>
		<dc:creator>Jens Finnäs</dc:creator>
				<category><![CDATA[Own projects]]></category>
		<category><![CDATA[animation]]></category>
		<category><![CDATA[d3.js]]></category>
		<category><![CDATA[geocoding]]></category>
		<category><![CDATA[maps]]></category>
		<category><![CDATA[terrorism]]></category>
		<category><![CDATA[visualization]]></category>
		<category><![CDATA[yahoo place finder]]></category>

		<guid isPermaLink="false">http://dataist.wordpress.com/?p=199</guid>
		<description><![CDATA[After the terror attacks of nine-eleven the USA set out to fight terrorism. It has been a succesful quest in the sense that the Americans themselves have not been hit by terrorist since – but others have. According to statistics from the American Worldwide Incident Tracking System 37,798 lethal attacks have been carried out since [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=199&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>After the terror attacks of nine-eleven the USA set out to fight terrorism. It has been a succesful quest in the sense that the Americans themselves have not been hit by terrorist since – but others have. According to statistics from the <a href="https://wits.nctc.gov/FederalDiscoverWITS/index.do?N=0">American Worldwide Incident Tracking System</a> 37,798 lethal attacks have been carried out since 2004 killing 174,547. That&#8217;s a lot of nine-elevens.</p>
<p>Since the WITS provides such easily accessible data it would be a shame not to do something with it. So I did and this is what I ended up with (<a href="http://jensfinnas.com/dataist/terror_attacks/">click to open in new window</a>):</p>
<p><a href="http://jensfinnas.com/dataist/terror_attacks/" target="_blank"><img class="aligncenter size-full wp-image-200" title="Terrorist attacks visualized" src="http://dataist.files.wordpress.com/2011/09/bild-2.png?w=590&#038;h=331" alt="" width="590" height="331" /></a></p>
<p>A few words about how I did this visualization.</p>
<h2>The data</h2>
<p>The basic data was really easy to gather here. I just filtered the attacks with ten or more casualties and downloaded the spreadsheet from WITS. The challenge was to geocode the places. I hadn&#8217;t done this before.</p>
<p>I wrote a Ruby script that called the <a href="http://developer.yahoo.com/geo/placefinder/" target="_blank">Yahoo Place Finder API</a> to transform the place names to longitudes and latitudes. For some reason a few locations got completely wrong coordinates (I started to wonder when the USA was suddenly hit by major attacks that I had never heard of). These were filtered away.</p>
<h2>The visualization</h2>
<p>This job provided two new challenges. One, working with dates. Two, working with maps. Just as the <a href="http://dataist.wordpress.com/2011/09/05/interactive-athletics-world-record-progression/" target="_blank">last time</a> I used the JavaScript library d3.js to put the visualization together.</p>
<p>For the map I used the provided <a href="https://github.com/mbostock/d3/tree/master/examples/albers" target="_blank">Albers example</a> as a base script. With some assistance from <a href="https://groups.google.com/group/d3-js/browse_thread/thread/1f4cc4eec96a5748" target="_blank">this thread on Google groups</a> I managed to figure out how to make a map in d3 (my heureka moment was when I realized that you can <a href="https://groups.google.com/group/d3-js/msg/e5b11d4de893d687" target="_blank">modify d3.geo.js</a> to center the world map wherever you want).</p>
<p>Getting a hold of the dates in JavaScript became much easier with the <a href="http://www.datejs.com/" target="_blank">date.js</a> library. Highly recommended.</p>
<h2>Final thoughts</h2>
<p>A lot could have been done to polish the animation. One could have added some sort of timeline with key events, graphs and so on. But I think this is a pretty neat base for visualizing, lets says, earthquakes of other catastrophes. And you gotta like a viz on black.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dataist.wordpress.com/199/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dataist.wordpress.com/199/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/dataist.wordpress.com/199/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dataist.wordpress.com/199/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/dataist.wordpress.com/199/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/dataist.wordpress.com/199/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/dataist.wordpress.com/199/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/dataist.wordpress.com/199/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/dataist.wordpress.com/199/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dataist.wordpress.com/199/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/dataist.wordpress.com/199/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dataist.wordpress.com/199/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/dataist.wordpress.com/199/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dataist.wordpress.com/199/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=199&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://dataist.wordpress.com/2011/09/23/animation-world-terrorism-2004-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/6c46b5ccdf07a2e9d5b6c49dd31e6620?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">dataist</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/09/bild-2.png" medium="image">
			<media:title type="html">Terrorist attacks visualized</media:title>
		</media:content>
	</item>
		<item>
		<title>Interactive: Athletics world record progression</title>
		<link>http://dataist.wordpress.com/2011/09/05/interactive-athletics-world-record-progression/</link>
		<comments>http://dataist.wordpress.com/2011/09/05/interactive-athletics-world-record-progression/#comments</comments>
		<pubDate>Mon, 05 Sep 2011 09:00:54 +0000</pubDate>
		<dc:creator>Jens Finnäs</dc:creator>
				<category><![CDATA[Own projects]]></category>
		<category><![CDATA[athletics]]></category>
		<category><![CDATA[d3.js]]></category>
		<category><![CDATA[interactive]]></category>
		<category><![CDATA[visualization]]></category>
		<category><![CDATA[world records]]></category>

		<guid isPermaLink="false">http://dataist.wordpress.com/?p=190</guid>
		<description><![CDATA[The IAAF athletics world championships just came to an end with the one and only world record set by Jamaica in the short relay. This (the lack of world records) comes as no surprise. It is getting harder and harder to beat the old records, as the graph below shows. More than 2 000 official [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=190&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The IAAF athletics world championships just came to an end with the one and only world record set by Jamaica in the short relay. This (the lack of world records) comes as no surprise. It is getting harder and harder to beat the old records, as the graph below shows.</p>
<div id="attachment_191" class="wp-caption aligncenter" style="width: 495px"><a href="http://dataist.files.wordpress.com/2011/09/bild-10.png"><img class="size-full wp-image-191" title="Number of new world records per year." src="http://dataist.files.wordpress.com/2011/09/bild-10.png?w=590" alt=""   /></a><p class="wp-caption-text">Number of new world records per year.</p></div>
<p>More than 2 000 official IAAF world records have been set since the beginning of the 20th century. In other words:  a very interesting set of data. Inspired by <a href="http://www.nytimes.com/interactive/2008/08/16/sports/olympics/20080816_world_records.html" target="_blank">this visualization by The New York Times from 2008</a> I decided to do my own mashup with this data. <a href="http://jensfinnas.com/dataist/wr_progression/" target="_blank">This is the result (click to open in new window)</a>:</p>
<div id="attachment_192" class="wp-caption aligncenter" style="width: 482px"><a href="http://jensfinnas.com/dataist/wr_progression/"><img class="size-full wp-image-192 " src="http://dataist.files.wordpress.com/2011/09/bild-11.png?w=590" alt=""   /></a><p class="wp-caption-text">Interactive visualization: click to open in new window.</p></div>
<h2><strong>The data</strong></h2>
<p>There were two challenges with this visualization: getting the data and visualizing it. It was surprisingly difficult to find world record data in an accessible format. <a href="http://en.wikipedia.org/wiki/Athletics_record_progressions" target="_blank">Wikipedia</a> provides some help, but the data contains plenty of holes. Instead I had to turn to the only thing the IAAF has to offer: <a href="http://berlin.iaaf.org/news/kind=101/newsid=52384.html" target="_blank">a 700 page pdf with all the athletics statistics you can think of</a>. The open data gospel has apparently not reached IAAF quite yet.</p>
<p>On the other hand this was an opportunity to practice some Excel formatting skills. To copy-paste the data into Excel was easy, transforming into readable columns and rows took some time. But I did it and you&#8217;ll find the <a href="https://docs.google.com/spreadsheet/ccc?key=0AojPWc7pGzlMdDFoVHlDeTFDRnlOZy1yMFMyWnhEbFE&amp;hl=en_US#gid=0" target="_blank">result in Google Docs</a>. I didn&#8217;t figure out how to make Google Docs format seconds, tenths and hundredths correctly, but if you open the spreadsheet in Excel you should be able to get the correct times.</p>
<p>With the data in a pretty spreadsheet I indexed all the results with 1951 as a base year (or the first recorded record for new events) and manually added the newest records, such as the one set by the Jamaican relay team.</p>
<h2><strong>The visualization</strong></h2>
<p>For the first time I used the JavaScript library <a href="http://mbostock.github.com/d3/" target="_blank">d3.js</a> for a visualization. With my short <a href="http://dataist.wordpress.com/tag/protovis/" target="_blank">Protovis background</a> d3.js was a charm to work with. The main advantages with d3.js compared to Protovis are that d3.js provides much greater animation support and makes it easier to interact with other elements on the page (such as div-tags).</p>
<p>As a d3-n00b I used <a href="http://www.janwillemtulp.com/2011/03/31/tutorialthe-basics-working-with-arrays-in-d3/" target="_blank">Jan Willem Tulps tutorial</a> as a base script and built around that. The d3.js documentation is still not conclusive, so for a beginner it takes some trial and error to progress, but undoubtedly this is a very powerful library for making handmade interactive visualizations.</p>
<p>All in all a very educative process and a result that I&#8217;m quite content with.</p>
<h2>Post scriptum</h2>
<p>Do you, by the way, know which the sixth greatest athletics nation of all time is (measured in number of world records)? FINLAND! A bit hard to believe a year like this when non of our athletes made the top-eight.</p>
<table width="251" border="0" cellspacing="0" cellpadding="0">
<col width="137" />
<col width="114" />
<tbody>
<tr>
<td width="137" height="12"><strong>Country</strong></td>
<td align="right" width="114"><strong>Number of records</strong></td>
</tr>
<tr>
<td width="137" height="12">USA</td>
<td align="right" width="114">367</td>
</tr>
<tr>
<td height="12">Soviet union</td>
<td align="right">199</td>
</tr>
<tr>
<td height="12">East Germany</td>
<td align="right">109</td>
</tr>
<tr>
<td height="12">Great Britain</td>
<td align="right">55</td>
</tr>
<tr>
<td height="12">Germany</td>
<td align="right">51</td>
</tr>
<tr>
<td height="12">Finland</td>
<td align="right">49</td>
</tr>
<tr>
<td height="12">Poland</td>
<td align="right">47</td>
</tr>
<tr>
<td height="12">Australia</td>
<td align="right">41</td>
</tr>
<tr>
<td height="12">West Germany</td>
<td align="right">39</td>
</tr>
<tr>
<td height="12">Russia</td>
<td align="right">36</td>
</tr>
</tbody>
</table>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dataist.wordpress.com/190/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dataist.wordpress.com/190/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/dataist.wordpress.com/190/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dataist.wordpress.com/190/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/dataist.wordpress.com/190/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/dataist.wordpress.com/190/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/dataist.wordpress.com/190/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/dataist.wordpress.com/190/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/dataist.wordpress.com/190/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dataist.wordpress.com/190/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/dataist.wordpress.com/190/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dataist.wordpress.com/190/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/dataist.wordpress.com/190/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dataist.wordpress.com/190/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=190&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://dataist.wordpress.com/2011/09/05/interactive-athletics-world-record-progression/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/6c46b5ccdf07a2e9d5b6c49dd31e6620?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">dataist</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/09/bild-10.png" medium="image">
			<media:title type="html">Number of new world records per year.</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/09/bild-11.png" medium="image" />
	</item>
		<item>
		<title>Campaign funding times two</title>
		<link>http://dataist.wordpress.com/2011/08/15/campaign-funding-times-two/</link>
		<comments>http://dataist.wordpress.com/2011/08/15/campaign-funding-times-two/#comments</comments>
		<pubDate>Mon, 15 Aug 2011 17:21:02 +0000</pubDate>
		<dc:creator>Jens Finnäs</dc:creator>
				<category><![CDATA[Own projects]]></category>
		<category><![CDATA[campaign funding]]></category>
		<category><![CDATA[elections]]></category>
		<category><![CDATA[interactive]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[protovis]]></category>

		<guid isPermaLink="false">http://dataist.wordpress.com/?p=181</guid>
		<description><![CDATA[These two interactive visualizations has been in the drawer all summer. I made them in June already and did a small effort to get them published, but then I when that didn&#8217;t happen they were sort of forgotten about. The starting point was the campaign funding data that was published after the parliamentary elections here [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=181&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>These two interactive visualizations has been in the drawer all summer. I made them in June already and did a small effort to get them published, but then I when that didn&#8217;t happen they were sort of forgotten about.</p>
<p>The starting point was the campaign funding data that was published after the parliamentary elections here in Finland. All MPs have to publicly declare all donations above 1500 euros. The data can be found <a href="http://www.puoluerahoitus.fi/fi/index/vaalirahailmoituksia/raportit/Tietoaineistot.html.stx" target="_blank">here</a>, or in a slightly refined form <a href="http://blogit.hs.fi/hsnext/kolme-kansanedustajaa-kulutti-yli-20-euroa-jokaiseen-aanestajaansa" target="_blank">here</a> (thanks Helsingin Sanomat!). Helsingin Sanomat has already provided their own visualization, check it out <a href="http://vaaliraha.com/" target="_blank">here</a>.</p>
<h2>The network</h2>
<p>I started by approaching the data as a network using Protovis. This was the result:</p>
<div id="attachment_186" class="wp-caption aligncenter" style="width: 600px"><a href="http://jensfinnas.com/dataist/camp_funding2/"><img class="size-full wp-image-186" title="Campaign funding network" src="http://dataist.files.wordpress.com/2011/08/bild-9.png?w=590&#038;h=319" alt="" width="590" height="319" /></a><p class="wp-caption-text">Click to open in new window. Note that it takes a while to load.</p></div>
<p>I think the output is not too shabby, although the loading time here is really not acceptable. I couldn&#8217;t find a way to fasten up the rendering. The JavaScript code could also have been better, but I learned a lot in the process of putting it all together and would have been able to write a much smoother code today I think.</p>
<h2>The explorer</h2>
<p>The network approach above might be pretty, but not as informative as it could be. Again I used Protovis to build an interface that quickly lets you browse through all the reports.</p>
<div id="attachment_187" class="wp-caption aligncenter" style="width: 600px"><a href="http://jensfinnas.com/dataist/camp_funding1/"><img class="size-full wp-image-187" title="Campaign funding explorer" src="http://dataist.files.wordpress.com/2011/08/bild-8.png?w=590&#038;h=344" alt="" width="590" height="344" /></a><p class="wp-caption-text">Click to open in new window. The explorer itself is in Finnish.</p></div>
<p>I think this visualization has a lot of strengths. It is &#8220;click-less&#8221; which means you can quickly browse the candidates. Life is too short to be clicking. The loading time is also much, much shorter than in the network visualization.</p>
<p>Any thoughts?</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dataist.wordpress.com/181/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dataist.wordpress.com/181/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/dataist.wordpress.com/181/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dataist.wordpress.com/181/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/dataist.wordpress.com/181/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/dataist.wordpress.com/181/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/dataist.wordpress.com/181/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/dataist.wordpress.com/181/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/dataist.wordpress.com/181/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dataist.wordpress.com/181/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/dataist.wordpress.com/181/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dataist.wordpress.com/181/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/dataist.wordpress.com/181/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dataist.wordpress.com/181/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=181&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://dataist.wordpress.com/2011/08/15/campaign-funding-times-two/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/6c46b5ccdf07a2e9d5b6c49dd31e6620?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">dataist</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/08/bild-9.png" medium="image">
			<media:title type="html">Campaign funding network</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/08/bild-8.png" medium="image">
			<media:title type="html">Campaign funding explorer</media:title>
		</media:content>
	</item>
		<item>
		<title>The Finnish &#8220;immigration critics&#8221; blog network</title>
		<link>http://dataist.wordpress.com/2011/07/31/the-finnish-immigration-critics-blog-network/</link>
		<comments>http://dataist.wordpress.com/2011/07/31/the-finnish-immigration-critics-blog-network/#comments</comments>
		<pubDate>Sun, 31 Jul 2011 21:11:12 +0000</pubDate>
		<dc:creator>Jens Finnäs</dc:creator>
				<category><![CDATA[Own projects]]></category>
		<category><![CDATA[backlinks]]></category>
		<category><![CDATA[gephi]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://dataist.wordpress.com/?p=171</guid>
		<description><![CDATA[In my previous post I mapped the network of anti-jihadist bloggers mentioned in the manifesto of Anders Behring Breivik. This afternoon I stumbled upon a tweet by Martti Tulenheimo requesting something similar on the Finnish blogosphere. I had actually tried to do something like that a few days ago, but didn&#8217;t manage to write the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=171&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In my previous post I <a href="http://dataist.wordpress.com/2011/07/26/mapping-the-anti-jihadist-blogosphere/" target="_blank">mapped the network of anti-jihadist bloggers</a> mentioned in the manifesto of Anders Behring Breivik. This afternoon I stumbled upon a <a href="http://twitter.com/#!/tulenheimo" target="_blank">tweet by Martti Tulenheimo</a> requesting something similar on the Finnish blogosphere.</p>
<p>I had actually tried to do something like that a few days ago, but didn&#8217;t manage to write the script I wanted. The plan was to use <a href="http://developer.yahoo.com/search/siteexplorer/V1/inlinkData.html" target="_blank">Yahoo&#8217;s inbound link API</a> and have it build the network automatically. But I didn&#8217;t figure out how to only include links from the main page of a site (as <a href="http://www.analyzebacklinks.com/" target="_blank">Analyze Banklinks</a> lets you do). So instead I took a more manual approach.</p>
<h3>The method</h3>
<p>As in the previous post I used <a href="http://www.analyzebacklinks.com/" target="_blank">Analyze Banklinks</a> to list incoming links. The backlink analysis was done on the following blogs:</p>
<table width="270" border="0" cellspacing="0" cellpadding="0">
<col width="238" />
<col width="32" />
<tbody>
<tr>
<td width="238" height="13"><strong>Site</strong></td>
<td width="32"><strong>Links</strong></td>
</tr>
<tr>
<td height="13">http://www.jussi-halla.com/scripta</td>
<td align="right">93</td>
</tr>
<tr>
<td height="13">http://kullervokalervonpoika.wordpress.com/</td>
<td align="right">38</td>
</tr>
<tr>
<td height="13">http://octavius1.wordpress.com/</td>
<td align="right">28</td>
</tr>
<tr>
<td height="13">http://www.hommaforum.org</td>
<td align="right">23</td>
</tr>
<tr>
<td height="13">http://yrjoperskeles.blogspot.com/</td>
<td align="right">8</td>
</tr>
<tr>
<td height="13">http://marialohela.fi/blogi/</td>
<td align="right">7</td>
</tr>
<tr>
<td height="13">http://izrailit.blogspot.com/</td>
<td align="right">6</td>
</tr>
<tr>
<td height="13">http://vasarahammer.blogspot.com/</td>
<td align="right">6</td>
</tr>
<tr>
<td height="13">http://turkkila.blogspot.com/</td>
<td align="right">5</td>
</tr>
<tr>
<td height="13">http://tuuritapio.blogspot.com/</td>
<td align="right">3</td>
</tr>
<tr>
<td height="13">http://nikopol2008.blogspot.com/</td>
<td align="right">2</td>
</tr>
<tr>
<td height="13">http://reinoblog.blogspot.com/</td>
<td align="right">2</td>
</tr>
<tr>
<td height="13">http://laivaontaynna.blogspot.com/</td>
<td align="right">1</td>
</tr>
</tbody>
</table>
<p>These are the blogs listed as &#8220;critical voices&#8221; on the blog of Jussi Halla-aho (or &#8220;the master&#8221; as he is refered to on the Homma discussion board) and they serve as a good starting point for this purpose.</p>
<p>I run all of the blogs through Analyze Backlinks to get a network of 133 blogs. I have obviously not read all of these blogs to check if it is correct to label them as immigration critics. The 133 blogs included in this network are merely to sites that link to at least one of the blogs mentioned above.</p>
<p>The results have to be read with some caution. I am not sure how reliable Analyze Backlinks is. Their own disclaimer warns that the results may not be accurate.</p>
<h3>The results</h3>
<p>Again I used Gephi to draw the network and this is what came out (<a href="http://dataist.files.wordpress.com/2011/07/blog_network2.pdf" target="_blank">click to open as pdf</a>):</p>
<div id="attachment_174" class="wp-caption aligncenter" style="width: 600px"><a href="http://dataist.files.wordpress.com/2011/07/blog_network2.pdf"><img class="size-full wp-image-174" title="blognetwork" src="http://dataist.files.wordpress.com/2011/07/blognetwork.png?w=590&#038;h=312" alt="" width="590" height="312" /></a><p class="wp-caption-text">Click to open as pdf.</p></div>
<p style="text-align:left;">The size of the sites are determined by the number of inlinks, that is the number of sites that link to the page (not only front page links are counted here). A large number of inlinks indicate that the site is popular. Hence the big dots should be seen as key nodes in the Finnish immigration critic blogosphere.</p>
<p style="text-align:left;">However, I am not quite sure about the quality of the inlink count that Analyze Backlinks provide. I had a quick look at what numbers that Yahoo&#8217;s backlink API throws out and there seems to be a significant discrepancy.</p>
<p style="text-align:left;">So this analysis is far from perfect, but it&#8217;s a start and it gives you a decent idea of what the most important sites are in blogoshere of Finnish immigration critics. If you have thoughts on how the methodology could be improved I would love to hear your comments.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dataist.wordpress.com/171/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dataist.wordpress.com/171/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/dataist.wordpress.com/171/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dataist.wordpress.com/171/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/dataist.wordpress.com/171/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/dataist.wordpress.com/171/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/dataist.wordpress.com/171/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/dataist.wordpress.com/171/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/dataist.wordpress.com/171/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dataist.wordpress.com/171/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/dataist.wordpress.com/171/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dataist.wordpress.com/171/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/dataist.wordpress.com/171/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dataist.wordpress.com/171/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dataist.wordpress.com&amp;blog=19696904&amp;post=171&amp;subd=dataist&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://dataist.wordpress.com/2011/07/31/the-finnish-immigration-critics-blog-network/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/6c46b5ccdf07a2e9d5b6c49dd31e6620?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">dataist</media:title>
		</media:content>

		<media:content url="http://dataist.files.wordpress.com/2011/07/blognetwork.png" medium="image">
			<media:title type="html">blognetwork</media:title>
		</media:content>
	</item>
	</channel>
</rss>
