 Can everybody hear me? Can you hear me fine? I'm fine. That's the best one I've had on microphone. It's quite exciting. Hello and thank you for having me. My name is Lucy Chambers and I work for the Open Knowledge Foundation. Just before I start talking about what I'm actually here to talk about, I'll tell you why I'm here and how Data Geneson is actually connected to the work of the Open Knowledge Foundation. The Open Knowledge Foundation started in the UK in about 2004. So, open data, in the sense that we mean it, means data that anybody is free to use, reuse or redistribute. We don't just do data, we also do content. And in the remit of content comes into things like open educational resources. The Data Journalism Handbook, which I'm going to talk to you today about, is an open educational resource i obor sydd yn aflwys iawn a hwn o'r cyflwyno'n agol a hoffod y gallwn meddwl. Mae rhuwm yn 3 gwybr am y gweithio. Fy gen i'nedd y ddim yn mynd i gael hoffod darmarfod i'r pera. Fy gynnal cael Roeliau sydd wedi i mi gael o'r cyfoseddon yn wneud y gweithio wneud yn ysgol ac mae'r hoffod i gael oed yn gweithio y gweithio gwybr o bobl. Mae'n adeg arnynсти a chynnau iawn i gael o'i cefnod. We also work in open academia, open research where we try and encourage scientists, researchers to open up their data behind their research and make it available so that people can cross check their results, build on their research, don't replicate it and also encourage them to put their, publish their papers in open access journals and we also work in the field of open cultural heritage. So when copyright expires we teach people what they can do with the material but it has passed into the province name. So that's basically what the Okonomish Foundation does and the way it does it is it builds tools, projects and communities. We'll talk about those a bit later. There are legal tools, we build licences and we teach people that it's important not only to publish your data on the internet if you're a government or if you're an NGO, it's also important to tell people what they can do with it. So we encourage people to put licences on their data which tells people what they can do with it and if it's open or if it's not open. Specifically I should mention with open licences we've been asked this numerous times since we've been here. We don't count non-commercial licences as open so any licence which stops commercial bodies from using data we don't count as an open licencer. I just thought I would mention that at that point. One of our biggest projects is a project called CCAN which is an open source data repository. It's been used by the UK government for data.gov.uk. It's also about to be used at the European level for publicdata.eu, the European data repository. Besides actually encouraging governments to open up their data, we also kind of perform the interpretive role so we actually take the data, process it and put it into a form that people can understand a lot more easily. So this is the predecessor of the project that I work on. It's a screenshot from the project where's my money go where you can basically put in your income, can you still hear me? You can put in your yearly income and it will give you on a daily basis information on how much you contribute to different sectors of society. So this is kind of a nice segue into data journalism as well because I'm going to talk a bit later about how journalism is no longer static words on a page. It's also about showing people in the way that's most relevant to them what data means for them. The final project that I'm going to talk about, my colleague Laura who sat down here at the front, who you could have cost if you're interested in it, is that besides working with governments to open up their data and interpreting it for them, we also want to build people's data literacy skills. So when we're talking about data journalism that's not only the journalists themselves, that's also the readers. So we care that the readership actually demands to see the working behind the news articles. They're not happy to see a large number printed in their local newspaper. They want to know exactly where it's come from and how they got to that number. So, with that in mind, that brings us on to how we get to the topic of data journalism and what I've just mentioned is exactly the reason why we were motivated to put together this handbook. Data journalism handbook is an open educational resource, which you can find online at the easiest URL is ddjainbook.org. It's available under a Creative Commons license. It's CC by SA, which means that anybody can use it. Anybody can, as long as they attribute us, and if they, we also encourage people to remix it, and if they remix it, as long as they release it under the same license, they can do whatever they like with it. And it aims to, I've got you, before I tell you what data journalism is, I'm actually going to tell you that there is no such thing as data journalism. Why I say that there is no such thing as data journalism is, we think that data journalism is a misnomer. If you're a journalist and you use a telephone in your reporting, you don't call it telephone journalism. In today's modern age, we don't believe that journalists can get by without using data. So we think that every journalist should be equipped with the skills and know which tools they can use in order to be able to use data. So we would like it, it's very fashionable at the moment to use the name data journalism in the States. They've been talking about computer assisted reporting for a very, very long time. We'd just like it to become common practice, essentially. So, now I've told you that there is no such thing as data journalism, I'm going to tell you what we wrote a book about. So one of the biggest headaches that we have at the Open Knowledge Foundation is journalists, for a long time, have not treated data as a source. They still look for things like interviews, they look to have their own sources of information. And we teach them new ways of getting data, I'll talk a bit more about what those are a little bit later. New ways of understanding and working with the data, we're particularly keen. So there are organisations which are very famous for doing data journalism. Things like The Guardian in the UK, The New York Times. If you're not a multi-million-pound organisation, can you get started with data journalism? Is there anything that you can do as a small local newspaper to get started with data journalism? And, as I mentioned before, with the Where's My Money Go visualization, I don't believe that journalism is any longer a static word on a page. People expect more than just pros. So how can we teach journalists to deliver information in a more engaging and more beautiful way, essentially? The reason it's important now is because we're kind of in an information age. Journalists before were more of hunter-gatherers, they had to fight for their resources and information was scarce. Now you guys have a Right to Information Act, we have a Freedom of Information Act as well. There's also a huge amount going on in the field of government transparency. This just isn't the case anymore. One of the things that is most frustrating in the line of work that we do is journalists not wanting to collaborate. When they have one dataset and another journalist has another dataset, they're keeping holding on to those datasets and not sharing them with the most powerful stories that have come out of data-driven journals and movement, if you like, have been ones where people have combined data from several sources. So we also encourage journalists to build a kind of commons of data and see that whilst it may not be them in the first instance that gets the scoop, they can come back and search later on in these common databases and things like that and get the story a bit later on. So, as I mentioned before, you can't actually see that. This is a site called datacatalogs.org. This is a catalogue of catalogs. It lists as many open government data portals as we know about. We teach journalists about, first and foremost, the most easy way to get data is that a lot of the time is already there, so we teach them where to look for it and we teach them how to use open government data portals and freedom of information requests and things like that. This is not something, I put this slide in here because it's not what we do. This is a visualisation of the WikiLeaks warlogs because where before the journalists were, if you like, come to gathers and they had to forage and hold on to their data, we now end up in a situation where they're absolutely drowning in it. This is a really interesting project. The overview project is built by Jonathan Stray at the Associated Press. It deals with the inverse of the hunter-gatherer problem where journalists are now confronted with so many documents and data that they have to be able to work out where they should even start or if they should even start reading the documents. What the overview project does is it groups together, it finds common words in documents that may be of interest to journalists and it shows them which documents they're located in so they can focus in just on this cluster of documents if they're only interested in things which include the word handcuffed. It's about speeding up fact-checking as well for time-press journalists because they don't have the time to mess around. It's also about crowdsourced data. One of the newsletways which we're finding that journalists are getting hold of data is by actually asking people. With tools like the Shahidi which you see behind me and even really simple things like Google Forms we see that people are actually putting surveys out there and asking their readers and following up after they've put out an interesting story to find out what their readers thought of it. It's changing journalism because they can get their stories a lot faster and a lot more accurate than they ever could from a small handful of sources before. Why we're here and what we would really like to get out of The Fifth Elephant and talking to you guys is that we believe that data journalists sit at the intersection between several groups. They have a lot to learn from coders, data geeks and even traditional academics. A lot of the first Pulitzer Prizes which were one for data-driven stories came from journalists who looked at methods and techniques which historians were using to analyse and work with data and they tried to apply them to their own work and found it actually. That's really, really interesting. What we're keen to find out is in India and with the work that you guys do is there anything that you really wish that journalists knew that makes your life really, really easy when you're working with data? What are the tools which are most useful to you guys and could they learn anything from you? This leads me to... Excellent. This leads me to how the data journalism handbook came out because it was not the project that we originally intended it to be. It started like what we intended to be a two-day project and we started it at the New Zealand Festival in London in 2011 in November and we basically planned to write a book in two days. We got a group of journalists from all over the world, The New York Times, The Guardian, loads and loads of people from all over the world. Put them in a room, lock them in there, wouldn't let them out until they remember. At the end of the two days we had about 20,000 words from about 35 or 55 contributors I think and it just seemed like such a shame not to keep going with it. Over the next six months we embarked on this enormous crowd sourcing what's essentially an enormous crowd sourcing project when we went around and we found what we thought were some of the key data-driven stories and data-driven journalists from around the world and we asked them to write about their experience. We didn't want to write a kind of manual because we realised that that would date really quickly and we didn't want to kind of write about the fashionable tools of the time. What we did was we compiled case studies and it's meant to get journalists excited about the prospect of using data in their work rather than telling them because I'm sure you know no book can turn one person into a fully fleshed data journalist but it's a good start and it's going to inspire them to get started. So we've touched on this before but it deals with things like where can you find data? How can you request data? What tools you can use once you've got hold of your data and importantly how do you find stories and data? Do you go in with a hypothesis, a hunch and looking for something or do you just kind of fumble around and hope that you're going to find it? What are the ways that other people have managed to do this in the past? So I don't know how well you can see that which is a shame. This is a poster with the overview of the entire book where we start from collecting, kind of crowdsourcing all of the different interpretations of what data journalism is and it ranges in everybody's minds as to what the definition actually should be. It can be, like I mentioned, people say it's just journalism where you use data. Another definition will say it's just journalism done well. Another definition will say it's journalism with visualisations and sexy infographics and things like that. Then we move on to look at actually how data journalism changes newsrooms because I've been talking to people as we go through and it seems that the media in India seems fairly healthy. That might be disputable, but as far as I can tell people seem to think that the media is doing pretty well in India. In the UK and in other parts of Europe it's struggling. A lot of the media has moved online and people are struggling to work out new business models which revolve around it. Can you still hear me as my microphone? We look at new business models and they can be anything from actually doing consultancy work around data because all of a sudden you've built this fantastic resource that people actually want to access and use, publishing and things like that. Also we look at things like is data journalism something which you need to intrinsically upskill in your newsroom? Do you actually need to train journalists to have new skills or is it actually more beneficial to buy in the talent if you like? Is it better to train journalists how to code or is it better to get coders in to help you out with it? What can you do on your own and what do you need additional help with? Then we move on to ways of getting data. Freedom of information requests, I think you call them RTI requests. Open data portals. There's a small amount on scraping as well which helps people to use with caution but if people won't give you information the way that you ask them when you ask them nicely it seems a pity not to be able to get it any other way. And then crowdsourcing again. There's a small amount on statistics and understanding data because I'm sure you know that journalists, some are absolutely fantastic at maths and some of them are maths butchers so there's a small amount on data literacy and basic stats for journalists, common mistakes, things like that. The final section deals with new ways of telling stories with data so interactive maps. We look at some kind of proprietary tools and free tools that people are using in their work. A lot of examples from the fantastic work of the Guardian data blog and also how you engage with your community. You want news to be more of a dialogue not just putting out their stories and into a void but how do they actually, after they put their story out how do they get feedback from people to build on it and make it into a better story next time or perhaps the next chapter of the same story. So it's online. There is an online version for free. There is also now a print and e-book version available via Riley and it's available at ddjbook.org If anybody has any questions about this or about the other work of the Open Knowledge Foundation I would be very happy to speak to you about it. Laura and I have a table upstairs with lots of stickers and we're also having a meet-up. First we're having a meet-up next week on Tuesday with the Data Meet Guys on the topic of Open Data. It's at Jaeger. The address is on here and there's a bitly link if you're interested in joining us, we'd love to see you. We're also planning on having an entire couple of days of activities around data journalism at our annual conference in Helsinki in September if you're interested in learning more about it. Thank you very much for having me and I hope that you'll keep talking.