 So, you can join me if you want to, okay? So I would like to once again check your knowledge on Wikimedia Sweden. So please pick up your phone, put in menti.com and use the code 31777. Could we have that up on the screen? So our first question is how many articles are there on the Swedish Wikipedia? Some people still answering. And you are correct, 3.7 million. Woohoo, that's really amazing. And my second question is, which was the most visited Wikipedia page in Sweden this summer? You are totally right. The tragic accident took place in 1986 of Chernobyl. And we think that it's probably likely due to the recently launched HBO TV series about Chernobyl. I'm talking about Chernobyl. Yesterday I was speaking with Mark, the next speaker. And he told me that he recently had updated sources about Chernobyl. So it would be easy to find facts and update the Wikipedia and things. And he works towards universal access to knowledge. And the organization has more than 30 years, 20 years plus experience of web history. That is really amazing, I think. He's the director of Wayback Machine at the Internet Archive. Give me a big hand for Mark Graham. All right, hi. I'm thrilled to be here. I'm truly honored and excited because I get to talk about my favorite subject in the world, which is the Internet Archive. And I get to do it among friends, folks from the Wikipedia community. I'm going to basically introduce some of the work of the Internet Archive in the context of the Sustainable Development Goals, the SDGs, and also relative to Wikipedia. So the Internet Archive is a 22-year-old nonprofit based in San Francisco whose mission is universal access to all knowledge. We've got about 150 staff folks, about 50 of them in San Francisco, and the rest spread around the world. And we work in the space of taking analog material and digitizing it, taking digital material and preserving it, and making that information available worldwide. We don't charge anything for our services, there's no advertising on our site, and we don't track you. No strings attached, basically. And I'm going to do an overview of some of our key projects as they relate to the SDGs. So go for it. Quality education. You know, in some ways, it was kind of hard because there's 17 of the SDGs, and I thought, well, as a library, we're about educating people and fundamentally education underpins the success of everyone of the SDGs. But in particular, with regard to education, one project is that we've been archiving open access journal literature. So we have compiled 18 million papers that are all available, no strings attached, complete open access. They're all backed up to the way back machine, I should say also. And Brian Newbold, who's here, who's the researcher on this project, met one of the contributors of about a million of these papers last night, a fellow Wikipedia. So it's good to be here and collaborate with our friends. We also digitize a lot of books. We digitized several million books, and we do about 1,000 a day. And this is a picture of the machine that we use to digitize books. Yes, they're operated by human beings. Human beings turn the pages and press the buttons to cause the cameras to flash and take high quality photographic images of pages, which are then turned into a whole series of derivative works, including through optical character recognition, the text, and then we're able to do full text indexing, et cetera. So 1,000 books a day, we're in the process of ramping that number up. Our goal is 2,000 books a day. We make these books available in a variety of ways. One of them is something called Open Library, openlibrary.org. You can go there and you can discover books and you can borrow them if they're a modern work. We've also collaborated with about 20 major libraries, Boston Public Library, Georgetown Law, and some others, on a way to make books more widely available in a digital format through libraries. It's called Control Digital Lending, a group of about 20 copyright scholars and library and scholars got together and they wrote a 40-page white paper making the case for why a library can take and choose what format it wants to lend a book in. They can choose to lend a book in a paper format and if they own a paper version of a book, they can choose to lend that book in a digital format with two controls. Control 1 is the number, so it's a one-to-one ratio, and the second is the control that these digital versions that they lend out are used DRM software, encrypted basically with a Adobe encryption. Control Digital Lending is one of the ways that we make books available. Another project that we're working on is to link up books that we have to Wikipedia articles. This is the Martin Luther King, Jr. English Wikipedia page and down at the bottom, the first citation you see is to a book on page 138. So if you click on that link, you go right to page 138 in that book. In the last two weeks, we've added links to more than 30,000 books from English Wiki. There's about another 90,000 books, which I basically just have to press a button and I can add the links to all of those books. And then we've also begun to build wish lists for various language Wikipedia sites. My wish list right now for Ian Wiki is about 1.2 million, and through a special relationship we have with a used bookseller called Better World Books, we're going to work toward getting a great number of those books to acquire hundreds of thousands of those books, digitize them, and then cause links between them and Wikipedia articles. So we're going to roll that out with other language Wikipedia editions. And I would welcome, as in every one of these examples today, welcome your participation in helping to make this a reality. For several years, we've been working to fix broken links on Wikipedia pages. We're probably within the Wikipedia community most well known for this. We've gone through 26 Wikipedia language editions, and we've identified one of the 13 million broken links. These are URLs that would have returned a 404, it would have looked something depressingly like this, sorry, page not found, but then we've gone into the way back machine and we found archives of those, and then edited the URL in the article to point to the archive. So we've basically fixed one of the 13 million broken links, and we're just getting started. SDG Goal 11, I've got three examples, I'll go through them fairly quickly, of some collections that we've built that are basically helping to preserve some cultural heritage in the world. We have hundreds of thousands of collections, this is just three of them. This one is the world's largest collection of Tibetan Buddhist literature. We teamed up with the Buddhist Digital Research Center and put this collection together. Once again, everything that I'm showing you today, you can go to archive.org and you can click and you can read full text of. Here's another relatively interesting collection, it's of palm leaves with Balinese writing on them. We teamed up with scholars in Bali, and they claim that they think that we've digitized about 90% of all of the literature written in Balinese from these palm leaves. To show you the spectrum of the things that we work in, we have Buddhist literature, we have palm leaves, and we have 78s. Before there was vinyl, there was 78s, these are like record-like things made out of shellac, and the Boston Public Library donated a large number of them to us. We've so far digitized about 137,000 of them. You can go to the archive, we've also created some applications for Alexa and Google Home, so you can go, Alexa, ask the Internet Archive to randomly play jazz 78s. By the way, yes, you can also say, hey, Google, ask the Internet Archive to play The Grateful Dead. We have probably the largest collection of Grateful Dead concerts, more than 12,000 of them. For many people, that's what we're most known for is our Grateful Dead collections. Maybe we'll have people here in this room. Relative to goal 13, addressing climate action, so what you're looking at here is two versions of a web page from a US government website. We worked with the Environmental Data and Governance Initiative, EDGE, and we basically take archives of tens of thousands of US government websites every day dealing with environmental issues, and then we compare them day to day. They just, this organization, EDGE, just put out a 50-some page report documenting about a 25% reduction in information about climate change on US government websites. They would change words like climate change turned into weather, or they would change the title of the scientist that are working to obscure what they're working on. In this case, on the left, you have some information about climate change on the right. You basically are saying access denied, sorry, that information is no longer available to you. This information is all maintained in the wayback machine. You can go back and you can find it. You might be asking yourself, where do they store all this stuff? I don't know. And basically the answer is we do it ourselves. We're pretty cheap. We buy hard drives by the pallet, and we have about 100 petabyte of storage on spinning disk right now. That's about 50 petabyte of unique storage, but we're also pretty paranoid, and so we keep at least two copies of it in different locations and write data simultaneously. And as I said, we're really cheap, and one of the ways we save money is we don't use air conditioning. We basically use cool San Francisco air and some fans. And it works. It works pretty well, actually. We've been working for a long time now on archiving news. We archive about 200 million news URLs a week. And by the way, that's about 10% of the total number of URLs. We archive about 2 billion URLs into the wayback machine every week. That's a few thousand every second. And what you're looking at here is a representation of a project that we engaged in recently where we asked ourselves the question, what are all the sources of news in the world? So we aggregated from a couple dozen existing well-known news sources that came up with the number 176,647. It's obviously a moving target, and it has to do with the definition of what is news, et cetera. But this is our working set right now. And we're both archiving from the set of news sources, and we're also about to upload this to WikiData. Because everything that we do, kind of the idea of you don't have to write code that's already been written, I don't want the next person coming along who wants to have a good source of sources of news in the world to have to compile another data set. So we'll be working on that. And here's one way that news ends up being information we have in the archive ends up being of value. A journalist from CNN, Andrew Kazinsky, uses the Wayback Machine extensively to go back and find information that may no longer be available on the public web. And he found information about someone that Donald Trump was nominating for a high-level position in the U.S. government, and found information about this person's misogynist writings. And that was no longer available on the public web, found these on the Wayback Machine, publicized it, and this person is not up for that job anymore. Yeah, we have a little success story. I thought we were going to get the applause from the Fixie Brokely Lakes and say it's hard to tell the audience, but, or this one, and this one, this is not an applause. This is actually like a real bummer, right? It's like governments change, and sometimes when they do that, they try to erase the history. And when there was a failed coup in Turkey recently, for example, not only is we Wikipedia not available from Turkey, but more than 150 news organizations were just taken off the air completely. And so researchers got ahold of us and said, hey, these archives that you have of these Turkish news organizations are the only records of these that are available today. And here's one, unfortunately, very recently out of the headlines. I said so far that we archived through the Wayback Machine on the web. I talked about books, talked about 78s and music and software and academic papers. I didn't mention television, but we do archive a lot of television. We archive 60 television news channels, 24-7. We've been doing that for more than eight years. So we have a couple of million hours of television news. And like many of the things we work with, we then try to turn that into data in some fashion, extract out the data, extract out the metadata so that it can be useful for researchers and journalists and activists and others. So here's a case in which the New York Times used our data from our recording of Fox News and other news organizations and went through and found the very same words that the murderer in El Paso, Texas was using relative to characterizing immigrants in the United States with messages that were being transmitted by the Trump administration and on Fox News, etc. So basically, if you take media and turn it into data, you can do some more interesting and useful research on it. So here's our big idea, if you will, is that every book ever written. There's maybe a hundred-some million books so far that have been published. But that every single one of those books and that every academic paper, as I said, we've cataloged about 18 million of them open. We know that we have metadata on about 80 other million academic papers. Every web page, including all of the millions and millions of web pages that have existed for a variety of reasons, may no longer be available on the public web. All of these resources should be a click away. Chernobyl was mentioned here and when I saw the Chernobyl HBO special, I immediately went to Wikipedia and was asking Wikipedia questions. And I saw opportunities to help make that better. So I'm kind of obsessive, and so I went online and I bought 12 books about Chernobyl, including some Russian ones and others. And then I donated them to the Eastern Archive. We digitized them all and they're now all available. And so I'm looking for partnership. And every single one of the examples I spoke of today, we're open for partnership to be able to take this information and more fully integrate it in with Wikipedia, in with Wikidata, and with other open knowledge projects. So every book, every academic paper, and every web page should be just a click a day. So please join us to help make this a reality. And if you're ever in San Francisco to support goal 17, partnerships to achieve the goals, come visit us. As you saw in that first slide, our building, it's a church. We bought a church, a church of Christian scientists. And now it's a temple for knowledge, knowledge and information. So I invite you to please seriously come visit us. And some people here have come and have lunch with us. We have a lunch every Friday and we give a tour and we'll invite you into our community and figure out a way that we can work together. Thank you very much. Thank you, Mark. Also to you, we have donated some money. Okay, great, thank you.