 My name is Martin Kalpatovic. I'm the Associate Director for Digital Services at Smithsonian Libraries. And today, we're going to talk a little bit about our Smithsonian research online and its relationship to a number of national initiatives, including the OSTP policy on open data. But we're also going to be doing a bit of a history lesson today. There won't be a test afterwards, but I want everyone to sort of see where the Smithsonian fits into this larger federal context in terms of the OST mandate and open access. So the Smithsonian Institution was founded at the bequest of James Smithson, who was an English scientist, who after a series of complicated things left his entire estate to the United States to found in Washington an institution under the name of the Smithsonian an establishment for the increase and diffusion of knowledge. And it's these two missions, the increasing of knowledge and the diffusion of knowledge, that we're rolling into a lot of our Smithsonian research online and how that all fits together. So an important part of the founding of the Smithsonian was the plan of organization for the institution. This is Joseph Henry that you see on the screen, who is the first secretary or leader of the Smithsonian. And under his guidance, one of the things before the formation of the institution was that it would be comprised of a library, a museum, and a gallery of art, together with all of the different things associated with those things. So again, from the beginning, the Smithsonian was related to all of these different aspects. It was also important that there would be no restriction on the type of things that would be covered under this. So it's art, science, history, literature was actually included in this original founding documents. We don't really do as much on that as we would under that organization. Also in this founding document was a library and our first librarian was Charles Coffin-Jewitt. There was a lot of little political and personal things that happened over the years and the library was sort of shunted aside a bit at the beginning, but it all came back in full force in later years. And again, one of the key things for the library was the gathering of documents related to the admission of the institution. Now here's the important part. Many people feel the Smithsonian is a federal institution and although the bulk of our funding comes from the federal government, we're actually a trust instrument of the United States people that's jointly governed by the Congress, the Executive Office, the legislative branch, and citizen regions. So although we are not technically a federal agency in any sense of the word, we do try to adhere to many of the different federal policies and procedures that the federal government puts out. So under the context of the OSTP mandate, we're actually complying out of will as opposed to force of law. So the Smithsonian has a number of museums in the National Zoo. My slides didn't seem to get updated on this network. There's actually, depending on how you count, 19 museums and galleries or 15 museums and galleries depending on how you're counting the different portions and the National Zoo. We also have research institutes. The Smithsonian Libraries is one of the research institutes as well as these other elements that you see on the screen including the archives, et cetera. So there's the museums that are the more public component and then the research things that happen more behind the scenes. We also have a number of research centers. This number should say 16 and these are all the different study centers that are created at the Smithsonian to do very specific projects. In all of these activities that are going through the museums, the research institutes and the research centers are all generated in the type of research that we are collecting within the Smithsonian research online and then distributing and making available through those different mechanisms. So from the very beginning, the Smithsonian was also focused on publishing the outcomes of the research. What you see on the screen is the very first Smithsonian publication which was in a series called Smithsonian Contributions to Knowledge and it was actually on the mounds of Central in the Midwest area and so this was the outcome of research done on the early burial mounds and other archeological sites of the Midwest. We also, as the years went by, had a number of other Smithsonian publications. Again, part of this was many of these were published by the General Printing Office in later years. So these are some of the major series of Smithsonian publications that were issued from the mid 19th century onwards. Many of these were all published by the GPO so in this case they're actually federal documents and again so all of the things that go along with federal documents accrue to these publications. In the 1960s, those publications sort of faded off and we created a new series of publications called the Smithsonian Contribution Series. These are the most of those. They have new ones have come and gone over the years but this is sort of the bulk of those Smithsonian Contribution Series. These are all published by the Smithsonian Institution Scholarly Press currently and we've been working very closely with the press at the libraries over the last few years building the website, making all of these available digitally at the point of publication so as these are published in print and distributed through the Federal Deposit Library System they're also available freely, available online through that same process. So what we wanted to do a number of years ago is create a system where we could track the scholarly output of the institution and this actually started with the help of the Smithsonian Institution Natural History Botany Department which had been tracking their publications over the years. The library took over the management of the bibliography of Smithsonian research for Botany and then expanded it to the rest of the science unit. Our director Nancy Gwynne who happens to be in the audience today worked very closely with our Under Secretary for Science to convince them that the best way to do our equivalent of promotion in tenure which is called PAEC would be to use this bibliography as the source for that data for all of that. That was then expanded to include also our history, art and culture units. The Smithsonian Research Online was built to include a number of different systems and services that would provide the access to that information as well as reports and services that would then flow back. So the Smithsonian Research Online is comprised of two main portions at the moment. There's a research bibliography that has a little over 73 maybe 74,000 titles in it right now and then the digital repository which is a more traditional institutional repository with about 20,000 objects in that. I'd said earlier that the Smithsonian encompasses all different types of research but of course within the quantity of scientific publications science does tend to dominate. So here are just some recent statistics that show the breakdown between science, history, art and culture and then our publications from the United States National Museum. So again as you can see science dominates the research output of the Smithsonian as it does with our budget. So this is showing the quantity of publications from each of the different museums or research institutes of the Smithsonian. Basically you can ignore those small numbers. So here's the key part which basically shows that the primary output are again coming from our large science units of the Smithsonian, the National Museum of Natural History, the Environmental Research Center, the Astrophysical Observatory and the Tropical Research Institute. So the key thing here again is that for a compilation of the content within the bibliography, working within the science is much easier because again most of that comes from web of science and related things. And at the Smithsonian, the research output of history and art and culture is much more complicated. In our Center for Folklife, one of the scholarly outputs are liner notes on CDs that are created from the Folkways project. That's the kind of thing that doesn't usually make its way into most online bibliography. So we have to manually gather a lot of that information. This is just another representation again of that same information showing again science which is green in our world is very large. Over the course of, we started in about 2008 gathering the information for the Smithsonian Research Online and you can see the growth of the content over the years. As again more of our Smithsonian units came online to participate in the Smithsonian Research Online and the big bump there is when we did a retrospective conversion and gathered a whole bunch of early publications. And again this shows again how that content acquisition comes in through alerts, et cetera. I'm not touching anything. But the important thing is actually how the data is reused on a regular basis by our different Smithsonian researchers and administrators. So again, this is a one-stop shop. So what we don't wanna do is have researchers either have to enter data or gather information in numerous places. So by compiling it all within the Smithsonian Research Online all of that data can be repurposed back out to the different web pages. So the example we see here is botany. So our botany department web page for each of the researchers can just use, we're now using JSON scripts to pull the information from SRO and to populate those individual researcher pages. We can also do the same for departments or a whole museum. It's also an important tool for our administrators to report back. So here we just see some charts and graphs that are showing how we can output this for the administrators at the Smithsonian so that they can report back up to Congress and their other constituencies, the type of research that's being done. We also calculate number of covers of science and nature magazines so that we can actually say that we're 12 covers of nature in the last three years with Smithsonian authors on them. And this is just some statistics on use showing where those different users are coming from. And again, usage by the different Smithsonian museum content. So again, science again is highly the most popular. We're also working with altmetrics and some of the other new ways of measuring usage. So we're adding altmetrics content to the different pages so that researchers can see where all of that tweeting and blog post and news outlets are coming for. So here you can just see a couple of altmetrics pages for different publications, geographic coverage which is very important for us. So the question is, how does this all sort of fit into our OSTP compliance? Our OSTP compliance was generated from our Undersecretary for Collections Interdisciplinary Support, which is where the library reports and our DUSCUS gathered together a group of employees at the Smithsonian from different units, the library, the archives, our Chief Information Officer, to design a response to the OSTP policy and come up with a way that we could provide that type of information. So the basis for this was, we decided it would cover both Smithsonian Science and non-science materials. The Smithsonian scholarly press would be one of the key players in terms of managing the workflow of publications. We have been in conversation with chorus and shares in terms of working with them for all of this. And all of it plays into the general desire for an increased open access to the Smithsonian content. So a couple of years ago, we were a signatory of the recent Berlin Declaration for Open Access and this all fits together in that area. So one question, again, with the OSTP mandate is about data, where does data fit in? I was told to tell Josh Greenberg the other day about this, he said, where does data fit in? I said, well, it doesn't really, we're sort of punting on the data portion right now, as are many people. So right now what we are doing is including data sets for published materials within our repository system, but for ongoing research, we're trying to figure out a way to do this. And many of you know Thornie Staples who works within our Chief Information Officer, used to be University of Virginia, et cetera. Thornie is working on what he's calling Sidora, Smithsonian Institution Fedora repository and that's gonna be for ongoing Smithsonian research. So we're currently in a beta version of the Sidora project, which will be a method to capture ongoing data from the Smithsonian researchers and we're currently working on an entomology project as our test base. Library staff are working as sort of the docents along with the researchers on that and we'll be moving that into a test instance of Fedora to do the data. If that all works out, that will be our data component for the OSTP compliance. And here's just a screenshot of the current behind the firewall Fedora installation. As I mentioned, we are working with Cores and shares. I like this slide, there's a picture, that's you can make and tell, but that's the space shuttle doing the overflight of the National Mall and there's a panda on the top and one of our scientific researchers, scientific directors, the head of the zoo was telling the air and space directors, really the best way to get publicity, if only we could put a baby panda on a space shuttle and that would be the best way to bring out the sort of importance of Smithsonian research as flying pandas. So I want to bring together pandas. But importantly, what we're doing now is as many people are, is we want to integrate the personal element as well as all of the other types of things that comprise research in that scholarly environment that OCLC's documented. So we are instituting a vivo installation under the name of Smithsonian Profiles. It's currently in beta testing right now. We're importing all of the Smithsonian data, tying it into our grants research. It ties back research, grant research data, ties back into the bibliographic data and a number of other sources. So this will enable us to provide a bigger holistic view of the Smithsonian research. There's just some sample pages from that. And here's a sort of not yet fully branded version of the vivo pages. So what we like to think is that, well as in the words of our first secretary, Joseph Henry, it's not really the stuff that you keep inside your walls of your institution that make it important. And at the Smithsonian where we have 137 million objects in our collection, we have a lot of important stuff. That's how we actually get that information and that content out to the greater world that actually defines the importance of the institution. So on behalf of all the Smithsonian staff, I'd like to thank you for your time today. And I'd also like to thank some of my staff, Alvin Hutchinson and Richard Naples. You may know as the gift master from the Tumblr for their help on this, as well as the rest of the Smithsonian staff that have worked on Smithsonian research online for the last number of years. Thank you.