 Data management current research in policy and education panel. I'm Spencer Corralis I'm a council on library and information resources postdoctoral fellow at the University of North Texas libraries I'm here today with Martin Halbert the dean of libraries at the University of North Texas Rachel Frick the director of the Digital Library Federation at the council on library and information resources and Bill Mowen the associate dean for research at the College of Information at the University of North Texas We've got these three projects are interrelated. I had the privilege of contributing to the research on two of them And we since we have a fairly short time I'm going to keep myself out of the picture as much as possible and let the presenters go We'll be starting off today with dr. Mowen and thank you again very much for taking time to come Good afternoon It sounds like we're getting audio here so you can hear me back there. Okay. Yes. Okay, cool I want to just do a brief project overview on a current I am a less funded initiative That we have going at the University of North Texas and there is some handouts on the three chairs over there There's a small brochure like this for our project. Feel free to pick one up on your way out. So This project is a three-year effort in capacity building started in September At the University and it's funded by a library for librarians for the 21st century grant And it's important to recognize it's a strong collaboration between UNT libraries and the College of Information Martin is co-PI on this project and as you'll see there's another one that's coming that's being reported on where there's also a collaboration between the College of Information and UNT libraries and it's very fun to be at a place where we can have such strong collaborations between these two units so the goal for this Project which we've called iCamp for curate archive manage and preserve is To develop curriculum within the Department of Library Information Sciences graduate program to increase the number of trained professionals and Disciplinary research and I'll come back to that in a minute for digital curation and data management responsibilities We have a few courses that sort of touch on that in our graduate program right now but we are planning to provide a graduate academic certificate of four courses in this arena funded by IMLS our Objectives are this four course sequence Web based they'll be fully online At the end of the project. We're designing Them to be pretty much online from the get-go, but we're Requiring students to be able to attend some onsite sessions more in the sense of assessment and debriefing and some Participatory design in the first or two offerings of these The plan is that each course will be offered at least three times in the course of the three-year project. We develop deploy Assess revise refine deploy three times for each of the four courses We're also Trying out some new technical infrastructure because of some issues related to Questions by folks at IMLS and other places about to what extent are these online courses that? library and information science graduate students or others really leading to Skills and hands-on experiences and practical training In the same way that they might have had they been in a residential program using computer labs on campus So I'll come back to that in a minute as well and then this teaching environment. It's sort of aligned up with the The technical infrastructure so in a way We're kind of presenting an early kind of framework for what we're doing We have a competency focus for the basis or foundation of the curriculum and we've identified Seven groups of competencies and I'll touch on those briefly in a minute the four courses And then this learning environment multifaceted and multi technology learning environment with the outcome being the You know the information professionals equipped with knowledge and skills to deal with digital curation and data management We target four or three groups One our traditional graduates in Information science in our library information science program, but we have about a thousand Master's students spread around the country many several a number of cohort groups around the country And we need to provide this training not just to the residential students in a face-to-face environment But to all of those students that are interested in it We're also looking at post master's information professionals who need and want to be retrained for new responsibilities in this area And we think one innovative aspect of this Project is that we have funding for 15 stipends to bring in disciplinary graduate students who may have data management responsibilities in their futures such as someone in biology History wherever they might be but are not library information Professionals, but what our hope is that by them taking one or two courses in this sequence Alongside LIS professionals in training that they're going to be rubbing shoulders with their future collaborators in terms of data management and digital curation Our outcomes that we foresee this curriculum having increasing students preparedness by enhancing practical training and that's part of this virtual lab teaching environment that is essential to our curriculum and Also trying to assess positive change in disciplinary specific graduate students perceptions about the library and about the information profession and their willingness and Understanding a need to collaborate with information professionals in their future research lies We also hope to set up some models for how to do technology type courses better in an online environment and also then showing how the the power of the collaboration between LIS education and information education With university libraries or libraries in general in terms of collaboration on projects So our driving question in some ways it deals with how can we improve the nature of LIS education, how can we improve new ways of Preparing graduate students to deal with these amazing opportunities and challenges Related to data management digital curation and preservation The Competencies, let me just briefly talk about those We see our curriculum being based on a set of Competencies as we have worked in the last four or five months in this area We see the breadth of responsibilities that could be involved in data data management digital curation Functions within an academic library and across the university So we are trying to identify a set of competencies that our courses will address and that we can fully Achieve those competencies the students can fully achieve those competencies through these four courses that we're developing now That means that there may be other competencies that are needed out there that we're not expressly able to Deal with in these four courses That's going to be coming out in the next three years as we test out to see what we can manage to address In four course for graduate courses The competency groups we are including professional skill type things such as communication and interpersonal Communication as well as environmental scanning things that These students need to be ready to do lifelong learning in this environment as things change so rapidly just You know thinking back to the presentation at the plenary session the changes that are Happening right now the new second revolution and information technology these sorts of things are really going to be Shaping the roles and responsibilities of these folks as well as more traditional and expected things such as curating and preserving content The technologies and service development around digital curation and data management our curriculum development we have four courses and we're looking at a Variety of pedagogical and instructional design Framework so that we can make the most out of the time and effort that the students are putting into the the work of the four courses and we'll have The first two that will be offered this summer our digital curation data management fundamentals and then one that's focused on the technologies tools applications and infrastructure and these two will be be Very much in line as things are covered Conceptually within the first course there will be activities in the technology course to Bring into bring to bear The skills and knowledge to use the tools and technology the final course We're just labeling it as a seminar right now As we start developing the third and fourth courses next fall that that one should take more shade and Then finally the last thing I want to mention and talk about is the technology This is one area that I feel very excited about and I think there's innovations related to the curriculum here We're going to have three types of learning environments technology-enabled learning environments One is a traditional learning management system We have just released blackboard learn for use as our enterprise system at the University of North Texas And we see that as providing traditional kind of guided directed Study for the students, you know lectures will be up there some communications delivering their assignments and so on but what is The area that we are most interested in exploring in terms of supporting student learning is the virtual lab environment for the last several years starting with a Imls grant a number of years ago Dealing with botanical specimens in a workflow for dealing with botanical specimen data We've been using Drupal Island or in fedora Island Dora was developed at the University of Pratt and Sedward Island as a connector between Drupal and fedora and we're seeing how we can use this as a platform for student engagement Where they will be dealing with a real-life? Repository where they can be putting things in taking things out of the metadata creation all that in a safe environment While having all the functions or a selected set of functions of a content management system like Drupal for web publishing so we have the Announcements in the calendars in the forum and things like that. So we're very excited about seeing how we can leverage this particular Platform for student learning. We are also by the way using the same platform for the project's web presence And this allows us to have one web platform for project management that is Protected from the public view while at the same time having all the data that we want to make public from our project Easily made surfaced as a public-facing thing by flipping the switch in the repository And then finally the third piece again, this is where the collaboration with the libraries is really important We will have a sandbox of the technologies and applications that are used in day-to-day Operations at UNT libraries again a safe place where the students can carry out problem-based learning and using the tools that Martin and his staff use on a daily basis for preservation Curation metadata and things like that. So we think that the students are going to have a very rich technology enabled learning experience in these four courses And just finally the list of staff. There's three co-pis Myself an associate assistant professor in the Department of Library Information Sciences at UNT and then Martin and then several professional staff a research scientist from my College of Information and then two folks from the UNT libraries and We have four student positions right now. We have four pay graduate research assistants to which are filled right now We are recruiting to replacements for people who've left Thank you very much Hi, I'm Rachel Frick. I'm the director of the Digital Library Federation at the Council on Library and Information Resources I'm gonna quickly Give a brief overview of what we've been working on at clear for the last year in regards to data curation education Most of our activities have been Around a Alfred P. Sloan founded Foundation funded activity. It was a planning and research grant to just investigate what is happening in data curation education and when we say data Curation education, I'm trying to say we're really focusing in on data management plans Researcher needs in this particular field So I clear we really felt that Developing and maintaining skills and data curation must become a central part for professional Central to the professional identities of specialists in each discipline if our educational institutions Are to build robust efficient and appropriately integrated online environments for future research teaching and learning? That's a bit of a mouthful With that thought we had some certain assumptions We assume that most graduate programs and the sciences and social sciences as well as the humanities are not well-prepared To cultivate the data management skills of their students or sometimes even have to teach them Why such skills are important to their survival of their fields of study and that in every discipline at least some Professionals must come to the to grasp the complex demands Related to the creation access reuse and preservation of digital research data And in the past this has been in the purview of the library and iSchools and especially those and also computer science So we were wondering how do we begin to stake out or put out some signposts for a new professional identity? Who are these people that are going to be managing our research data and with these questions and April 2011? We were funded like I said by Alfred peace loan to investigate this and look at these questions and recommend a path forward and in our minds Which we were trying to figure out How do we build community around this new professional identity? Where do we how do we identify training and educational opportunities and what could clear do moving forward to help kind of build this? Professional capacity that was a little bit different than what we've been seeing at the iSchools, but come from it from a more discipline centric Perspective So we gathered together an advisory board we have the names of the folks there as you can see a lot of them come from Our iSchools across the country and have been involved with research data activities up to this point And we decided to do two particular studies one was an environmental snapshot just basically just a click a moment in time What data curation? Education programs are being offered in the iSchool. What do they look like? What kind of certificates do they offer are they open to non? lis students Once again, just kind of getting a shape a look and feel the landscape the other Research study was more of an ethnographic study of people who are currently working in data curation positions at higher education institutions across the country and We wanted to take both those studies look at them as well as other things that are being published in the field because they're So much being published in this area right now and try to make some recommendations on how clear could support future education and training Opportunities as well as inform other people who are thinking about developing these types of programs The environmental scan was done by Spencer here at the university in north, Texas And like I said it was to describe lis programs as well as what we're calling extra academic those post professional Opportunities and other alternative models and then make recommendations to clear Highlights from that particular snapshot once again. It was I want to say this was a click moment in time It was not meant to be in Robustly comprehensive for everything, but it's just it's just a nice kind of a way to kind of get situated in the landscape But we realized that only five lis schools Offered dedicated programs for digital curation or data curation. It was that actually gave somebody a Certificate that this was their concentration That there was a growth a certificate programs open to non ILS students And actually just I'm backing up just a little wee bit Since 2006, I mean, I don't think this is any surprise But the Institute of museums and library services through the Laura Bush 21st century library professional program distributed more than six $6,000,000 in grants to develop this data curation capacity in our library schools So one of the things I think that was really helpful that Spencer kind of bubbled up for us was this data curation skills matrix that Cali authored back in 2009 and what was really interesting about that was that he was able to break down just More like functional areas or what we're calling four meta level functions of data curation and this breakdown of data curation know-how as opposed to theory was Helps inform and I think has spurned the development of a more modularized skills-based curriculum that we're starting to see come up Alvaro LIS schools and as alternative programming. So there's a link there. It's really really interesting to look at and Also, we're starting to see Data curation programs coming up in museums Which I think is really great coming to bubble up because there is a lot of research data within our life in our museums and other cultural heritage Institutions, so that's an interesting growth that we're seeing in that area Once again, this goal of this environmental snapshot was to provide a resource guide for us as we think about When I say we clear future education and training programs the other study we did was and what we're gonna say and Environmental what wasn't it was a ethnographic study of data management and curation practice so The goal of this study was to identify barriers to data curation and Unmet researcher needs within the university environment as well as gain a holistic understanding of the workflows involved with the creation management and preservation of research data So the investigators Laurie Yankee and Andrew Asher conducted the interviews with faculty Postdoc fellows graduate students and other researchers in a variety of social sciences disciplines Again for those who are involved in the data curation field. I don't think these key findings are very surprising I think it's really nice when you kind of get another hit of what you think you already know That in regards to training none of the researchers they interviewed for this study received formal training and data management practices Nor did they express satisfaction with their level of expertise? They felt that they were learning on the job and in an ad hoc fashion in regards to preservation few researchers think about long-term considerations of their data and a Lot of times the demand of publication Outweighs the demand of assigning metadata or any type of rights are giving you any even writing a code book to help you understand it later They only think about that if it's only interesting to them if it helps them with their research work Collaboration and data sharing there is definitely a need and I think we all knew this for a more effect More effective collaboration tools and spaces that support the volume of data generated and provide appropriate privacy and access controls This leads me down to the last bullet not the last one next one. It was basically some of the things that Researchers would rather risk data loss than actually Inappropriate data exposure does that make sense? They would rather hold on to it then And that way have more control over it and that researchers in general and I think this Notion is backed up with a previous Ithaca study a long time ago about Science e-journals or journals in general that researchers view libraries as a dispensary of goods and not really a locus for real-time professional researchers support so putting this all together and the recommendations made in the secondary study that Like I said this this falls in line with conventional wisdom There is unlikely to be a single out-of-the-box solution that can be applied to problems of data curation We know that instead an approach that emphasizes Engagement with the researcher and as a dialogue around identifying or building the appropriate tools and I thought that was interesting for their particular project, it's I don't even think it's something that we can You know stratify out to a specific discipline is really almost projects specific That research must researchers must be given access to adequate network storage a lot of times They heard over and over again that they just did not have the network storage to place all their data So it was running around on thumb drives on You know servers in their department underneath their desk It was just all over the place because they couldn't get it They didn't have enough storage allowance on their university's network storage or the other hand was they were working with multi-institutional partnerships and Their networked policies went allow people from the outside to have access to the storage on that particular university's network that The other recommendation is that education and other training programs that focus on early intervention in the researcher career path are likely to have the greatest benefit and That data curation systems should be Integrate with active research phase like if you're backing up or doing collaboration work or built into the tools that they use kind of like The project that's going on with Microsoft research in California Digital Library about enhancing Excel in order to facilitate data curation and once again the privacy and access controls that You know ethical risks associated with inappropriate data release outweigh the risks of data loss I kind of ran through those studies really really fast and bring kind of cursory They will be polished in two weeks on the clear website Once again the environmental scan was a snapshot There's some really great jumps in the ethnological study But those two reports together we review those and then combine them with once again What we were looking out and seeing what was published in the field what the IMLS was funding around data curation education and We developed a follow-up proposal back to sling once again. How do we start developing this professional identity around? data curation that isn't so Hard and fast coming from the LIS school But it can be seen as an alternative approach coming from the discipline side Because as you can see researchers really align themselves with disciplines and not their institutions And we felt that the network of data specialists that are supporting data curation should also be similarly aligned to the disciplines And that they should be integrated into the research team I saw a presentation by Susie Allard at the research data and archiving preservation summit the other week down in New Orleans And she said paraphrasing completely That those data specialists really need to be at the elbow of the scientists and be right there as part of the research team But what's really interesting by developing that data specialist that also has an understanding of what's happening in the library and IT they can also serve as an advocate for the researchers To I guess broker within those local systems How do you go to IT and talk to folks about can I have more network storage or how important it is to have this multi-user access to a Certain portion or a cold room someone's talking about of of data so people can work together and that and this is the part that clear really Feels a passion for is that this should be a defined professional path and not a secondary career choice I Remember seeing a presentation done by say each order. I really didn't think he'd be in the room It was a couple years ago And he had actually said something like somebody's gonna have to take a hit for the team in order to manage the data and We feel that it's absolutely true But when that phrase taking a hit for a team that seems like almost a negative commentation that this career path You know it's kind of a substandard Choice when we really think this should be a first choice, you know that by signposting this out It could be just as a valuable career choice as being the researcher itself so going back To the Sloan grant that we just received On March 20th for Rolex say about that is that we're going to expand the existing clear post postdoctoral fellows and academic libraries program to one that To have a subset so our concentration and data curation We're gonna be able to fund six positions focusing on sciences and social sciences We have a nice advisory board, but what's really interesting about our advisory board They're helping us not only that applications for the positions But also they're going to serve as mentors to these individuals as they work and they're at their host institution So they'll have a host they'll have a mentor that's from their discipline But also a mentor that's from a data curation side so they can help them guide them I think one of the things right now that was the biggest challenge for our data curation professionals is that they tend to be One person or two people in an institution and they are often feeling Alone or set of drafts. So how do we start building these connections between? Mentors who might be in their high schools as well as this growing community nationwide Going back and connecting to the growing data Curation community when I was talking to Susie Allard some other people involved with data one and data Conservancy there is a lot of outreach and education activities and those two NSF data net Funded proposals we want to work with those projects as well as some of the other newly funded data net projects that are doing some Education and outreach in this area to connect those threads together and make sure we're not building Siloed communities of data curators, but also just trying to knit them together So we're working on Activities and how do we invite people in how do we send these postdocs to these other events so they can Meet their other colleagues and peers We're accepting applications for those six postdocs now until June 30th We're going to be reviewing applications on May 1st And there's a URL right there at the bottom for more information, but there's also handouts out by the door by the ICAMP fires We have positions at these at our six institutional partners They're Indiana University Lehigh University McMaster University Purdue University the University of California at Los Angeles And the University of Michigan I'm really invite you to click on the URL and look at the position descriptions there I put up just a brief example from Lehigh The reason why I really liked all these position descriptions when we went and asked folks to Kind of like it was a really tight timeline to get this in for that this year's postdoc Cycle We were telling them what we did the results of our research and the idea of having this person that stands in Between the library and the data center or the library and the researcher kind of that liaison role with a discipline focused and as you can See with the Lehigh position It's one that's focusing on Earth and environmental sciences. It's working with web and GIS folks. It's I'm sorry the font It's really really small But it's standing in that sweet spot in between so it's really it's not so tightly Bound up with the library, but it's more it's more situated with the researcher and the other five positions are very similar So, you know as all good people do once we get one grant we try to plan for the next one so our plans for 2013 is to expand the program with more fellowships and To actually expand those to possibly include Positions that are in the humanities as well as the sciences and social sciences For 2013 once again connecting with other efforts as they're as we know about them and and doing a lot of outreach and Communication about this program because we feel it's really important to stay in Step with what's happening with the LIS schools. We don't want to see this as something Competitive we don't want to see this as something that's Distracting from the overall effort, but we want to see something that's complementary and supportive of the larger data curation education efforts We're also going to be seeking host institutions for the next round of Postdocs that'll be for 2013. So if you're interested, please contact Krista Willaford She's here today raise your hand Krista her emails below But we also have a link to the press release that has more information about that program I know I whipped through that. So if you have any questions, I guess at the end of the panel or buy online All right. Thanks folks Okay, I'm here to talk to you about the data res project. It's a Investigation of the emerging landscape of research data management practices and policies in the country and has a a lot of interconnections with the other projects. You've just heard about We're now at the midpoint and ready to share some of the initial findings and Outline where we're going with the project. I first want to give you know credit where credits do Here's the list of some of the principles in the project and I want to particularly Commends dr. Corralis and my strategic projects librarian Shannon Stark for the work they've done on this They've done all the boots on the ground work of analysis in these in this project so far Bill and I are co-pis on the project But also Rachel and Claire should also be up here listed. They are also playing various roles in the project I'll talk about in a moment So you would have had to have been living under a rock for the last two years to not understand the motivations for this project and the prominence and criticality of Data management in the emerging research landscape today You know everyone heard the the comments and the great keynote by dr. Duder stat this morning about big data You also many of you will have seen the webinar in the last week By the heads of several major federal agencies NSF NIH many of them, you know talking about the administration's new programs and importance that they're placing on on big data This project was intended from the beginning to try to get traction on this emerging landscape and to study both the the agency the funding agency priorities Mandates expectations in this area for research activities in the country and also the emerging institutional Responses and interventions into data management that are coming about as a result of this Most saliently we're trying to figure out. Well, okay, where does the the extended library and information science? profession go from here. How do we respond to these new needs and Requirements and and what's the best? What are hopefully best practices that we can identify? it is a collaborative project between the University of North Texas libraries the UNT College of Information and the Council on Library and Information Resources and we were also obviously working with many institutions around the country to delve into what their practices are so our first methodology I'll first talk about what we've Found out uncovered in studying Institutional responses to these mandates, but then I'm going to delve into a little bit of the detail That has emerged from a close analysis of the of the agency mandates or requirements and also some focus group discussions with both groups of stakeholders First, you know our initial methodology was to follow the money So to first look at the agency the institutions rather that have been the biggest awardees of the biggest recipients of awards from both the NSF and NIH Perhaps not surprisingly. There's a lot of overlap between those institutions when you look at the top 200 NSF awardee schools in the top 200 NIH awardee schools. It produces a set of about 220 institutions Maybe the first surprising or not so surprising Finding was that in that group of 220 institutions? 72% of them do not have any institutional level policy or really even guidance Governing the retention and sharing of research data So the response of most institutions to date is still pretty scattered pretty Non-existent in a at an institutional level to these mandates In looking at the 50 to 60 Institutions that had some kind of official statement about research data management at a either a policy or guidance level We we found some interesting things And we're delving now we're focusing more on this smaller group of institutions where there was a Coordinated systematic sort of response to the requirements of the new federal requirements First the biggest single group that we did find about 30 institutions at which the libraries in the in the institutions have stepped up To provide the support and guidance or have been the most prominent in Responding to those needs and putting out You know either explicit Tools like the the CDL tool that's been developed in conjunction with other places You know around helping people develop a data management plan other kinds of instructional sessions and Kinds of interim interventions to help schools and the researchers in them Respond to the new data management requirements We did find seven institutions at which support is provided by the Office of Research or the campus IT organizations And in some of these cases there were close or at least ostensibly close Cooperative relationships between these different groups to try to come up with a consistent and coordinated support set of services for Research data management So one of the things we're doing now is doing closer analysis more interviews with these schools to understand Understand how the their response is the ones that have had it's not surprising I mean the one the institutions that are sort of the farthest along on these areas are ones that you would expect You know that get a lot of federal dollars Places like Purdue, Minnesota and others The the interesting you know as we delve into this We're starting to suspect that many of these more coordinated programs came about as a result of the NIH requirements that came out before most of the other agencies and you know as far as we can tell we're received without a lot of the same sort of level of ripples of of You know surprise and Uncertainty that some of the other federal agencies mandates were received with We we think that the you know well that led us to study well Okay, what are the differences in the different agency mandates and how have people how have institutions responded differently to them so You know we actually used a number of research methodologies to first study the agency Either mandates or requirements or in some cases just a form that you have to fill out as a part of a Grant application process and while some of these these We use text analysis tools and various things and the thing that actually Worked the most as a heuristic as an immediately accessible kind of heuristic to look at these Mandates was something as simple as wordal So we put the the requirements from these different agencies through wordal clouds and it was sort of interesting I'll talk just a moment about each of them. So this is the The wordal that emerges from the final NIH statement on sharing research and you know What's interesting here is? Just obviously the prominence of that those two words data and sharing which are not as prominent in the other agency Requirements, we think this is you know largely because of the long history Yeah of regulation of data and study outputs in the medical community You know the there have been hit requirements like HIPAA and and obviously IRB requirements in medical research for many many years and We think that this prominence and this sort of focus on data research data sharing that comes out of and is You know obvious in the wordal cloud from the NIH policy is Perhaps not surprising given that the background of the medical field Okay, so then we'll contrast this with two other federal agencies and it's and then talk about the the focus group Discussions we've had with both institutions and the agency reps about them So this is the wordal that emerges from the the NEH requirement Big prominence of data, you know big focus on data and to a lesser extent data management You'll see a an acronym that appears in their DMP data management plans. There was a big part of the NEH Discussion of this You know what this says to us if you know about the hit the recent history of NIA and EH They've been focusing a lot on you know new digital humanities endeavors They now have a new division of digital humanities and they're really focusing on trying to understand what data means in the humanities How you manage data how you curate it long term all of the kinds of things that that both Rachel and And Bill were talking about We also analyzed the NSF requirement. This one is a little more Diffuse or at least balance between different terms is no one term that rises quite to the same level of prominence as in the others actually the One term that we took out of this because it was such a preponderance of occurrence was the phrase NSF itself and You know, I I'm not trying to give NSF and Amy back there and other NSF officials that are might be in the crowd a hard time What what we understood from this is that there is a shift that you know I came about from a lot of careful planning in NSF to try to get to a more consistent long-term understanding of data management and Managing research data rather than simply, you know doing your research and then loot promptly proceeding to lose your data A lot of when you delve into the text of this set of requirements What emerges is trying to you know have a teachable moment where you can have a conversation About why you need to preserve Research data over the long term with researchers that perhaps have not thought about this as much now when we had Focus groups with NSF officials Amy and others included one of the things that was surprising to us When we contrasted that with the focus groups we had with researchers and librarians is there there may be some disjuncture here in You know where where the impetus for this is coming NSF is very much expecting the standards and the the best Practices to emerge from the community Whereas I think a lot of librarians and researchers somehow think that this is You know that there's some secret agenda in the background of NSF officials minds of what the best practices are That's not the case at all NSF is looking for the community to step up to the plate and develop the best practices for long-term data management Which again begs the question. Well, okay, where is the community going to go with this? I will Speed this up a little bit in the interest of time and having some discussion because I'm very interested in hearing from this crowd About your thoughts on research data management But let me just say a little bit about some of the activities that we have undertaken in the course of this project and Where we're going to go with this research next so I've mentioned some of the focus groups that we've done today And we've had quite a number of them now in sort of close conversation with Different stakeholders in the entire research cycle Starting with funding agency officials, but then going on to researchers Librarians IT Special IT specialists on the campus that are in charge of managing data But also officials like provosts and vice presidents for research that whose job it It is to lead the academic community in how they they conduct research and presumably where they're going to go with The maintenance and curation of the outputs of research over time Okay, too and those focus groups have been very illuminating as I said We're going to continue doing them and doing them in a more targeted way now that we've gotten some broad results that we can zero in on specific institutional Responses and the differences between them we have as a this thing that we're doing right now in the project a Primary survey of all the different stakeholder groups that we're looking at some 12 different stakeholder groups and We have a little QR code for the survey We're encouraging everybody to take it. I don't know are these distributed around the room Spencer Okay, so you should have access to one of these little things in a in a handy way We would we've already gotten about 200 responses to the survey and Are beginning to analyze it and we're going to be closing that home next week So please do if you get a chance, you know fill out the survey We are very interested in the responses and the perspectives of all the different stakeholder groups in the research cycle about data management We will follow that survey once we have a chance to analyze its results with a secondary closer survey of some of these administrative officials because we do want to get at the question of well, how is this question of Managing research data being prioritized in the very challenging Landscape of funding that you heard about from dr. Deuterstadt this morning when institutions are you know sort of beset with a Raft of priorities that none of which they can fund How are we? Well, are we going to step up as institutions to the very challenging? Requirements of managing a whole new realm of content that we haven't historically done a very good job of managing yet Where then you know we have already done some key informant interviews But we're going to continue with those as a way of zeroing in on more detailed findings Finally a couple of the final three deliverables from this project that will be upcoming I'll talk about individually in a couple of slides first is an upcoming Symposium that we're hosting on open access and research data management and Dr. Myron gutman the Head of the NSF directorate for social behavioral and economic sciences will be our keynote. We also have provosts again vpr's scientists and data management specialists from different parts of the data curation life cycle Speaking at this event. I think it will be a very significant set of conversations So please if you if you do want to come, please you're welcome and on the second day a a group of die-hards from this group will get together and Try to come up with some more prescriptive statements about data management You know going beyond the the sort of the level of well, these are our minimal requirements You know we hope we do these things well ideally if we could step up to the the highest level of the most Prescriptive level what we would like to be able to do what would that manifesto for research data management look like? We're going to take a stab at that and develop it as in the course of this project we are also in the process of We've now put out a call for proposals for the culminating event of this project a Symposium that's going to be held as a pre-conference to the fall DLF meeting this year It will pull together peer-reviewed papers from major practitioners and thought leaders in the country on research data management practices Policies responses and interventions in a volume that will be published as a clear report ultimately The submission you know requirements and information are listed there Okay Yeah, let's wrap up That's it. So let's go to Q&A Steve gas from MIT and Martin I have a question about your project I was curious as to whether you considered analyzing the responses to the RFI from OSTP in December as part of the work that you're doing in this project We absolutely have and that's a great point the You know, there's so much information so many responses coming out right now We all as a field are trying to feel our way through this new landscape There are a lot of things that we are looking at analyzing but those are the You're talking about the the couple of different requests for information requests for comments that have come out There was one the White House RFI on on data management and digital curation topics There was also the request for comments on the NSB the National Science Board Recommendations to NSF that that went out not that long ago that we're received both of those are things where we are looking at Analyzing we just haven't gotten to it yet. We're trying to the first the next step We're trying to get through is our our primary survey, but we are going to be looking at products like that other questions I Have a question for the audience if I could see a show of hands of if you're at an institution Where you feel like you have at least a Coherent, you know on you know working Research data management Program, okay, just a smattering of hands Yeah other questions You know I want to hear I want to let Spencer give a shot at that because he's done the most close analysis I think it varies greatly depending upon the research needs of the individual institution. I think The in some cases the what the responses we've seen range from like a little links page where they say here's the NSF's Guidance to researchers, and that's it To very robust programs like at Penn State and MIT, and I think MIT's goes back to the NSF or the NIH mandate in 2003 Minneapolis or the University of Minnesota in Minneapolis provides workshops for continuing Credit for PI status as part of their program and that's through the library so I think some of these responses will be driven by the demands of the researchers on the campuses and it's going to vary widely from Institution to institution how robust they actually need to be You notice I did not hold up my hand, you know part of the impetus for doing this This project was honestly a self-interested one of informing us in Preparing to you know come up develop a Such a coherent plan because I we don't we definitely don't have one yet Yes, Shirley Baker Washington University in st. Louis. This isn't a question. This is a comment For those of us who are feeling our way in the dark This is a very encouraging set of reports on research And you begin to see that how the landscape might form and we might actually get our arms around this good work Thank you as much as anything these you know particularly the data res project was Intentionally proposed because we knew that the landscape was going to be changing a lot in the next two years And we wanted to do an a activity to sort of capture that as those changes were happening So we're taking notes of when particular events and things like the RFIs that have been in RFC's that have come out Recently and other Responses out of both at both the federal level and the institutional level are happening and how they You know, it's a an ongoing dialogue that you know that builds on itself I think that we can look forward. I mean this is a really seminal time if you sort of think in a a Metaphor of an orbit You know, we're sort of at perigee right now So anything that we do to change anything that we do to lay in new capabilities will have the most effects now Because we'll be able to build on the efforts that we do now the Investigations that we do now in ways that will inform where we're going as a field. Yeah, Trish Hi, really great presentations. Thank you for all this wonderful work. I just wanted to point out that data one Has done two surveys that people might find useful Carol Tenapier and Susie Allard would who Rachel Frick mentioned They surveyed attitudes by researchers who are managing and sharing data and that was that was in plus So if you go to plus and just look for Tenapier, you'll see you'll find the article but they recently did a huge survey of libraries and librarians and That that work should be out and available very soon. So that should be a nice window into some practices That's wonderful. Thank you Trish Elizabeth Long University of Chicago. This is really for all of you though. I think particularly for Rachel. I was really Glad to hear your comment about wanting to make data curation, you know a real career path Not a kind of you know, oh, I have nothing else to do kind of career path Though it is something that I think a lot of students are feeling and I've certainly heard and seen this and more than anything seen people in Interested in figuring out well, what do I do if I can't get a job, you know as an academic in in my field and they're not Finding the right advice among their faculty and there was a you an example of this is is About a year ago, maybe a little less Fermilab had was hosting a data curation Workshop on in in HEP data and you know the field itself is very split between kind of what the role of Sharing data is you know data curation, etc But there were certainly a group of people very interested in it and there were some who themselves were talking in this way about that And yet what you could see were a lot of older faculty Not seeing how that was a career path Not seeing in fact or even in some ways aware of the fact that the young people in the room were sitting there saying We don't have any other career path. What do you do? I've had humanities people come to me asking about this So I guess really what my question is is what can our field do Especially given what some of the comments made today were which is this needs to in many cases be embedded in the Disciplines not necessarily in libraries to help generate the right kinds of You know new programs new ways of giving graduate students advice in the disciplines to give them some kind of career Paths so this doesn't become what they come to when they're about to graduate and they suddenly say what else do I Look for I think we're trying to figure that out. I mean I not to say you know Clear in the past with its postdoc program is has tried to find these alternative careers for academics because what was it? Only 20% of people with postdocs actually have teaching jobs and we tend to keep on minting them So how can we but there there is this crucial piece standing between the disciplines in our libraries and our data centers and and I'm gonna call out somebody that I got to meet in the last year Carly Strosser she works at CDL She was getting I think she has a PhD in biology if I'm correct But she said you know here She is putting stuff on Petri dishes and going there has to be something more you know and she just stumbled upon This job at CDL working around data curation She's actually one that keep people working on the Excel project with Microsoft research And it was just having that conversation with her and she said you know if I just didn't if I hadn't Stumbled upon this and met all these great people and see this exciting work Where she felt like she actually is making an impact and a contribution moving forward She said you know I had no idea this was there So the challenge that we're seeing is how do we start really just putting stakes in the ground and saying go this way or go that way and also working with our LIS schools and Working with them and showing how we can open up the curriculum for folks to go and and and seek out these programs So it's it's it's really just wayfinding right now and trying to bubble up some of the things that are happening I know there's some education stuff happening with data one I think Trisha's gonna tell me about this, but it's it's it's there's some things happening right now And it's and a lot of times it's just keeping track of one of all the good work That's happening right now. And so when people do have that question you say go here or go there or contact these people Yeah, I think I have a little bit to add to that before I'll let Trisha go first really quick Go ahead Trisha Well, just really quickly one thing that I hear a lot and I agree Carly is a wonderful person And we wouldn't have found her unless we kind of bumped into each other But one thing I hear a lot from Colleagues around the UC system is they would like to hire a person a data curation person This is gonna sound really silly, but they're not quite sure what the job at should look like You know what what do you put in a data curate? And so you know we kind of have this little stable of job ads that we've been shipping around So I think that would be something really Helpful for the community to say what what are the kind of expertise that you're looking for and what does that job ad look like? I know that seems silly, but that's not silly at all. It's it's absolutely central to it What is the preparation? You know for for somebody in that role and you know and look well, and I know You did a lot of analysis of those job ads bill. Do you want to say anything about that? I Think again, there's ways that we're trying to figure out what this landscape looks like for work and the responsibilities and The document that came out about it almost two years ago now I guess it was maybe a year and a half ago that harnessing the power of digital data Where the matrices and the appendices I thought were very an interesting way of showing the variety varieties of entities and Personnel involved in data management data activities And so I think you know there's going to be this uneasy time going forward We're trying to figure out what set of competencies we can actually address Without getting into data science in a way Which is another realm but depends on how you slice the slice the pie one thing I would like to suggest though is I think that you know, we're trying to have conversations with administrators administration University administration to talk about the you know this incredible value of Digital data and digital scholarship and as we start to see maybe a new discourse in the academy Around the value of curating data sets. I mean, what's an article versus? What's the data set the value the reward structure possibly as Along a parallel track we not only help define New job opportunities and career paths, but we also see a reward structure within the academy that Acknowledges the importance and the vitality of the data that is going to power the sorts of things that we heard in the plenary session earlier Spencer you had yeah, and I think we also need to be better at speaking to the disciplines You know, it's one thing to for us to come together at CNI and DLF and talk to each other but we should be proposing panels on Pro, you know thinking about data management to the MLA AHA ICPSR You know, there's the they need to we need to do outreach and let them know what we're good at So the and help them understand what their needs are better And I actually Corey Jackson who's another clear fellow and I just proposed a panel on on data management to MLA So fingers crossed that that gets picked up Go ahead a good point I'm Tom Wilson from the University of Alabama where we are Literally in the midst of building the necessary on-campus partnerships to have What was that phrase used Martin a robust? sustainable data Management or a curation program I Have a couple of thoughts. I don't know that they're necessarily really questions And at the risk of irritating some of my colleagues in certain Sub-disciplines of the information science as I will proceed anyway I'm a little concerned that Well a couple of things one is that we truly believe a single individual can take care of this process for an organization in my decades of dealing with a Variety of library issues. I've rarely found that to be the case And a more recent historical experience that we've had as a broad community is with digital library programs Those are not a one-person. They might be a one-person, but in terms of ideally they're not a one-person kind of thing Nor would you necessarily find one person who had the? Desired background so I would ask that we not approach Data curation or data management in that regard either I also think we While we curation is a good shorthand term to use Historically that has been affiliated primarily with the archival community and While certainly that community has much to offer in terms of practice and procedure Data management is not the same is curating a special collection is very different And again, I think maybe the answer to that is a team approach The third point I'd like to make is that while I'm all for establishing educational opportunities for say pre professional People coming to us either with a disciplinary background or with a perhaps more somewhat more traditional library or archival background But I'm also interested in what we're doing for people who are already in the tracks We're not always in a position to be able to say oh, we're going to hire some direct person directly out of Graduate program with a certificate in digital curation because we we may not have a position that's open So how do we? Get in and work with our existing Faculty and staff to bring this understanding of what are the principles that need to be applied here? And how do we how do we deal with this? And I might add also that a piece that probably most library schools are not good at is informing people how to deal with the politics and Being at the central core of this in my institution. There's an extreme amount of politics involved here Well, you said it Tom. I mean I'll ditto everything you you just said it You know with as many different Subdisciplines within librarianship in the traditional sense. Why would we think that you know data science something is as complex as this emerging? You know totally new, you know fractionated, you know type of digital of content would be would be any simpler or something that we could consolidate into one position and Absolutely, we're having to you know grow people that are in our staffs in an existing way So yeah Very good points, and I think that was one the motivations behind the expansion of the postdoc program because we know it's going to be a spectrum of professionals and Somehow this this network of teams coming together to handle this problem because it is so complex And I think we're cutting into valuable coffee time now. So We thank you very much for the discussion and we'll be around all week. So thank you