 So welcome everybody to the presentation models of support for data science and the perspective of two libraries I'm young browser. I'm head of research at the State and University library in Göttingen in Germany and With me is my colleague. Well, I get he's a professor for scientific Information analysis also from University of Göttingen and we have David Minor He's head of the research data curation program at UC San Diego and these are the two libraries that we're talking about Göttingen and San Diego So what are we talking about? This is a brief table of contacts. We will talk a bit about the difference in our years in Göttingen and San Diego about data science and what we are planning to do in the future and Some of you might be aware that this is sort of a follow-up talk to something we did at the last C&I meeting Fall in Washington DC, but I assume not all of you have been there so in this Presentation in Washington. We were elaborating a bit that Since 2016 now there is a cooperation going on between Göttingen and San Diego because we found out that the University of California San Diego and the University of Göttingen especially the libraries have a lot of similarities in size in Focus group and the campuses also a bit alike And it was interesting to see how we do things differently and how we do things alike and to learn from each other And since then we started to Learn from experience. We had staff visits and the longer staff visits shorter staff visits. We're working together We signed even an MOU Where we agreed to work together look at different topics because We as you all know we know that science is a global thing and we have to be in this together And sometimes it's it's very good to learn from each other walking to each other's footsteps To see what people on the other half of the globe are doing and that can be very beneficial so While doing the Washington presentation We were very much focusing on research data management because in Göttingen we have a service unit that serve office specific services for research data management on campus and That's something that San Diego is also doing and so the focus of our Corporation in the first years was looking at day-to-day work in providing research data management services and infrastructures and But we also identified topics that we want to look Much further into into the next phase and these include data research analytics data science in general also more exchanges and More than just staff exchanges also student exchanges Researcher exchanges and so this is what we're doing now and this is what this presentation will Mould us be about so for those of you who want to know more about the research data management that we're doing I'm referring to the video from the CNI fall meeting. I remember all the presentations were taped and you can see David and I making a presentation in Washington and this could be also very interesting So without much further do I give over to David? Musical chairs here. Thank you. Yon and hi everyone. I guess I can say welcome to San Diego. It's nice to have you here Please go spend money Yeah, so as you guys mentioned we want to talk a little bit about the Environment in which we operate in our libraries and in our campus Going through the lens of data science machine learning AI a lot of the things that that we've been talking about at this conference and in our Our work almost every day and I think many of you are a certain large subset of you We're in this room for the last presentation by Stephanie Laboux So you will have seen Actualization of some of these things so we're going to be taking a step back and talking about the environment in which we're operating So just to give you and I know I said in the back I know it can be hard to read the screen So please do refer to the slides when they're available But just to give you a quick thumbnail sketch of how we got here at the UC San Diego side The data science program was founded in 2015 so not that long ago a less than a decade And for the numbers in the last year or so you will see we have grown both in terms of scale number of students as well As the size of programs the number of programs the type of programs You could fill in the black and we've probably doubled and tripled it and not surprisingly both for students And I think for researchers and faculty. This is an incredibly interesting major As Stephanie alluded to it's not just data science. It's computational Fill in the blank biology social science chemistry oceanography. They're all in this space somehow. Oh And as I said or as it says on the bottom of this slide the data science and this sounds obvious But just stick with me here the data science program is primarily focused on education and training students in data science a A few years after that and actually is in 2018 the Holyachew Data Science Institute was formed on our campus and named after our the primary benefactor who was something like Facebook employee number three Something like that and gave a bunch of money. We don't have a lot. We're very new universities We don't have a lot of alumni But we have one and so he gave money to found this Institute and this was very much a game changer on our campus Because it really took the work that was started in the data science program and brought it up to kind of that national research Kind of excellence doing this work kind of thing And so it cross-cuts a much wider swath of the university than you find just in the classrooms And does a lot of support for both not just education, but also research both fundamental as was well as the more high-end things And in contrast to the program it's generally focused more on advancing the field of data science through research and innovation So we've got kind of these two different threads intersecting with each other on campus And then we lastly have a library. This is Stephanie for the I don't know if she's even still in the room anymore but data science librarian not coincidentally hired right at about the same time and Not just hiring her but a lot of other services in the library We're either started or increased at that time So some of these will be kind of familiar to us in this space data and GIS lab a wide range of services Which I'll be getting to in the next slides We've begun bringing student data into our library digital collections. I'll also be touching on that in just a second And as Stephanie mentioned they have been doing a lot of research on what it means to be kind of a library and a repository in this space I do encourage you when the paper comes out to read it that she referenced And I wanted to emphasize again this kind of point in the library and this will come become very interesting when Baylor starts to talk We've really been focused on this kind of external facing Assisting students and faculty were appropriate being in the classroom were appropriate We haven't really taken a lot of this on board inward, right? And this is a place a lot of organizations are asking now like can't we do a lot of these Efficiency scaling and understanding a new process internally So we're kind of right at this point of asking some interesting questions and looking at some interesting purposes The next couple slides are an eye chart. I'm not even going to bother reading these But I just wanted to give you a sense of the kind of work that we have been doing in the library in support of students So first off just a quick gloss on the kind of interactions we have And I think you'll see they range everything from literally one-on-ones to sitting in huge classrooms of students Some of it is our long-term relationships working supporting people through projects capstone projects Graduate work all those kinds of things. So we've really spanned the gamut in terms of how we support students here We've created a lot of the kind of materials. I think you would expect ephemera going forward. We've got lib guides We've we help you either run or help sponsor a lot of events on campus Working through different Programs and project student groups on campus. We have a heavily support for a lot of our data science groups on campus And we've also been working on it as you saw in last presentation I'll be talking on just a moment of actually bringing student data data Data science student. There's a lot of data in that word data science student data into our data repository I was going to get there And then lastly and this is definitely emerging as the new graduate programs are spinning up Not just data science, but as I said computational biology fill in the blank We're trying to navigate. What does that space even begin to look like? Because these are rapidly developing programs It doesn't say it in here. I think it was on one of the earlier slides one of our data science graduate programs Is the very first one at UCSD that's can be first fully virtual So there's a lot of experimentation in this space having things like that I also wanted to talk and this will be my last slide But this is kind of one of our big next steps is Working with our repository. So we do have a data repository on campus that manages Well, that will accept any kind of data. It's a generalist repository for a campus What would it mean to actually bring some of these data science projects data? Outputs things people are using in the classroom into the repository and hopefully leverage some of those best practices that are Librarians are researching about and finding so this this is going to be the way that we begin to get in and implement some of these new steps But I just wanted to highlight this this new program called an educational data set service This is the library working hand-in-hand with both our campus research IT Who manages a large machine learning platform that all the students use for their classroom? Analysis and running code and all those kinds of things working with them as providers But then also working with our researchers in our faculty in the classroom to say what do your students need? Where are the gaps? What are they not understanding and There was a question in the last session about kind of that education of students for creating using saving Sighting all those kinds of things. This is going to be how we the library begins to address this that we say Okay, on the one hand, we'll take a bunch of data and make it available and tag it appropriately But we'd love to work with students going forward so that they can learn how to create the tags And they can learn how to do the meta they they can learn about licenses and all those kinds of things So this is actually a pretty exciting development for us And I think it's going to be the first step and some of the things you're going to see starting on the next slide Where we're going to begin to dip our toes into what does it mean internally to take on board some of these processes? And maybe even what does it make it mean to begin to work with these students directly in the library? These data science students of all different levels So again, that was a very quick gloss. I'm happy to answer questions either after or after the presentation And then I'm going to hand it over to dr. Gip Thanks, David Hello, it's actually my first time here at the conference. Nice to meet you all My name is Bella Gip and I'm from the University of gutting and in gutting this The university library and the university they work very closely together And I'm a faculty member of the computer science department, but actually for example our offices are in the state library So although we are computer scientists in our group. We yeah, work very closely with the library Of course also is with Jan and who's working there in the research department and we have joint projects and yeah, I'm very thankful that we have now this collaboration also is yeah, David and his team and Working together on interesting research projects mostly related to libraries and establishing this exchange program So in case you don't know gutting. I mean, it's not the Probably most of you haven't really heard of it. I don't know so it's in the very center of Germany As you see on the Google Maps, so it's really I think Yeah, maybe you know I think this only a few kilometers away from the most central point in in Germany and yeah on the right you can can see the library and we are located in the actually in the historical building and that's our research group and In the next few slides, I'd like to give a brief overview what we're working on so The core research areas are information retrieval so finding relevant information kind of what what Google is doing then Natural language processing is in research field that is getting more and more important for us Would say most of our doctoral students now work on natural language processing and we also work on Using blockchain technology for archival purposes So yeah, the foundations are data science artificial intelligence and information visualization So in natural language processing we Have various different Projects and I'm very curious what actually the first project is maybe where we collaborate But I mean you all know chat GPT and and all the possibilities that large language models offer And I'm sure that there will be some really interesting areas to color rate Another field as I said are Literature recommendations, so we develop recommender systems for literature and Developed in this area also some some algorithms and have a system to evaluate our algorithms and Another area is plagiarism detection, although I have to say I think this is soon an area of the past because with generative AI the whole Yeah idea of plagiarism I think will just change no one will just copy paste you will use AI to to rewrite something and but also identifying texts that were Generated using AI is something that we work on although. I'm not even sure whether that's something that we will need in the future because I Think you shouldn't forbid the students to work with Systems that yeah used generative AI to write text so it's a it's a difficult question and I don't really have an answer for that at the moment Another field is Intellectual property protection and we developed some Systems that use blockchains distributed ledger technology to embed timestamps in these blockchains Which can also be relevant for for archives and digital libraries And we work on analyzing news. So for example That you can identify media bias and and similar things. So these are the main Research areas and the last one math information retrieval. So maybe if you I mean, I'm sure you all have used Wikipedia whenever you see a formula displayed within Wikipedia that actually developed in our group this Representation so we want to make sure that You don't just have an image with a formula but actually the The formula can be embedded and that's something one of my group members actually developed and that is now rolled out in Wikipedia worldwide So yeah, many interesting areas for for research and I just wanted to give this short overview to show what we are working on and Also very important We as we plan to have this research and student exchange that Yeah, do you want to know you send this slide? Okay? Anyways, so we have this the idea that students from UC San Diego come to Göttingen and the other way around and They're very interesting possibilities, but not just for students, but also for Yes staff exchange and I don't want to go into all the details of the programs that also Financially supported because of course it also costs money But you can see here a barcode and then you can download the slides with the links So they're interesting programs from the DAD which is the German academic exchange Organization and they have some really interesting funding possibilities for that. So Yeah, if you are interested or know someone Please let them know and I think then I hand over to Jan again. Thank you So as I said, um, we now look back at eight years of of cooperation and so what we found out in the In the first phase that was of course very heavily disrupted by the COVID pandemic where we did not meet as much as we wanted to but we have been a benefit very much from from learning about each other because we saw that libraries all over the world they face very very similar issues and of course the way that the Libraries in the different countries and different domains deal with these issues are different Of course relying on the on the individual culture in the on the funding scheme on the financial situation of the library and But the general questions are all the same and we found out that it's very helpful to a See what are other people doing and also sometimes have invite people from other communities Especially other libraries as critical friends and let them have a look at what you're doing So we had colleagues from San Diego being at gutting in and we showed them This is the way we do things and what do you think about that? And that sometimes gives good insights from from the external perspective and vice versa We came to San Diego and looked at the way they were doing things and this was very beneficial with a clear focus on Services library services research data management services research data management infrastructures And now as a next step we saw that on this this whole Cooperation started from the libraries, but it's now slowly spread into the whole universities So it's no longer just the University of gutting and library and the UCSD library But it's now the whole universities the whole campuses that have an interest in learning from from the other partner around the globe So actually focusing on data science Firstly on the different topics that Bella was presenting as actual topics where his team can work with David's team and also the folks at the UCSD from the data science division and see if there is Interesting topics for for joint proposals. The other thing that we wanted to do is also help our cooperation by having individual exchanges and these are of course researchers exchanges But as Bella presented there are nice funding opportunities for students So we actually have good funding opportunities for California students to come to Göttingen and vice versa and it it actually fits a very nice gap and it's very the right thing to do because as I learned in the master program data science for the UCSD students in the first year of their master program, they actually have to make a project and it's mandatory and Bella's team and we at the library we have a lot of projects where people actually can work and of course We also invite California students to come and work with us and vice versa. Our students are encouraged to come to To go board for a semester into a different country and they have interest to of course to go into California So we believe that with this exchange of students we will bring our cooperation to the next level and Interestingly, there are funding opportunities by the German government for that. So that is something that we're now very much looking into and hopefully 24 25 we will have the first students being sent across the globe to the different country and then who knows Next spring CNI or maybe the fall CNI this time We can give you an update on how it has been and what has happened in the time between So that's all from us and we very much welcome you to give us some questions and give us some statements and see what you're thinking about So questions, please. I know there's a refreshment break, but I'm not quite sure if they already have I think they will only start Serving in eight minutes So the first part of your question how we how and when and why we met each other interestingly enough was at a CNI meeting All right, I know In NDC seven or eight years ago. Actually, it was the the previous library director Both from horseman at good to get in and yawn were there and it was literally the standard CNI Afterwards grabbing a beer and saying oh, you're working on that. Well, we're working on that, too We should work on that together. So it was literally as simple as that. Yeah, right But at the same time I think it was the group being open to not just having those discussions, but actually wanting to do something You know, I think that's something we're big fans of is Start Start because what just as an example what we have found out even more recently in this collaboration is On the UCSD side as we've been exploring for instance these new student exchanges There's a ton of funding opportunities out there, right both for students and for faculty and researchers that If we hadn't even begun to ask the question no one was going to tell us about and it wasn't until we could go and say Oh, we're interested in working on with this group on these topics on on these special things and the university said Oh, yeah, that would make us look better as a university. So yeah, please go do that So I think from our vantage point that was a big piece of it and in terms of how can you get these connections? Of course, there is no one-size-fits-all solution to that, but I would say generally if you attend international conferences you as David said you usually network and then you meet other people and and I think you I think most of you will have Contact with some colleagues from aboard and where you share just interest in the same topics and then it's a pretty Small step from widening this up and saying that why don't we let get our institutions evolved a bit And I think the most crucial part, but please correct me if I'm wrong is that you have to convince the superiors in your institutions to be actually open-minded to invite external people and take them Serious enough that listen to listen to what they are saying because I think that's the critical part I know that many institutions like to do the way the things they do things and then coming up with the idea Let's invite these colleagues from that country come over to spend for two weeks And we show them what we're doing and and they can give us an input on that and then they tell us what they're doing I think that's the start it needs to come up with ideas where you can work together But but being willing to to invite the people and listen to what they're saying I think that's that's the most crucial step, but apart from that the I Think inviting a visiting researcher over for two weeks come to our place Learn what we're doing is is pretty straightforward And I would say most researchers have some sort of funding for that sort of trip So it's just not many more money involved in that and then you can do the next steps next steps Okay More questions Okay, I seems we've blown you away Of course, we're proud of that but so and please we're here so don't hesitate come to us Speak with us directly if there are some of the topics of Bila that you were would have learned more about If you want to just exchange some ideas come over to us and we're here. Okay. Thank you