 All right, next up is the Genomic Data Science Working Group, and Mark Craven, council member, is a member of that working group, and he's going to give the report. Yeah, so this is a report on the annual workings of the Genomic Data Science Working Group, and like the engagement working group we just heard from, this one was formed in the spring of 2017, and it's considered a subcommittee of the council. We hold our meetings about every two months virtually, although in the last few meetings there's been some discussion about potentially having an in-person, more in-depth meeting at some time in the near future. So the key functions that the working group is charged with are firstly to provide advice to the NHGRI on various plans that relate to data science activities going on, to go into more depth than we can in council meetings. We also have discussions to provide input to the NHGRI on trans-NIH data science issues and discussions about a broad range of challenges that involve the intersection of data science and areas of genomics that range from basic science areas to genomic medicine. So this is the current composition of the working group. Over the past year, I served as co-chair with Christina Leslie, and we had that role for one year, and now coming into 2023, the co-chairs of the working group are Shannon Aguini and Anshu Okundaje. And then there's also two other members of council who are part of the working group, so Tim Reddy and Gail Jarvik. We have regular attendance by three NHGRI staff members, who are Eric Green, Valentino de Francesco and Helen Thompson, and they do a great job of managing us and guiding us and making sure that everything really runs well. So over the past year, there are a number of things that we focused on that specifically fall under this rubric of NHGRI active issues that pertain to data science. So they were talking about the ANVIL program and its renewal, the new data management and sharing policy that Eric talked about in his director's report, and then the new NHGRI Office of Genomic Data Science. So let me tell you a little bit about each of these activities. So late in 2021, there was a workshop that was held to discuss the NHGRI analysis, visualization and informatics lab space, also known as ANVIL. And to talk about the progress of this program to identify gaps, challenges and future opportunities and to think about what the next steps might be. So this workshop was attended by many of the data science working group members and then subsequently we had discussions about this program in one of our regular meetings. And so these discussions along with additional input provided by the members helped inform the NHGRI plans to renew this program. And one of the key things that really came out of all these discussions was the additional of a new clinical component to the program to really move ANVIL towards supporting the clinical research community. So another topic that we had a lot of discussion about in the past year was the new data management and sharing policy which went into effect just a few weeks ago. So we provided some feedback on how our institutions were preparing for the new policy and provided a number of suggestions and topics for the NHGRI and the NIH more broadly to consider. So among these were concerns about how to protect the privacy of research participants going forward with the new data sharing policy. There were discussions about institutional liability that might arise about long-term storage costs for data sets and about the potential for inconsistencies between the ICs and the NIH as a whole in implementing and enforcing the policy. So one upshot of all this discussion was that now that the policy has just recently gone into effect that the working group will reconvene soon and kind of discuss opportunities and challenges that have emerged as the policy is taking place. And I'll also note that we had a really nice informative discussion on this from Elena Ghanian who is the policy advisor for data science and sharing at the NIH GRI. Sorry, I skipped a hair slide here. So another topic at one of our meetings was a presentation on the new office of genomic data science which was established this past year at the NHGRI and is directed by Valentino de Francesco. So the mission of this office is threefold. It's to develop, promote, support and coordinate genomic data science activities across all parts of the NHGRI. Secondly, it's to serve as the coordinating hub for NHGRI involvement in trans-NIH, genomic data science efforts. And then third is to provide guidance for the NHGRI's implementation of NIH data sharing policies and to promote governance, stewardship and sustainability of shared data. So within its first year, Valentino gave us a nice report and told us about how this office has taken responsibility for a few extramarial activities and programs including taking responsibility for leading the data science working group for the Anvil program that we heard about for various data science and informatics resources including the mods, the alliance of genomic resources, galaxy, bioconductor, et cetera. So in addition to these internal discussions of NHGRI specific activities, we also had a number of topics we discussed that pertain to trans-NIH activities. And specifically these included collaborations between the NHGRI and the NIH Office of Data Science Strategy. The more broadly, NIH transitioned to cloud computing and specifically the STRIDES initiative and then we had a very nice report on genomics programs that pertain to Africa. So again, let me tell you about each of these briefly. So Susan Gregerich, who is the Associate Director of Data Science and the Director of the Office of Data Science Strategy gave us a presentation on the NIH Office of Data Science Strategy and their various plans. So a lot of this discussion focused on the STRIDES initiative, which is the NIH program to really push towards cloud computing to provide interface to commercial cloud providers and training in doing cloud computing. We also got an update from her on the NIH cloud platforms interoperability efforts, which are intended to really coordinate cloud computing initiatives across the NHGRI, NHLBI and a number of the other institutes. We heard about NIH discussions on streamlining and control data access and opportunities to improve efficiency and harmonization among various NIH repositories. We also heard about the NIH artificial intelligence and machine learning consortium to advance health equity and research or diversity, also known as AIMAHEAD and also the ODSS National Service Scholar Program, which is intended to recruit experienced data and computer scientists and engineers into the NIH to take on a number of these challenging problems. We also had a presentation specifically on the STRIDES program and how the NIH is encouraging researchers to move to the cloud. We heard about some strengths of the cloud as a platform computing and then had a lot of discussion as well as some of the challenges, such as just the culture change, controlling cloud costs, training for investigators to become adept at using this as a computing resource. And then a number of the working group members also provided some firsthand accounts of challenges they had encountered when transitioning their computing into cloud resources. We also had a whole session, as I mentioned, devoted towards data science and Africa initiatives. So we had a great report on the H3Africa program, which, as Eric mentioned, instructor's report began 10 years ago with the aim of developing a sustainable and collaborative African genomics research enterprise. We also had a report on the efforts that is known as DSI Africa, which stands for harnessing data science for health discovery and innovation in Africa initiative. And this initiative aims to leverage data science methods to really transform biomedical and behavioral research by contributing to the development of expertise among African scientists and to establish networks of African investigators. So in the coming year, a number of topics that are on the agenda for the working group to discuss are the following. So one is to have an overview of the NHGRI portfolio in data and informatics resources and research. There is interest in having discussion about phenotypic data collection sharing across various NHGRI programs to talk about analysis, tool dissemination, and training. As I mentioned earlier, to again revisit our discussion of the data management and sharing policy and to talk about data science training more broadly at the NHGRI. So with that, I'm happy to take questions or welcome additional comments. And I guess I would also ask Valentina and Gail and Tim to chime in if they have additions that they wanted to make on this report. Questions or comments from Mark? Gail. Well, just to say, I think that was an excellent summary of the work of this group. I really appreciate your attention to the details there. Thank you. Actually, Valentina and Helen get a lot of the credit, maybe most of the credit for that. So let's thank them. Thank you for crediting them. OK, thank you, Mark. Let's move along.