 Great. Thank you. I also want to extend my welcome to everyone. We're really excited to have you here Have our NHGRI colleagues as well as NIH colleagues here and people who are watching and webcasts We're very excited about this workshop and looking forward to the discussions over the next day and a half Eric actually covered a lot of the points that I wanted to make and So I hope we can go through this fairly quickly and catch up a little bit on time So I wanted to go over a little bit more about the purpose of the workshop and background about encode and a little bit more detail about The planning process for the future. So as Eric mentioned encode is ending in This current phase is ending in 2016 And so we really wanted to bring together the community to get input on what hdr I should support in the area of functional genomics that does build on the success of encode and And so I think Eric made this point already But I think it's worthwhile repeating is that the functional genomics really is central to realizing NHGRI's Goals we know that non-coding DNA is important for disease and gene regulation We know that about 90% of GeoAS findings lie outside of protein coding regions We know that non-coding DNA variants are can cause human diseases and alter human traits and about 80% of recent adaptation Signatures lie outside of protein coding regions So we feel that comprehensive functional information about the human genome is really critically needed for the interpretation of genomes Understanding genetic variation and applying genomics to the clinic Little bit more background about encode that workshop was July of 2002 actually Which is about a year before we had? The next the strategic plan that came out with the completion of the human genome sequence So the goal is to compile a comprehensive Encyclopedia of sequence features in the human genome and then the genomes of model organisms and to make the resource freely available to the Community to enhance our understanding of the regulation of gene expression and the genetic basis of disease This slide Shows the the timeline for encode as Eric mentioned we started with a pilot project is focused on 1% of the human genome In 2003 and at the same time we initiated the first of a series of technology development efforts We felt that technology development was really a critical part of what we needed in order to meet the goals of encode Then based on the success of the pilot project back in 2007. We launched the first production phase and that was When we went from 1% to 100% of the genome Focused in human and then at the same time we started the modern code project, which was focused on creating a resource of functional elements in the C. elegans and jesophila melodic gastro genomes and those were in concurrent from 2007 to 2012 and Then in 2009 with some money from the economic stimulus We were able to launch a modest effort in mouse encode that also went through 2012 so as we're nearly nearing the completion of 2000 of these projects in 2012 We recognized that we still had a long way to go in completing the catalogs And so we initiated the encode phase 3 which we're in right now. We started in 2012 ending in 2016 That was focused specifically on human and on mouse and then we also Funded a number of computational analysis awards because we felt that We really needed to bring a lot of expertise to the question of how we're going to be able to interpret all of this data and understand it and also in 2012 we we launched the The fourth initiative in technology development Just very brief Overview of the encode data slides. I think Mike Snire is going to go into this maybe in a little bit more detail So this is a cartoon that shows that encode employs a series of high-throughput technologies that are primarily sequence based to measure a number of different biochemical Signatures in the genome including transcription DNA methylation histone modifications DNA's hypersensitivity and the data that is coming from Those assays are used to map genes and transcripts and identify candidate promoters and long-range regulatory elements such as enhancers This slide outlines the structure current structure of encode Which is a fairly heavily managed? Consortium we have a number of data production groups and they deposit their data into the data coordination Center, which does quality Evaluation uniform processing it stores the data and makes it available to the community the data is then taken up by the analysis working group which is supported by a data analysis Center and They generate gene models chromatin states and identify candidate elements that really comprises encyclopedia the computation analysis groups work together with the Analysis working group and the technology development groups are are generating or exploring obtaining information about how to Find new function elements how to validate them and how to analyze the data So I just want to highlight a couple of the features of Consortium as Eric said this is how we run this program and some other programs. This is one model There are a number of them the Consortium is really highly collaborative and synergistic efforts that benefits by a lot of exchange of information and ideas within the group There's a misted mix of investigator initiated and top-down directive projects as I mentioned There's focus management coordination for example the data production groups have quantitative milestones and we bug people for Quarterly reports on progress towards those milestones this generation of high quality data I think Mike's going to touch on this a little bit more they use cost effective and high throughput methods And I think what's key here is really being able to take advantage of the economies of scale This is a focus on the resource generation for the community and we work to actually balance the needs of the individual Researchers allowing them to publish on their own data But also getting the data out very rapidly to the community and we've been developing data standards in common data formats and Using common cell types in some cases to facilitate data integration and analysis and this also provides transparency about data experiments and data quality Just want to give a few highlights of encoded commas again Mike's going to go into this more detail, but I thought it was important to Recognize a few of these at a higher level. There is Rapid release of Thousands of experiments. These are again as I mentioned high quality and well documented They're uniformly processed and these processing pipelines are available on the cloud We developed and disseminate out analysis tools actually the data coordination center has been taking the lead in data interoperability and developing working with other projects to create to work with common Metadata standards so that the data from different projects can be Understood well from one to the other And code has updated and informed consent language for open access to genomic data And we are working to obtain Samples with this consent so that data is out and not not put behind In dbGaP or and so people get access to the raw data They've been a series of publications and we're most gratified to see that the data has been used by the research community in Over 700 publications, which is probably an underestimate because it's hard for us to track all this And these publications include disease studies that help identify causal variants So as Eric mentioned and code is not the only project that we support in functional genomics And a sure I also supports genomics gene regulation and fun var The NIH common funds but supports every genomics as well as the GTX and comp projects And the the management of GTX and comp also Our house at any sure I and there are international projects such as the international human epigenomics consortium and Mike Pays in Mike Snyder and Dan Gilkerson are going to tell you more about these a little bit later today So I think we've been through this with Elvis briefly mentioned the objectives are of the workshop or to discuss the scientific Questions and opportunities for better understanding genomic function and applying that knowledge to basic biological questions and disease through high through large-scale genomics efforts and considering what projects That any sure Isis should support in this area When we have these discussions, we want you to consider a couple of points The main areas of discussion are going to be on continued data generation How any sure I can enable basic biology and disease studies for the really the rest of the world to be Conducting and underlying all of us. I think we'll be discussions about technology development We're looking for bold new ideas that any sure I can catalyze I'm just a word of caution. We're looking for fairly general advice at high level We don't want very detailed advice So we don't want to put anybody in conflict for future funding opportunities and again think about what what is appropriate for NHGRI versus other funding Institutes and agencies as Eric mentioned NHGRI has a long history of supporting resource generation and dissemination technology development implementation of new genomics technologies and approaches we do Support a lot of research consortia when a high-level coordination staff involvement is appropriate But as Eric mentioned, we have you know various models for this which has varying degrees of coordination and management and We also do support investigator initiative research as well as training and SBIRs We want to give a brief overview of a framework of the agenda to Get everybody oriented We're going to start with some introductory talks to set the stage First talk from you and Bernie about defining the scientific challenges Then we're going to hear about these series of related projects Then we're going to hear from two proposals for future directions One is the current encode PI's vision for functional genomics as well as recommendations from the sequencing workshop that Eric mentioned specifically integrating genomic variant discovery with genome function and these are really just ideas that the community has already put forward that really are just starting points for for today's and Today and tomorrow Then we've organized really the rest of the agenda focus on these three topics The first is on identifying and characterizing functional elements The second is on using genomic acids of function to interpret the role of genetic variants in disease and the third is on using genomic Assays of function to study basic biological questions, and we do recognize there's going to be some overlap in these discussions And that's fine. We want to see where where these intersections are and we also want to see what's unique for each of of these topics You should have gotten a handout for the discussion questions for topics one through three if we don't we'll Should be more at the registration desk. I'm not going to go into these, but we've established eight Questions that are going to be used by the moderators to Help frame this this discussion and then tomorrow afternoon. We're going to have overall discussion on looking at cross-cutting ideas. What are all the different Ideas that people have for programs what the priorities are and what might balance of activities being So the next steps were in the middle of the the planning process right now We have in this planning workshop and then as Eric mentioned We're going to be taking one or more concept clearances to be reviewed at the May advisory council And then depending upon that the outcome of that discussion will have potential funding opportunities to be released in this this coming summer 2015 with review and funding in 2016 and we do actually have four council members here Eric Borowinkle, Carol Bolt, Joe Ecker and Jay Shenduri and so they will be able to represent These discussions at the council meeting as well Okay, I think Eric already went through this so we have the organizing committee He's already thanked him. I will also add my thanks as well as thanks to Sandra Romberg from Capital Consulting Corporation The encode consortium of course. This is a picture from last year's Consortium meeting was about 180 people and this Saturday we're going to have this year's Meeting and I tell you won't look quite this nice will be out on Long Island And we're hoping there's not going to be a lot too much snow on on the ground and Eric mentioned the encode external consultants panel Eric Borowinkle David Kingsley Dan up here Paul Sturberg and Ola Troy and Skia and everyone except I believe David is going to be here At the at the workshop and then a final thanks to NHGRI staff my colleagues working in encode Mike Paysen and Dan Gilchrist Adam Felsenfeld from the large-scale sequencing program and Division director Jeff Schloss have really been very helpful in organizing this workshop as well as our program analysts Hannah Norton and Especially a shout-out to Julie Corson who is taking the lead in organizing this workshop for us Finally, I hate handing on such a Monday note, but we do have a number of housekeeping items to mention Reminder we are webcasting this so please use the mics which Eric remind us of We we should have received a Google an invitation to join a Google doc or have access to the Google doc for meeting notes This is meant primarily for the organizing committee NHGRI, but everyone who all the participants can have read and comment Access to to this document The capital consulting corporation staff is on site if you have any questions will be receiving Box dinners delivered late this afternoon if you haven't ordered one during the coffee break You can go upstairs and there's a cafe open and you can pick up some some snacks for later We'll have a shuttle today at the end of the session going to back to the hotel at 9 And then picking you up again. Sorry. It's early tomorrow morning at 7 or to get you through security and get coffee and some some breakfast You should order your transportation to the airport either through the hotel or see the capital consulting corporation staff before lunchtime Tomorrow to sign up and please don't forget to send your reimbursement vouchers to the capital consulting by the end of March So on that note, I guess Mike do I have time for if there are any questions? They won't have any questions Not I think I'm probably just going to turn it over to you in It's going to give the next talk so we can get back on schedule so it is really a pleasure to invite you and back to the encode discussion and We've asked him to give a talk on defining the scientific questions really setting the stage for the