 I'd like to start with an acknowledgement, I'd like to acknowledge the traditional owners of these lands that we're meeting on today and pay my respects to their elders past, present and emerging. You're probably quite familiar with this now but for people who might be joining who haven't heard so much about our programs here in ARDC, the National Data Assets initiative is our overarching initiative and it was set up to establish strategic partnerships with research communities and institutions and that was to develop portfolios of national scale data assets to support leading-edge research and hopefully that that will lead to long lived data assets that can be used to leverage existing research and administrative investment. What we think about as underpinning this initiative is that we can think of collections of data as being national research infrastructure because they support leading-edge research and then national and scale. The Australian data partnerships program is one of five current programs currently running under this National Data Assets initiative and it's our largest and most broadly focused and it aims to create or develop high quality data collections that support leading-edge research and a national scale. So this program we were thinking about data from multiple organizations, data that's consumed by multiple consumers like researchers, policymakers and NGOs and governance arrangements over that data that include multiple organizations. Strategically we wanted what we wanted to do here as ARDC was to support establishment and development of those assets where what was lacking was a force of competitive and uncoordinated where the force was lacking that drives this in the market and so we wanted to step in and fill that gap and enable that emergence of those national scale data assets. So that's what we're trying to do here. Now, so what I had to remind you of is we've got short presentations, a series of them here. We've got a little bit the first half hour sort of assigned to doing that. Following those presentations, we're going to pause and have some discussion. What we're going to do is run something called Mentimeter after our presentations and we've got Kerry here helping with that and she's that's going to help us focus on some of the issues that come out of that are common that come out of these presentations and across the projects. The things that are problems or needs or gaps that are in common and we're just going to do a bit of a poll around that and hopefully have a little bit of discussion about some of those commonalities and then we'll wrap it up. If I'm going to keep a bit of an eye on the time, if you're running a bit over, I'm going to try to ring my bell and if you hear a bell, if you could just wrap up what you're saying and bring it to a close and then we can move on to our next speaker. All right, so we might move on to our first speaker and that's Michael from the University Queensland and he's going to talk to us about the language data commons of Australia. Over to you, Michael. Thank you. I also wanted to acknowledge the traditional owners of the land from where I'm speaking, which is the younger people's near the Brisbane River and to pay my respects and our respects half of the whole project team to the ancestors and the custodianship of the lands of which we meet. So the project about which I'm talking today is the language data commons of Australia data partnerships project. We call it in full to help us distinguish it from other bits of the border, Eldaka initiative that's underway. So first, I'd like to acknowledge all the partners, collaborators in the project. We're working with people in data collections and IACIS researchers at Australian National University, the Aerosene Centre of Excellence for the Dynamics of Language, University of Melbourne, Monash University and Paradiesic and also the University of Queensland Brown based. So I think I'm ready to make a slide. So I guess the first thing that I always say is we often think of Australia as an English speaking country and well, yes, but no, it's actually this this hundreds and hundreds of languages spoken here. And I hope through the establishment of language data commons of Australia that will help raise awareness amongst Australians that this is actually a massively multilingual society and a really linguistically diverse region of the world and it's something we should be proud of actually and support. It's a beautiful thing. So there's more than 300 different immigrant languages spoken in Australia. We consider English to be an immigrant language as well, in addition to hundreds of indigenous languages, which are varying states of kind of people talk about the health of the language. So there's about 15 of them, which we consider to be really healthy languages. And so part of the mission is about gathering this data together and making it accessible for communities to work on for reviving and revitalizing the languages. Australia is also home to some of the world's oldest continuous cultures. So it really isn't a young country at all, even though that's something we sometimes think. The thing about language data in Australia there's actually been some really great work among some of these partners, including Paradiesic and Aniotsis, where they've actually started to build up really excellent collections following international best practice. But there's still a lot of collections which are underutilized and even at risk as well. So our mission is to start trying to build infrastructure to bring these things together. So what is our aim? So we talk about it being federated discovery and access to language data collections of higher strategic importance for the Australian research community. So one thing we try and make clear is it probably should be called the language data research infrastructure commons of Australia in a way because the data itself always belongs to particular institutions. We're not suggesting that we slurp all of this data up into some common kind of pool where it's a freefall to access it. It's really about building that common structure, research infrastructure. So things like building a governance framework where it's shared, where we look after the infrastructure, but also how we access this data. So there's a lot of variation across different institutions and how we do it. And there's a lot of cross learning, cross colonization. We think that can happen through this community. We're also trying to build shared technical infrastructure. One of the things we mean by that is that the underlying data structure is the same. And some of it involves migrating it into a common data infrastructure, which aids when you're trying to build services to use that data across. That helps with that. We also want to build a portal that starts to help discover the data. So a good example of that would be if you're trying to work, revive particular indigenous language in the community, you might find there's archival data. Some of it might be held in IAASIS. Some might be held in Paradisic. Some of it might be held in university libraries. Some of it might be in state libraries. Some of it might be in Trove. It's really a big job to try and discover because you don't know what you don't know. So it's about bringing that together. So I go to the last slide. So overall, AIM is really starting to lay the groundwork for sustainable long term repository. And we like to talk about it being democratising access, not only to researchers, but also to communities, because this is really important to everyone in Australia. Thanks. Thank you very much for that, Michael. Now we'll move on to Tim from the University of Western Australia. He's going to talk about enhancing fish and shark image data. Thanks very much, Catherine. Yes, so thanks very much, everybody. Now, I'm actually joining you from a boat today, and I apologise for any noise we've got going on in the background. We're out here actively collecting fish and shark imagery data for Marine Park Reporting. We're actually out here in the middle of the geographic marine. We want to go to the first slide. Thank you. Imagery is being used increasingly around Australia and the world. And Australia is a global leader in collecting this imagery of fish and shark assemblages. And it's been shown to be really important for environmental reporting, the data that comes from this. But there's a real need to increase the fairness of this data. Most of that is about really making it available, making it available for things like automation, automation projects that are out there to automatically identify some of these fish and sharks, and also to make it available for environmental reporting at both local, national and global scales. So one example of this is a current synthesis to be working on with many partners around Australia of 22,000 samples or so around Australia. And one of the really interesting things in this data set and also in other data sets being collected around the world at the moment is that the sampling methods are pretty standard and the annotation methods are pretty standard. But it's what happens to the data, the curation of the data that hasn't been standard. Next slide, please. So what we're proposing to do as part of this is to try and make a better place for all these data sets to be stored and to sort of improve the value of this data set to make it more reusable is to implement, you know, standard metadata and standard ways of referencing the data sets. And then as part of the work we're going to do with the current fish and shark data community, make this available for initiatives like the annotation of automated annotation of the imagery, make the data available as training data, both the imagery and the existing annotations to sort of help those initiatives. And I saw Rod Connelly here before, which is exactly the person that I actually need to be working with and talking to. But then also make them available, as I said, for for summaries and synthesis so we can go to the next slide. I think. Thank you very much. Yes. So already the work we've done so far has led to two papers put into publication for helping us to assess how marine parks are doing around Australia. That's one of the top left for assessing state, territory and Commonwealth Marine Park performance, if you want to have that kind of an idea. And we've also had some of the data sets submitted already towards the new state of the environment reporting report. At the moment, and also recently contributing to some global studies as well. Because as I mentioned, Australia and it's a certain part of New Zealand has been leading the way in developing these methods and they're now spreading around the globe. And so in our work here with this fish and shark data data program and also using the global archive platform that we built with previous RDC funding, we really think we can actually show people, you know, a good workflow and a good way of how to look off these data sets and make them more readily available around the world, not just in Australia and therefore give Australian researchers a great competitive advantage and get invited on to international grants and publications. Thank you. That's everything from me. Thank you so much, Tim, as a very authentic presentation. And I think we won one an award. We stopped the we stopped the boat just so I could do this because I was too noisy. Fantastic. Thank you. Now we're handing over to Nick Buckley from the University of Sydney. Nick is going to be talking about the poisoning data set. Yeah, so this is Rose Cairns as the project lead on this. And I'm presenting it because Rose is off on maternity leave. I think she got pregnant about the time the grants went in. So we could go to the first slide. But basically this is about a national poisoning data set. And many people. Sure, why there's a blacked out reach in there. But many people don't appreciate how a major cause of injury poisoning is. So it's larger than the road toll and its effect on life expectancy. And must be a cardiologist has blacked that out because that's showing that fact it has the same effect on life expectancy as cardiovascular disease. And there's been a big increase over the last decade that we don't really understand. And our routine data collections done by, say, AIHW use this coding system that actually doesn't tell you what any of the agents are that people have ingested. And the poison centres provide a lot of coverage. So 200,000 calls a year take about poisonings where they're precisely recording lots of things. But this data has largely just been used for clinical purposes. So generating a data set that can be used very easily for research is trying to achieve with this initiative. Next slide. So we've got four different poison centres, plus potentially some other sources of data on poisoning. And we need to harmonise all the different data fields so that it's all comparable. It's all collated in one location. There's been multiple databases in the past. So it's not just the full databases. So we've got a whole lot of historical data to bring in. Data governance is a key issue for us because of privacy considerations around isn't fairly sensitive data around suicide, self-harm, and drug abuse. And there's many other things that they're doing to make the data of good research quality. And they can jump forward to the impact. And so basically this summarises the whole project that people can read that. But basically there's a lot of data comes in. So there's calls from the public, but there's calls from hospitals, ambulance services. There's more calls about adverse drug reactions to the poison centre than there are to the therapeutic goods administration. There are more calls about drugs and pregnancy than there are to most drugs and pregnancy ones. So all that data can be collected into a research data set. And therefore, because it's this very, very big data set on a big problem, there's all sorts of people who are interested in this. From those people who are interested in suicide prevention, to product safety, to just general injury surveillance, to identifying drugs that are dangerous, to following initiatives to curb substance misuse, such as real-time prescription monitoring. Everyone is interested in denimation. So I don't know whether you can pick up box jellyfish with your fish scanner, but you know, we could tell you if it matches up with when we get calls about them. Occupational health calls, again, there's a massive number of those, far more than are reported to occupational health agencies. And then we've got clinicians who are interested in improving processing management. That's my three minutes. Thank you, Nick. We'll pass over to Rebecca Glowart from Telefon Kids to talk about the National Child Health and Development Atlas. Thanks, Catherine. Sorry, I'm a bit croaky. I'm just recovering from a cold. So if I start coughing, that's why. So this project is called the Australian National Child Health and Development Atlas or ANSTA, which is a really catchy acronym. We're currently working on a bit of branding. I haven't got everyone's logos on here because there are in excess of 25 partners. So that's why we're kind of working on a bit of a branding project at the moment. Okay, next slide. Thanks. Okay, so what is the problem? Well, we know that data on children and young people and families are held in a multitude of locations across the country. Government tend to work in silos and they collect their data in silos. However, in order to really look at the health and development and wellbeing of children, we need to put those data sets together to get a really good picture of how the children and young people in Australia are actually faring. We can't make effective and efficient evidence-based decisions, programs, research on children and young people unless we actually have a really good picture of their health and wellbeing. There's a number of groups across the country who have been working on bringing data together on children and young people and demonstrating it in a visual form. So basically geographically mapping data on children and young people for their specific states or territories. And we thought, well, it's a perfect opportunity for us to actually bring everything together, bring all the partners together, look at what everyone has been working on and pick out the best parts so we can actually create a single portal or hub for visualizing data on children and young people across the country. We really need an Australia-wide solution rather than state-based. So next slide, thanks, Catherine. So as I said, we've got people from across the country. It's a fantastic group. We've been working, we're going to be working to curate the data. Look at what data is common across those groups and also what data is available that each group doesn't have individually. And we're going to hopefully have that visualized so geographically information, use GIS technology to display that. So you can see this image here. This is what we've done in WA with the West Australian Child Development Atlas. So we wanna do something similar, take the best parts of each different platform and bring them all together under a single banner. Next slide. Thank you. So look, we think that there's such huge impact, potential impact for this project. It's actually really exciting. There's been a huge amount of excitement across the country actually since we got the grant. There's been so many people working on trying to bring data together in different ways and this partnership is so strong and excited. So we know we think that we've significantly increased the evidence base for decision making by having one single location where we can access a huge range of data. We can look at trends over time. So a lot of the data sets that people hold in government agencies go back 10, 20, 30 years. So you can look at changes over time. We can look at the impact of environmental factors. We can look at emerging trends and monitor the impacts of things such as COVID-19 across the country. And we can actually see where things are going well in particular states or where they're not going so well and look at the impact of different policies and programs on children's outcomes. So that's it. Thank you, Rebecca. We'll hand over to Sebastian Mantini now at the University of Tasmania to talk about in situ wave observations. Yes, can you hear me okay? Yes, perfect. So yeah, I will talk about the development of a national infrastructure for in situ wave observation. So you can go to the next slide. So wave data are critical to ocean industries, coastal development and leisure activities. But unfortunately in Australia, the landscape for data collection and data delivery is quite fragmented. So that's why I put this slide of the puzzle one. So there's a lot of different organizations that are collecting data in Australia. So primarily it's the state agencies but also private industry. All in gas, for example, they collect a lot of data around Australia but also more recently, IMOs, so the integrated marine observing system has started to also deploy web both. Part of the problem is that there's a lack of, some of this data is publicly accessible but a lot of it is not actually publicly accessible. So it's hard to get an old on. There's a lack of a grid data and metadata standards across the different groups that are collecting the data and publishing it. So make it harder for the user to really compile all the data together. There's really a lack of documented QA, QC practices across the different groups. And there's a need from users for additional data product, not just the usual time series of the wave height and the wave period, but more in-depth product that you can derive data from it. Next slide. So the solution is that sort of a summary diagram here. We will build upon an infrastructure that we started building in 2018 from another project funded by RDC. And so one of the main outcome really of a project would be to publish new data set collection. So we've engaged with additional partners in there to put new data into the system. So some of the costal data collected by state agencies, data from the Victorian government that is really lacking at the moment that they are deploying a lot of web-boy at the moment that we want the data in the system, but also engaging with private industry and really trying to crack that to see if we can get data that is collected by all the gas companies to try to see that it can be put together with the rest of the data. The team will work on data standards, reaching out to international colleagues to see how can we define data and metadata standards. We want to improve our workflow. So we're going to work with the Bureau of Meteorology to provide real-time data and a proper workflow to lower the bar of entry for new people to come and put their data into the system. Yeah, next slide, I think. And so the impact is that wave data is required for many societal applications, so such as safety at sea and on the coast, understanding costal erosion and for leisure activities, such as surfing. Wave information is routinely identified as a primary requirement of blue economy, industries and coastal manager. So it's really important. In the offshore industries, wave data are used for design, engineering requirements and for operation at sea. Wave data is essential for calibration and validation of satellite instruments, especially in the last couple of years and in the future, there are many more satellite mission with more instruments on board that are going to collect data at a high resolution and you need for the institute observation to calibrate it. It's also very important for calibration of wave model, especially in the local area where you want to look at coastal erosion, impact of inundation and erosion and things like that. And finally, surface wave has been identified by IPCC as a key driver for coastal zone stability. There's a fundamental role in modifying exchange of heat, mass and momentum across the IRC interface. So it's really key for climate application as well. That's it, I think. Thanks. Thank you. Thanks very much, Seb. We'll move on now to our final speaker, Jessica from CSRO, talking about human mobility data asset. Hey. Hi, everyone. I hope you can hear me okay. I had to rejoin from a foreign because my laptop decided to shut down. So can you hear me okay? That's good, Jess. We can hear you. All right, so yeah, I will give you a brief overview of the Mobility Australia project, which is a collaboration between the University of Melbourne, Queensland University of Technology, the Australian Urban Research Infrastructure Network and the Commonwealth Scientific and Industrial Research Organization. All right, so what is the problem? Human mobility data is a key ingredient to further research in many different areas of science. And the current outbreak of COVID-19 is an example that has demonstrated that human-spread infectious diseases really rapidly. Now Australia lacks a comprehensive human mobility data set that can be used to model such diseases and how they spread and to ultimately prevent and control possible future outbreaks. However, there's an abundance of high quality public data available, reporting different aspects of human mobility. So a few examples are the number of international arrivals into Australia, passenger volumes on domestic flights and information about the usage of the Australian Broad Network. Now these data sets are collected by different organizations and government departments and are presented in various formats. Sorry, the data might be collected at different spatial and temporal scales and be presented in the form of say passenger counts or aggregated movement floors. So if you can go to the next slide, please. So this project addresses the challenge of transforming available mobility data into a consistent format that is then suitable for analysis in a broad range of research areas. So merging these various individual data sets into Australia's first comprehensive national scale human mobility data asset drastically improves the quality and coverage of existing data sets. And the final data asset will also be made publicly available and hosted on the Australian Urban Research Infrastructure Network. Users will then be able to download origin destination movement floors at different spatial and temporal resolutions through an API. And then the next slide. Sorry, the final data asset has the potential to create impact in various areas of science such as epidemiology, policy and administration and also transportation. For example, understanding the dynamics of human movement is paramount to understand disease spread and to ultimately contain outbreaks in a timely manner. In addition, human movement data can be utilized to evaluate the effectiveness of disease containment measures and inform new testing on containment protocols. Another area where this project can create impact is transportation. Australia's transportation sector forms the second largest source of carbon dioxide emission. So to reduce Australia's carbon footprint it is necessary to gain insight into how humans move across the continent. And availability of this comprehensive national scale mobility data asset that we will be creating will enable the detailed investigation of the impacts of human movement and can then in turn inform the planning of sustainable transport networks. And that's it from me. Thank you so much, Jess. That was terrific actually everyone. What a great diversity of projects. We're doing really interesting things across a huge range of disciplines.