 So welcome everyone. My name is Geri Ryder and it's my pleasure to introduce our presenter today who, like me, is joining webinar from Adelaide. Dr Anita Smyth is a researcher whose current role is as data facilitator with the TURN, or Terrestrial Ecosystem Research Network, Eco-Infamatics facility based at the University of Adelaide. Anita's research interests are around the wildlife ecology of Australia's arid and forest landscapes and as you'll soon realise she's also quite passionate about research data, including its role in developing research collaborations. I won't preempt Anita's presentation though, rather I'll hand over to her now to share her perspective on using research data to raise your research profile. Thank you Geri and greetings to all. I'll start with the question can sharing data raise your research profile? Now to most researchers the answer to this question is obviously yes. Researchers have been sharing data amongst their trusted networks for centuries. These collaborations have resulted in many scientific publications and have also raised many profiles, some to iconic status. For me though, a more relevant question is can data publication raise your research profile? I believe it can and it can do this in the same way as paper publications. Now before I get into stormy waters about using the term data publication, I acknowledge that data sharing may be more appropriate term in some research areas but in my field of research data publication makes common sense because it aligns well with the culture of paper publication and definitely makes for easier communication. For most researchers data publication is new to our scientific workflow and it's a juggernaut that's not well understood. I hope my talk today will go some way towards closing the gap and perhaps motivate you to publish your valuable data. I start my talk by describing a typical scientific workflow to illustrate the key role data plays in discovery and new knowledge. Next I'll talk a little bit about scientific impact and its metrics as scientific careers are built on them. I will also present some helpful hints on how social media can raise your profile even further. Now up to this point my talk will be using paper publications as a springboard for achieving impact and raising profiles. However open data publication offers a new opportunity for increasing impact so I'll describe how this can be achieved with examples and then I'll close the talk. My field of research is ecology and it's a diverse research area. For over 15 years I've studied biodiversity and interactions of living organisms with other organisms and their environments in Australia. As Gerry said I'm passionate about data. I've collected it, analysed it, modelled it and published it and have written papers and continue to do so. Now I'm on my third career path as a data facilitator at Tern's Ecoinformatics Facility. Tern is a terrestrial ecosystem research network which is a capability with Australia's national collaborative research infrastructure strategy. It basically is involved with building infrastructure and networks to enable long-term collection storage synthesis and sharing of ecosystem data. It also is doing something really novel in especially for our field of research and it's actually collecting science with policy and management. If you want to find out more information about Tern please visit our headquarters website there. Okay I'm in the Ecoinformatics Facility and in the interest of declaring conflicts of interest our facility specializes in the business of building and producing online data publication services for the deposit and open access of high quality well-described ecological data. As the data facilitator my role is to engage and build relationships with data creators and users to enable them to use data services that meet their research needs in the long term. Now I appreciate that many of the work, many of you work in other disciplines so I've tried to make my talk as generic as possible. Based on a scientific paper productivity ecology is a notable research field in Australia as shown by the brown circles here and also it has a global presence. So I hope that some of the research experience, my research experience especially in this field will resonate you as I go with you as I go through this talk. Now in ecology there will be researchers who create or collect data and they will also use that data many times over for different research questions. We call this group primary data users. Now a typical workflow for this group is actually shown here and as you can see there are many steps involved that are to do with data. For us researchers data is the currency of our science. First we start off with identifying the problem, design the study. Once we have that design we've probably run a bit of a pilot we know it will work. We then will acquire the funding. When that's forthcoming we'll collect the data, we'll look at that data do a bit of quality assurance on it and then that data will be stored and most commonly in our field it's stored in Excel spreadsheets. Once that's done we conduct the analysis, interpret the results and then we will publish a manuscript or a paper in a scientific journal. Nowadays though when the data can also be stored in an open access data repository and often this can be a condition of actually publishing the scientific paper. An example of this anyone who's hoping to publish in the PLOS journals they will require you to submit your data with the manuscript and they have a data policy as shown here. There are also another group of users, researchers in my field and these people they use other people's data to answer research questions. We call this group secondary data users. Now between both groups there's a lot of cross fertilization and as you can see here from the workflow of these secondary data users there's some commonalities but there are also some key differences. First of all they need to find, acquire, assess the suitability of the data of data sets from other researchers and if it has a limited degree of suitability or it needs to be modified they will go and do that and then go back and keep going through this cycle of search, acquire, modify. Now this aspect of secondary data users workflow can take up to almost 70% of their time of their research effort. Once they do that then they follow through the other steps which is similar to what primary data users do. Now this group are potential collaborators with primary data users as they can raise profiles through data citations and providing obviously your data suits their needs. Also on the other side of the coin secondary data users also have responsibilities with respect to primary data users in that that we have a responsibility to acknowledge the value that primary data users have in terms of their data. So there's a dual responsibility and there's dual benefits in terms of collaborating across these two groups. So in summary researchers have strong relationships with data it drives discovery and certainly our scientific publication. Achieving science impact and raising our profiles is important for researchers because some publish prolifically and it's getting increasingly harder to get noticed. Here we have a universe of scientific papers published in the physical sciences and as we drill down you can see how difficult it can be to get noticed in this massive paper publication. Here's an image of oops I'll just go back another reason why impact is important. It may lead to an iconic profile and a successful career. It increases employment opportunities, grant success, professional accolades and it lays the path for prodigious legacy. It also helps to pay the mortgage as with a higher impact, higher research profile you're more likely to have long-term employment. So now that we have a better understanding of why researchers place so much importance on raising their profiles through impact the next step is to try and understand how can this impact be measured. For a lot of us researchers this is probably secondary or secondhand knowledge but for those who are new to it it will be hopefully of interest to you. Traditionally impact is measured by personal citations based on scientific paper publications. Here we have an example of personal citations which have been measured by the H index and it was compiled in the form of I guess a football league table for UK astronomers and as you can see here there's those who published a huge amount of papers and they are actually in what we call the champion leagues and then there's others which are sort of tapering off down here and most people I guess would have publications in this group and some of them are not even in the league. So there's a lot of emphasis just to generate a table like this shows the importance of personal citations to researchers. Another way of measuring impact is through journal rankings. A common one for us in our area of field is nature which has an impact factor of 38 can be one of the highest in our field of research although I imagine there's higher ones in I think the cell can have a much higher one than this. These rankings are published by data by citation indexing organizations if you want to follow them up. Another way of measuring impact is through publication frequency. Now as I mentioned before the H index is a metric of impact despite its shortcomings there are some debates around it. Here we have a publishing profiles for three researchers with researcher C having a stellar performance by publishing many papers in high-end journals in a short time span. Researcher A and researcher B have slower rates of publication but circumstances make it difficult for them to have a stellar performance especially researcher A and I guess I kind of see myself more as a researcher A type person because I've basically had you know I'm on my fourth career path right now and I guess I'm more of a Sunday League publisher at this stage. These publication impact metrics and things like that are certainly entrenched in our research culture with some publishing their profiles on Google Scholar. Others also set their profiles up in ResearchGate, academia.edu and also there's other ways in which you can raise your research profile for paper publications other than using the standard citation approaches that I've just described. You can also set up your own blog and here this blog was very useful in terms of being raising his profile through radio and TV programs. You can also use Facebook, Google+, or Twitter other ways in which you can raise your research profile for example here. This particular researcher he set up a Twitter profile. He has 2,000, almost two and a half thousand followers. Now if he publishes a paper up on his Twitter account he automatically can spread it to that number of people can access it. So it's a powerful tool. Similarly sometimes the publication houses will publish papers up and in the case his one by plus one that went out to their followers of about 1500 here for the Royal Society of Publishing almost 7,000 people had access or learned about this paper here. So definitely in this case social media can be certainly a powerful tool for raising for gaining impact and definitely raising your research profile. Other than the H index there's a whole range of new metrics that is being used to show impact and this is an example here. They are called altmetrics and impact stories shows here where someone has published something and so they're showing the number of readers on Mendeley and a number of other different metrics in here. So just to summarize in terms of the metrics for impact and how impact is assessed and how that can raise your profiles in a number of ways there's sort of usage that I've just spoke about and this is sort of gained through Google Scholar. It's also gained through Web of Science and other types of usage tools. You can raise your profile through peer reviews through researchgateacademia.edu and also f1000. Of course I spoke quite a lot about citational personal citations and because DOIs are certainly very helpful in that respect and of course social media and also altmetrics are other ways to increase or capture your impact and spread and increase your profile. Right so having spoken about paper publications and focused very much on them I now want to turn to open data publication. It's certainly an emerging trend in scientific publication and researchers including government scientists are publishing their data for open access. Open access is shown by this symbol here an open lock and you may see it in your travels but certainly it is a way I believe you can increase your profile. There's a number of data repositories where you can publish your data. In this case DataBlib has identified 604 repositories and the top 10 countries are listed here. So there's quite a few data hosting organisations. Examples are scientific data which is linked in with Dryad. They put up well described data associated with publications. There's also Pangea or Pangaea another source for the physical sciences. There's Fig Share and in Australia we've developed the Australian ecological knowledge and observation system portal we call ECOS and so ecological data is certainly available through there. Now I just want to go through some examples of how publishing data can certainly get you noticed. This is an example where a researcher Professor Mike Ball at Flinders University here in Adelaide he has been studying sleepy lizards for a well over 20 maybe even 30 years and he decided to put up the metadata in the Australian National Data Services Research Data Australia portal and as a result of putting that information up there he was contacted by other researchers that he didn't know to collaborate on a paper and research work and this actually led to new work, new funding and eventually new paper. Another example where organisations, government organisations can get recognised. Researcher in the US doing a global study, a group of them doing a global study on grasses and their phrenologies and how the change in response to climate change and they accessed our portal and this led to some state government agencies having an international profile in a way that they've never had before. Okay so in terms of getting noticed there's other ways in which you can get noticed there's now an ability to assess data citation impact via the Web of Sciences data citation index even though it's sort of early days for it being a tool that we can use. Nevertheless it's definitely helping us to identify where our profile is and there's been some really good work undertaken by Heather Pinovoy who actually looked at the relationship between those who actually publish their data at the same time as they publish their papers and there's been a strong correlation in terms of citation impact when you do that. Just to give you an example of how it looks once you get in there there's this over here you find a particular data set of interest and it'll show might be yours or somebody else and it'll show the number of times it's been cited and who's actually cited it. Now I'd just like to I guess take you through a real example how publishing his data has actually led to has given him an improved visibility. This researcher has published the data set it's I guess underpinning research and it's been published in Dryad and the data was submitted at the same time and obviously the paper here as you can see was published in Molecular Ecology. So when the paper was published he submitted a package of data sets you can see here there's two data sets which was put up and you can see that both these data sets has been downloaded a number of times and you can see down here the bottom in addition to having a DOI for the paper there's also a digital object identifier which has been given for the data set and as a part of submitting the paper publication he has referred to that data set within the body of the manuscript and also maybe and in the references would be a normal thing to do and and as a result of that this has led to citations through things like data citation index. Once you reference your data set in the body of your manuscript or in the references data citation and other citation services can pick that up. Another thing that I'd just like to sort of point out at this stage when this researcher put up this information there was a license which was attached to the work and in this particular case it's a creative common zero which is a US license. The interesting thing for us in Australia when Australians publish their data in international repositories we need to be mindful about the licenses which we're publishing them under. We have a set of licenses in Australia which are called creative commons 3 Australia which has a range of different licenses for open access the most common one is creative commons by attribution commonly known as CC by. Now with something like CC zero here this is under the US creative commons license and CC zero means that you can put up your data and the person who uses that data can do anything whatever they like with it they don't even have to attribute you. So it's slightly different in Australia where we our most least restrictive license that we have or our open access license is a creative commons by attribution meaning that the user has to acknowledge the use of that data and that's all I'll say about license but it's something that I'm often asked about in terms of licensing. Okay now I just want to go through another example here with somebody this researcher Christopher Lloyd he's about to present this presentation at the Ecological Society of America's conference in the US and he was glad to share a preview of his slide share with us and in that he did an analysis of where his data was being cited so he put it up in Fig Share and he actually had 12 citations via data citations index but he had over a thousand views of his data set in Fig Share and through tweets he only had about four and I guess the take home message I got from his presentation was you need to really put your data up in a variety of places so that many people can access it and as a consequence it can raise your research profile. There was a recent publication on Twitter or commentary on Twitter about whether publishing and data can raise your profile and well not so much publication your data can raise your profile but putting your data up on and using social media can raise your profile and it was published by Hall in Genome Biology and he talked about a Kardashian index and how some people may sort of have a profile which is all Twitter or all social media and not much substance but I think there are a lot of researchers you'll find there are some researchers who really they're more keen on doing the paper publication especially those who are seeking a stellar career and they see research as a really long-term career and they want to really leave a legacy they certainly will will ensure that you know they have quite a lot of publications as well as using social media to sort of promote that publication so there is some debate about the role of social media in raising your profile but coming to a close now and I just really like to say that publishing your data openly sharing it and publishing it it's really good science practice as scientists we want to do robust science and we also want it to be our science to be reproducible so publishing your data gives others an opportunity to actually reproduce that science it also can minimize fraud and and if there's even errors in your data you can get quite good positive feedback to to make your data even better and to make the science even more robust so that's a benefit of publishing your data openly. Another reason for publishing your data is that you can be a good science citizen because you can meet the funding terms and conditions of a lot of grants and things like that so it's a worthwhile exercise to do that as well I believe. In the interest of balance there are times I believe when perhaps you need to think about whether you should be publishing your data some of these issues sent around patent applications certainly in the field of medicine confidential human patient term details is a real issue to consider if you're about to release new IT software and you're arranging a commercial sponsorship there could be reasons to keep it confidential then in my in my area of research certainly governments do declare certain species to be highly sensitive due to poaching and other kind of nefarious activities and so it might be useful to keep to not publish data about those species and of course really importantly for those who are early career researchers especially doing your PhD you may well want to put up your data sets in I don't know data submission tools and things like that like for example we have a shared tool which is a submission tool for publishing ecological data and so you want to put it in there not so much to publish your data but to let other people know that you are doing this research so rather if you have a fear of being scooped well then you can then bargain that date until you've actually published your research so that's another other time in which you need to think about whether you should publish your data okay so I'd like to sort of coming to a close now and just sort of close with the statement that governments research organizations journals and many others are pushing for open access publication it's a really sort of rapidly moving juggernaut and it's I believe really worthwhile to get on board all of these people are learning by doing and how to do this to make it ethical fair and useful for all I would argue data publication is in its infancy in terms of ethics and protocols and it is evolving quite substantially in response to use of feedback so any researchers out there who are hoping to publish their data certainly feedback to those repositories about new ways in which they can improve your experience I believe if us researchers don't take ownership of this juggernaut now we will be left behind and others will do it on our behalf and just before I close I'd just like to say that the views expressed in this talk are not necessarily those of of turn or the University of Adelaide or they'll expect a lot of the views will be in common with their views as well so yeah basically my views are synthesis of the ideas in the data publication space and I grateful acknowledge those who have actively published blog and commented in this area it's been very useful indeed and I would just like to acknowledge that this presentation will be posted to be available under open access and certainly thank you for listening and if you want if we can be of further help please contact turn eco informatics through these web addresses here and we are certainly welcome any new collaborations so thank you so thank you Anita that's been a really informative and interesting presentation and we really appreciate you taking the time to share your knowledge and experiences of how research data can be used to raise your research profile so thank you to all our attendees today and thank you Anita and have a great day