Many thanks for joining and for your participation in this OpenAIRE PROVIDE community call, a monthly meeting dedicated to all those who contribute content to the OpenAIRE infrastructure: content providers, managers of repositories and of other scholarly communication services and systems, from institutions and from research communities. So many thanks for joining.

Today, apart from the usual updates that we give about the service itself, we have a dedicated topic: the OpenAIRE Open Science Observatory. It was a request from two people in the previous community call. It was something that we had already thought about, and since our colleagues in OpenAIRE were available to present and discuss the Observatory with you today, we took the opportunity to bring it forward and dedicate today's presentation and discussion to it, to better understand this service and also the way that you, as a content provider of the OpenAIRE infrastructure, have your content available via this service. We will hear from our colleague Ioanna, from Athena Research Center, from OpenAIRE, a bit more about the Open Science Observatory: what this service wants to achieve, what the main issues regarding it are, and the main functionalities, content and information available via the Observatory. This will be our main topic and we can discuss it later, but you can also put your questions about the PROVIDE dashboard and other services in OpenAIRE, so if you have any issue to discuss with us, feel free to ask.

So, recent developments and novelties. Some of my colleagues here who are part of the PROVIDE management team can also add something more, but we have identified three main novelties, or reminders, on recent developments in the PROVIDE dashboard. One is something that
is being developed. You cannot see it in beta or in production yet, but we are working on it, and the workflows and even the main components of the user interface are already defined. So finally we are making progress on the process of registration of CRIS systems. As you know, we have a process for the validation, so in the validator tool this is available, but not yet a process using the PROVIDE dashboard to register and to follow a standardized workflow for the registration of CRIS systems. We are preparing it; our colleagues from Bielefeld University are in charge of this within the OpenAIRE-Nexus project and the management of the PROVIDE service.

What is important to say, and something that we have already shared in previous calls, is that DRIS, the Directory of Research Information Systems made available by euroCRIS, is the authoritative registry for CRIS systems in OpenAIRE. So make sure that your CRIS system is part of this directory from euroCRIS, and then you can make it available via OpenAIRE when we have this new functionality available. We hope to have some information about this soon; we made some important progress. If my colleague Andreas wants to add something, feel free; if not, maybe later Andreas can present this service when it is available in beta, at an earlier stage, or even in production when it is fully available and tested by us. My colleague André already shared the address of the directory here, just for you to be aware of it; you can check it if you are aware of any system and make sure that it is registered there.

Something that we presented in the last call, and this is just a reminder: be aware that we have the multi-user access functionality working. It was really a request from the community, and we had a
dedicated presentation and demo about this in the previous call. Be aware that this is working properly in the dashboard that I am sharing now. Under this update area you have access to update the admins. There are three tabs: one is dedicated to the OAI-PMH interfaces and another is dedicated to the update of admins. So you can invite more people to have access to the same content and information that you have as an admin of your data source in the OpenAIRE PROVIDE dashboard. What you need to do is invite a member, but only members who already have an account in OpenAIRE. If they don't have an account, please ask them to create one, and then you can add them. So you are not inviting them to create an account and join this service; you are inviting an already existing user, an already existing account. Be aware of this; it is the only limitation that we have. The rest is working quite well.

One of our colleagues in the previous call, as a result of this, identified one or two issues with the data sources that she is responsible for. Liana, I'm not sure if Liana is here, but this is related to another issue, not to this new functionality. In fact, we didn't identify any issue with this new functionality, so it is working well. This is just a reminder, as I know that we have new people in every call, so some of you who were not in the previous call are now aware of it. It is an interesting functionality when several people manage the same data source in a big university or a big research institution: people from different departments may need access to this service, so it's useful to share these admin rights. We
have issues with the registration process. We still receive the registrations, but in the process of registration the user may receive some errors. This has been happening since September. We identified that this error was related to another thing in PROVIDE, and our technical team is fixing it and changing some parts of the code of the PROVIDE service, so we are almost ready to put the fix in production. So if you are interested in registering your service, your data source, as a new content provider in OpenAIRE, be aware that we have an issue with the registration: we still receive it, but you don't receive the proper feedback on the validation of this registration. If you have any issues, just contact us via mail or via the helpdesk and we will provide you an answer. We can still proceed with the registration process, but the visible part, the feedback that we provide via the dashboard, is not working properly for that specific functionality of the registration. We hope that this will be solved really soon.

So we only had three items, but I discussed with my colleague and we said: OK, be aware that, apart from the history of the aggregation of your source that you have available in the PROVIDE dashboard, here in the aggregation history tab, where you can identify when was the last time that we aggregated the content and the last time that the content was indexed, there is also this page visible via Explore. My colleague André will share the link. This is a page explaining the whole workflow in terms of aggregation, and at the end there is a table where you can see the status of the index update of the OpenAIRE Explore service. So be aware, and
then, if there is something relevant that we can identify, we note it there. If you see some delays, it means that we have some particular issue. Sometimes in this process we identify a decrease of output and we try to identify where the problem is, and then we need to delay the indexing a bit. This is visible, and we make it transparent on this page, where you have a table with the updates that you can check, and sometimes some explanation, with information about specific funders, etc. So this is just a reminder. It's not something new, it's something that is already there; you can use this page if you want to check something, and it's useful for you to be aware of it.

So let's now proceed with the main topic, unless there are any questions. OK, Rianne is also noting something about ORCID. OK: if you create a new account, you can also create it using other credentials, from federated sources but also from ORCID.

I just want to share that we tested it, and I added as an administrator a co-worker who logs in with ORCID credentials.

This is perfect, yes. OK, thank you, Rianne, for sharing that. Things are working well, and André also shared all the links. So now my colleague Ioanna Grypari can proceed with her presentation. You can introduce yourself, Ioanna, feel free; this is a community call and we are all part of OpenAIRE.

Hi everyone. It's so nice to join you and to actually meet the people responsible for all the content; that is amazing. As Pedro said, I'm going to talk about the OpenAIRE Observatory today. I'm going to share my screen and do a quick presentation of the story behind the Observatory and why we built it the way we did, then I'm going to move on to a quick demo, and we can have a discussion later, if that is OK. All right. So, the idea behind the
Observatory was to start building a platform in order to better understand the European open science landscape. We are already considering extending it to the entire world, so if you're not from Europe, please stick around; this was just the initial idea and of course we would like to expand it. How do we monitor and enhance open science policy uptake? And, as a secondary goal, how do we track research activities using the content you provide, and how do we aggregate all of this to measure the impact on society?

In terms of open science policy uptake, there are a lot of mandates coming up and becoming more and more popular, so we built a platform to try and track what has worked and what has not, and to reveal hidden potential, areas that are lagging behind, and so on. We want to compare at different levels of interest, one of which is what we call data sources, that is, the content providers, and I'm going to showcase later in the demo how we do this. The idea behind this indicator platform is: how do we turn all the data provided into actual insights that later lead to changes in policy making and so on, in order to promote good practices? So, under the auspices of open science policy, we wanted to build a platform that can be used for monitoring, policy making, telling stories, reporting on performance and, of course, analysis.

As usual, we're all about openness, so it's built on the OpenAIRE Research Graph, and I'm going to show you later where in the pipeline, in the OpenAIRE workflow, this is. It is based on open science principles, and we try to build indicators and visualizations that are relevant for the community. The typical user or stakeholder for the Observatory would be a research administrator, a policy maker and so on; there is no reason to limit it to these, but that's who we had in mind.

This is the workflow, or the pipeline, of the OpenAIRE Research Graph. At the first step we
have data sources, the content provision, and at the last step, all the way to the right, we have our two indicator platforms, that is, the analysis: OpenAIRE Monitor and the Open Science Observatory that we are discussing today. As you can see, everything shown here is based on content provision, and the validity and meaningfulness of the indicators depend on having metadata records that are as good as possible from the original sources.

OK, so what is it, finally? It's a user-friendly (I think, but you will tell me) data and visualization platform with different exporting capabilities that I will show you, so that someone can just download the statistics and do their own analysis, or download images and put them in a report, with the focus being open science uptake across and within Europe. As I said before, since OpenAIRE has global coverage, we are considering extending it to the entire world. It's mostly focused on open science indicators, of course, but there are also indicators on open access research output and how it impacts science altogether: collaborations, academic impact and so on. We have broken it down by levels of interest, and again the focus here is at the country level. There is another platform, OpenAIRE Monitor, that focuses on funder and institutional views, but that is for another time, or you can contact me if you are interested in that as well.

OK, I'm going to move on to the demo now. Can you see the platform, now that I have switched from the slides?

Yes.

OK, perfect. So after the content provision, we harmonize, deduplicate, link and enrich everything that has been provided, with different properties and relationships, in order to end up with this final product, which is statistics, indicators and metrics on the content available in the graph. Starting in the Observatory, we have the overview of Europe: there is a map of Europe where we have different information for
each country: open access publications (publications refers to peer-reviewed publications everywhere in the Observatory), data sets, repositories and journals. Again, if you are not in Europe, please try to imagine this at a global level, or pick your favorite European country. Right off the bat we can see some interesting things; I'm just going to give some examples of the type of analysis one can do here. We see that for the United Kingdom 630-something thousand publications are affiliated with an organization in the country; however, only 80 percent, or about 500,000 of those, are deposited in the country's institutional repositories. These numbers are actually going to be updated tonight, so if you have any comment on the numbers, please wait a few hours. So, as we see, for the United Kingdom 80 percent is deposited in the country's institutional repositories and the rest is not, which says something about the practices and perhaps the infrastructure. However, when we move to Switzerland, we can see that only five percent of the publications affiliated with the country are not deposited in a country repository. This could also be because Zenodo is in Switzerland as well, or not, but this is the type of thing we can see here.

Further down we present some overall statistics for the numbers that you see above. One thing that may be of interest: here we have the number of repositories in OpenDOAR and re3data. You see underneath (perhaps I will make this a bit larger) that 31.5 percent are validated. What does validated mean? We go here to the methodology, which you can do for everything on the platform. Validated is under constructed attributes, and we see the definition and the construction: a data source of research outcomes that upholds metadata standards. Then we describe here how this is done, with the validator service that was previously mentioned by Pedro. This is just an example of how you can use the methodology. So we see, for example, here that
validated means that the sources have a validator score above 50, and that just 30 percent of them are validated, indicating that, as a community, it's good for us to work on metadata standards and so on.

Further down we present a per-country view of the same numbers that you had before, and some additional ones. We have repositories, open access journals and then open access research outcomes, the four main entities that we have in the graph: publications, data sets, software and other. There are different views here, affiliated or deposited, which may be of more interest to you depending on who you're representing, and you can see the numbers as actual numbers or as shares, and sort by different columns. If you want to see what happens per country, you can click on a country here, search for a country here, or just select a country from the map and then view details.

But before we go into that, let's move to the Europe view, which has a more detailed overview of what's going on in Europe as a whole. We have some average numbers here, some aggregate statistics. Then we move on to a set of graphs. We have separate tabs here: overview, open science and collaborations. I've split the indicators into these; there are more coming along and being populated all the time, but we started off with this. We have followed the same strategy for most of the graphs: we show the graph with four different options, by country, by data source (for the top data sources), by organization and by funder. So for most of the indicators, if not all, you'll be able to see how they vary across these levels. The functionalities that I mentioned before are here, so one can download an image in their preferred format, or the data behind it if they want to, for example, do the analysis themselves or put the image in a report or something like this.

As an example, let me continue with the open science tab. Now, here we have gold and
green publications. Again, these are constructed attributes; it's not inherited metadata, so if I go to the methodology I can see the definition of gold and green open access and how, in particular, we constructed it using the data from the content providers. For example, one thing you can do here: green publications are the open access publications deposited in a repository. Let's say I want to see all the green publications. I can deselect the ones that are only gold or that are neither, and I can see the green ones and which of these are also gold, that is, publications in fully open access journals that have also been deposited in a repository. I can also do the same kind of analysis by the different dimensions, as we discussed before. For example, if I go to organization, I can see that Autònoma de Barcelona has gold publications that we cannot find in any repository. That doesn't necessarily mean that they're not deposited, only that we cannot match them in the metadata records; or perhaps they're not. We leave the analysis to the user.

Here we have provided some indicators that show metadata completeness: whether a publication comes with an abstract, whether it comes with a license, and APCs, CC license, PID and so on. And there's a more details tab where you can compare the particular indicators across countries, again by affiliated (including peer-reviewed) or deposited, like in the first view; you can also see them as shares, to make the comparisons more meaningful. Further up here we have a distinction between the types of research outcomes: publications, data sets, software and other. By clicking on one of these buttons, you can see the indicators that are available for that research product. I erased the cache before; that's why there was a small delay. And again, when you go to more details you can see more details, some collaboration indicators and so on.

Now, let's say that I want to examine a particular country:
I can activate the map here if I don't want to return to the home page, and if I look at open access data sets, I see that Spain has a very high number. Let's say I want to understand this further, and I see that even more than that is deposited in the country's repositories, so something is going on with Spain's data sets. Let's click on view details. Here I see some average information on Spain, and then, moving on to the overview, I see that most of these data sets were added in 2013 and 2015, with a lower number later on. Perhaps someone who knows can figure out why this is happening, what is missing or what went well; a content provider can perhaps figure this out.

Then perhaps I want to know: OK, we have all these open access data sets, but what is their metadata quality after the enriching and deduplicating and everything we did in the graph, that is, what is the best metadata, as a combination, that we have for the records? If I go to open science here, let's go to data sets. These are all the open access data sets. In this graph I see that most of them do come with a license, so that is good news. And here, where I limit myself only to CC, Creative Commons, licenses, I see that the numbers are quite high as well, almost the same as in the other graph; at least for 2014 there is just a two-point difference. This means that these data sets did pretty well in terms of licensing: most of them have CC licenses, so on the permissive side, and we will add some indicators on the type of CC license as well. If we see how different data sources compare in providing licenses, we can see some where all the data sets they provide come with a license, and some that provide way more data sets but not all of them have licenses. I'm not interpreting this in a particular way; I'm just saying what we can see here.
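
The comparison of license coverage across data sources described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the sample records, the field names `data_source` and `license`, and the rule that any identifier starting with "CC" counts as a Creative Commons license are all hypothetical, and are not the Observatory's actual schema or methodology.

```python
from collections import defaultdict

# Hypothetical sample records. The field names are illustrative assumptions,
# not the OpenAIRE Research Graph schema.
DATASETS = [
    {"data_source": "Repo A", "license": "CC-BY-4.0"},
    {"data_source": "Repo A", "license": "CC0-1.0"},
    {"data_source": "Repo B", "license": "CC-BY-4.0"},
    {"data_source": "Repo B", "license": None},
    {"data_source": "Repo B", "license": "Proprietary"},
    {"data_source": "Repo B", "license": None},
]


def is_cc(license_id):
    """Assumed rule: an identifier starting with 'CC' is a Creative Commons license."""
    return license_id is not None and license_id.upper().startswith("CC")


def license_shares(records):
    """Return, per data source, the share of records with any license and with a CC license."""
    counts = defaultdict(lambda: {"total": 0, "licensed": 0, "cc": 0})
    for rec in records:
        c = counts[rec["data_source"]]
        c["total"] += 1
        if rec["license"]:
            c["licensed"] += 1
        if is_cc(rec["license"]):
            c["cc"] += 1
    return {
        source: {
            "total": c["total"],
            "licensed_share": c["licensed"] / c["total"],
            "cc_share": c["cc"] / c["total"],
        }
        for source, c in counts.items()
    }


shares = license_shares(DATASETS)
for source, stats in shares.items():
    print(source, stats)
```

Reporting shares next to the totals is what makes a small source where every data set is licensed comparable with a large source where only some are, which is the pattern pointed out in the demo.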
And similarly, we can do this for the availability of PIDs: go, for example, to organizations and examine, for the organizations affiliated with the data sets, whether those data sets come with a PID or not. Overall, it is very important to emphasize here that the quality of the numbers depends on the quality of the affiliation metadata that we receive from content providers and that is supplied by the researchers themselves. Of course, if a country is missing from a record, then the research product will not show up here, so we may have a skewed picture of what's going on.

Now, in terms of future plans for the Observatory: as I said, we want to open it up to the entire world, but the timeline is still under consideration. We're also working on integrating a Field of Science classifier in the OpenAIRE Research Graph, which would mean that, at least for every publication, it would be possible to assign it to a specific field of science, down to a couple of levels; so not just economics but economics, microeconomics, applied microeconomics, and so on. What we are more actively working on right now, a more certain goal, is the continued development of indicators, which, as I said before, depends on the quality of the OpenAIRE Research Graph that we're always working on improving. In particular, the things that you will be seeing in the Observatory in the next couple of months are more detailed indicators on different FAIR aspects, publication costs (APCs), and indicators on collaboration, including network analysis. If there is something that would be of particular interest to you, please let us know. OK, and that is it from me.

Great, many thanks, Ioanna. So we have time for questions; feel free to make comments. It's also important, as you are part of the OpenAIRE community, to provide feedback: not only questions or thoughts, but criticism is also welcome, because we are all contributing to
the same effort to build this type of service based on open infrastructure. So feel free to ask questions using your microphone or to put them in the chat. Ioanna, I was speaking just so that you could check the chat and reply to the questions that are already there, from Norma and from Jochen.

Yes, let's see. Jochen asks: does it mean the Observatory is using absolute numbers, not relative numbers, to compare? No. As you saw, the indicators in the more details tab can all be turned into shares, and we're also now considering including some growth rates, and also some numbers that are per capita, or per GDP or R&D spending, in order to make them more comparable; depending on the indicator, there will be a different one. And then someone says congratulations. Thank you, Norma, this is my favorite quote. And Amelia?

OK, I put my question; there is a question in the document as well.

Yes, André can support us on that. Usually we also have a minutes document where people can ask questions, so I'm sharing my screen so that we can all see the questions here. Thank you.

OK. I'm sorry, what do you mean there? How do you know that they're not counted in the Observatory?

I know because I am checking this data on the Open Science Observatory. You can see the data sources there, and I recognize a lot of our repositories, but not all of them.

But the data source view shows only the top data sources; it doesn't show all of them.

Yes, I know, but if NaRDuS has 11,000 and SCIndeks more than 20,000, they must be in the top 15 for sure.

So this may have to do with the country of the affiliation. In any case, I will not be able to answer this meaningfully on the spot, so we're going to check it out and come back to you on this. Thank you so much for prompting.

If I may, I would also like to address this issue, because if this tool aims to be a trend analysis, then it's OK; otherwise, if we
want to find reliable information, then we need to know what the basis is, and we need a feedback mechanism to see what data was used to produce the indicators. Personally, I would also like to use this tool as a reliable tool to see the figures and not only the trends, and for that I must know what your basis is in the Observatory for my country, for my institution.

Yes. All right, so let me answer this in two parts. First, everything that is in the OpenAIRE Research Graph is in the Observatory; nothing is excluded. The indicators are automatically created from the graph.

Now, I don't want to enter into a dialogue, but I have the same problem then, because we have more than 500,000 documents in Portugal and the map shows 67,000, if I'm not mistaken.

No, these numbers will change. They just updated Explore today, and I told them to hold the Observatory because I had the demo, so we expect the numbers of the Observatory to match the ones in Explore by later today.

But my question remains: if I don't have a way to see how you are producing this data, then it's not fully reliable for me. That's my feedback. We have to be comfortable with the data we see, we have to see that everything we have is there, and for that we need some feedback mechanism, or something to see what you are using to produce the data.

Perfect. So a link to Explore would satisfy this, right? Because they have the same numbers, so linking to Explore, where you can browse all the research outcomes. I just want to confirm that it would solve this problem. Is that correct?

Maybe it can be something more. I think one thing that we did quite well in the Observatory is the methodology. The methodology is already quite important for the transparency of this kind of tool, which is critical, and here we are talking about the transparency of the approach that we
use to gather and expose this data. I think it would also be useful to improve this part a bit for the content, and to clearly state what we are using from the graph in the Open Science Observatory. In some cases, where it makes sense, it can be a link to Explore, to make it clear that there is a link between what we have publicly available in Explore and what we are consuming and exposing via the Observatory. For some types of content we may also expose a bit more of what we have in the graph, let's say in the back end, to make it transparent. But this is something that we should discuss a bit further.

Just to give you some feedback on whether Explore would work or not: if we were to use this tool as a reliable tool, we would start by taking a sample, extracting the information from both sides, comparing them and seeing what the differences are. Once we are assured that all the data matches, we can rely on the figures. So if Explore can somehow extract a sample of data from Portugal, then it would work.

OK, that's very clear.

Can I ask another thing? I don't know if this is Monitor related, and I don't know if you will have a presentation on Monitor or if you already did one.

I'm part of it. Soon we will have one, in December, our public webinar about the Monitor, but you can ask the question anyway.

I can ask questions, or rather raise things, because we are starting to think about what we need and I would just like to give you some feedback on that. The first thing we are strongly missing, since we are a Plan S compliant funder, is a way to see, for every project, for every institution, even for every researcher, what the level of compliance is, and to be able to drill down to each of the publications related to that level
of analysis. We want to have a monitoring and compliance tool that we can use. For instance, if a project reports 10 outputs and only five of them comply with Plan S, then we want to send a message to the PI of that project to say: hey, you have 10 outputs, but you're not complying with Plan S in five of them; these are the DOIs or PIDs, so please update or upload the outputs in the repositories, or whatever. This level of detail is very important, so hopefully in December we'll see whether Monitor can comply with these requirements.

OK, that is noted. We are now preparing, for the institutional dashboard of Monitor, Plan S indicators with respect to transformative agreements and so on. These are not showcased at the project level; however, in combination with Explore, perhaps something can be done. I'll keep it in mind, we'll work on it and keep you posted.

OK, and my team and I would be glad to talk with you further about these requirements as well, and even to think about global services that OpenAIRE could provide to funders as a whole, to check compliance and things like that.

Yes, the funder dashboard is meant to go in this direction. However, it is of course at a more aggregate level, and each funder's requirements are taken into consideration. It is definitely something we're very interested in, so let me get back to you and we can talk further about this.

Yes, I would be glad to. Thank you.

Many thanks. So we already have some messages here: congratulations on the work, and also specific cases from Serbia and from Portugal raising some issues, limitations or improvements that we should work on. And by the way, about the API question: we do not, at this point we don't expose the information. Yes, yes, we do expose the information, different kinds of information. It's publicly available via develop.openaire.eu, where you can check what we have; we have there even the
information about projects, etc. We also have a dump of all the content from the graph, available as a record in Zenodo. In the past we had an OAI-PMH interface for all the publication content that we have in our graph, but we realized that it was not scalable or feasible to maintain. Now we generate, I'm not sure about the periodicity, but at least three or four times per year, a dump of all our content from the graph in Zenodo. You can search for "OpenAIRE graph dump" in Zenodo and you will find it; we put updates there.

One important remark is related to what Ioanna was saying about affiliations and organizations. We are pushing work in OpenAIRE to improve the quality of the information about organizations and about the affiliations of the authors of publications. This is something that we are improving in the OpenAIRE infrastructure, and it will make our graph more powerful. It is also related to the OpenOrgs service; we can put a link to it. It is a service where we can curate organizations, and via this service we can improve the quality of the information about the organizations that we have in OpenAIRE. The next step after OpenOrgs is to work with representatives of countries, where they exist, as is the case of Portugal: to work with the entities that are responsible for curating organizations at the national level, and to integrate this information in the OpenOrgs service and, by that means, in the OpenAIRE graph. With this we will surely improve the quality of the information that we have in our graph and that we are exposing in Explore, in Monitor and in all the other services that consume the OpenAIRE graph. OK, but this
will be visible via Monitor for sure, this improvement of the quality, and via the Observatory also.

I was going to make a comment. This is a small thing with regard to the PIDs: I think you should say "output PIDs", because when we see the information, it was difficult for me to understand which kind of PIDs you were referring to. Then I understood it was related to outputs, but you have PIDs for organizations, for authors, and for other purposes, and it gets confusing. It was confusing for me to understand what kind of PIDs you were talking about.

Absolutely, thank you. Research results, okay, so outputs. Yes, we talk a lot about publications, but yes, outputs. Good suggestion, Gilana, thank you.

Okay, great. So we are coming to the end, but I think these community calls have this objective: to share the progress of our developments and to have the content providers, those that are part of our infrastructure, providing feedback on the development of the services. This is really what we want, and this is one of the objectives of these community calls. This is why we accepted the challenge from two of our previous participants (one is here, and Liana) to dedicate this call to this new service, the Open Science Observatory. If you have more comments or questions, you can put them in the document that Andre already shared here in the chat; we can put it again for you to comment, and then we can follow up. You can also send us an email so that we can keep in contact with you. So we have already taken some notes, and unfortunately we always need to make improvements. Services are never finished, but this is why we are here and why we have these community calls: to improve. Okay, many thanks for your comments, comments also from Emilia, Norma, Jochen. Thank you, thank you all;
we receive all your comments and questions; many thanks. So I am just finalizing my presentation here, and I don't want to finish without inviting you to the coming calls. We have decided to skip the call in December, because in fact the first two Wednesdays of December are holidays in Portugal. I know that we are doing this for all of Europe, and in some countries the second Wednesday of December is also a holiday. But as we are going to have a tech clinic and a public webinar at the end of November and in December for other things from OpenAIRE, we have decided to skip it and start again in February, like we did last year. Yesterday we already published the new dates of the coming calls until the summer of next year, so they are already available in Provide, on the Provide community calls page; please put them in your agenda, in your calendars.

For January next year we don't have the community call, because we also want to have a public webinar related to the way that Provide is the door for you to EOSC. So we don't want to have a dedicated call about the subject; we want to have a public webinar, to which we will invite you, but this will be in January next year. This is why we are skipping that call as well. And usually the first Wednesday of January is also holiday time for several of us and for people in several countries, so this is another reason why we are skipping it. So we don't have a community call in December or in January; we are back in February. But we are going to have a public webinar about Provide in January next year, and we expect to have other public webinars related to other services in December. Okay, many thanks. The recordings and the presentation will be made available via the page of these community calls on the website, and do not forget to
subscribe to the newsletter, where we intend to send new information every time. There is an important article that we highlighted in the newsletter that we sent out yesterday, the third article that we identified: "Towards an open ecosystem for scholarly metadata". It's an important article; our technical director, Paolo Manghi, is one of the authors. I advise you to check this article. Maybe Andre can also put the link here in the chat. And many thanks for your participation. So that's all from us. And Johanna, many thanks for your effort, for being available and open to run this presentation and open for feedback.

Thank you for inviting me. I'm looking forward to more discussions.

Yes, many thanks, Johanna. Okay, bye bye all. Let's see if Andre can manage to put the link. Andre already managed to put the link. Okay, great, Andre, many thanks, and thank you all. See you in one of the upcoming activities that we are running, and if not before, in February in the community call. Bye bye.