So this is the first community call after the summer. The first Wednesday of September was quite close to August, and for some southern countries it was not easy to manage availability. We have a simple agenda, as for the other calls: I will highlight some updates and some quick information about the OpenAIRE Provide service, and then we have a main topic. We already addressed the broker service earlier this year, but as we have some novelties that will change a bit the way that you interact with the service, we really want to present them and to receive your feedback. I hope that we can have a comprehensive presentation to showcase what the changes are, and then hear your feedback. If you have any specific issue, this is also the time to ask, so feel free to jump in.

So first I do the updates, then we have the topic discussion on the broker, where we can address any specific questions about it, and at the end we have time for other questions. We have the notes for the meeting; Andrea will share the link in the chat. You can put questions or comments in the minutes, and if we need to address something after the call we will check that document, but you can also write in the chat. Andrea, are we recording? Is everything okay with the recording?
Yes, okay. So I just wanted to highlight three or four things. One is related to the broker: Claudio, our colleague from CNR in Pisa, will detail what we did. We are slightly changing the way that we interact with users about the enrichment events. Currently what you see in Provide is the same, but we have already discussed the changes. We are not completely sure that we are taking the right approach in terms of usability for end users, but there will be some changes during October. Of course we will send that information, but be aware that on one of your coming visits to Provide, in the enrichment events tab, you will see something slightly different. Claudio will present it.

We are also preparing some improvements to the registration workflow, in order to have better communication with those who are registering a data source in OpenAIRE. And we made another small but quite important change, based on some helpdesk tickets and on interactions with our aggregation team. Now, when you want to update something, for example the OAI-PMH interface or its URL, the update has a field where you can add a comment, a text box where you can explain why you are doing this update. This is good for us, to receive that feedback. Sometimes you were just testing something, or you did something and we did not react properly, so now there is a direct channel: whenever you change something, your information, the OAI-PMH interface and so on, you can write something there and we will be aware of it.

The other piece of information concerns the UsageCounts service. It is
important to say that we are progressing with the statistics in Provide. As you are aware, we had some issues with some repositories that enabled the service a long time ago, so their figures are not up to date: I think you only have figures until the end of 2018. But we are progressing and fixing everything, repository by repository. Those that enabled the service recently have no issues; those that were among the first to enable it have some issues, but I think they are almost all solved. If you check your data in the usage statistics and see that it is not up to date, please be aware that this will be solved in the coming weeks. I have just checked with my colleague Dimitris and they are working on it; I think almost all are correct now. But contact us if you have any issue, and if you want us to check it here in this community call, just put the information in the notes and we will check it.

Be aware that we have a page with use cases, and some use cases that we want to highlight are related to content providers and the way they are using the OpenAIRE Provide services. We have two or three use cases there that are closely related to the functionalities we have provided recently. We had a new one from Serbia, and in the past we also had one from Portugal, from our national infrastructure and network of repositories. So if you have something relevant, please share it with us.

I am quite happy with an invitation that I received from the repository managers in Spain who are part of the network of libraries and responsible for the working group on repositories in Spain. They are organizing a session, after Open Access Week, on the OpenAIRE Provide service with real use cases from Spain, on October 29.
Thank you, thank you, Paco. I am quite happy with this kind of activity, so feel free to organize something like Paco did with other colleagues, and if you need any help from us, or want to invite us, we are quite happy to support it. Paco, if you want to share the link, feel free; I know it is in Spanish, but at least people will see what you are organizing. I think this is an excellent example of community involvement, because people are using the Provide service for different things, and this is the kind of use case that we want to have in the OpenAIRE portal. We are happy that people like Paco are organizing these sessions and highlighting the way they are using OpenAIRE. In some cases it works perfectly; in other cases there are gaps or problems, but with this visibility we can also push to solve them and do our best to have everything working well.

And do not forget, since I sometimes receive requests for information about this: we have this page where we put the information about the aggregation and content provision workflows, so you can see how they work. Most importantly, this page has a table that says when the next update is scheduled and what the two or three main highlights of the last update were. We try to keep this properly updated; sometimes we do not, but usually the information is here, as you can see, including when the next update will be. We need to put this in a different place to highlight it. The page is also available in Explore, at the bottom of the search page; it is not in the right place, but we can put it in other places. So this is the last index information, since I am receiving requests about that; be aware of it. Of course, this is the
information about the general index update and the status of the whole OpenAIRE Research Graph. Inside Provide you also have the aggregation history, which I think is working well. We realized that we do not communicate so well with end users: we did some user tests and found that this page is not clear and does not give clear information to the end user. We are trying to address that; our design team is proposing some changes for that specific page in Provide, so maybe we will make some changes. So be aware that you have the aggregation history in Provide for the specific information, that is, when we last aggregated and transformed your content, and then this page for the general information. Be aware of that, because sometimes I receive requests about it.

There are other things that maybe we should highlight, but these are the five highlights that I wanted to make. Andrea is also sharing the links here. If you have anything to report, feel free.

So now we will detail the broker service. We will remind you, and ourselves, how this service works, and then Claudio will also highlight, based on the workflow that we have, the changes. The changes are basically due to the scalability of the service; Claudio can probably explain this better. We were quite happy with the service, although we need some improvements in terms of updates and things like that. Sometimes repository managers find events that in fact have already been fixed in their repository but still appear in OpenAIRE. This is because it is impossible to have a perfect synchronization between the aggregation and the changes that you make, so we always need to take into consideration a delay of one to two months between the updates that you make and when they become visible in the broker. But now this will be different. So that is probably all of my information,
so feel free to ask questions directly, using your audio and your camera, to Claudio during the presentation, or in the chat at the end. Claudio, I can stop sharing my screen, or if you want I can keep sharing the screen.

Hello, hi everyone. Can you hear me?

Yes, perfect. Do you want me to stop sharing the screen so that you can do it?

Well, actually, I think I have the very same presentation, so I can just ask you to move forward. Thanks.

So, a brief recap of what the OpenAIRE broker service is about. The main concept behind the broker lies in the fact that one of the goals of OpenAIRE is to increase the quality of the information that we gather from repositories. OpenAIRE tries to build added value in terms of the quality of the metadata records, because as an aggregator it is important to build a uniform information space, actually a graph of information. Uniformity is a key aspect here: given the number of data sources that we have on board, the fragmentation of formats and the nature of the providers, OpenAIRE has to solve quite important issues in terms of the uniformity of the data. Just to give an example, the same information, like the authors of a publication, can be exposed in several ways by repositories, not to mention the references to projects or the language of a publication. So there is a set of processes in charge of normalizing these fields across the aggregation pipeline.

The goal of the broker is to give this added value back to repositories, so that repository managers can cherry-pick the typologies of enrichment that are most interesting for them. In fact, the key message in this slide is "potentially of interest to them", because we do not assume that every kind of enrichment that OpenAIRE can produce is actually interesting for a given repository manager. So the bottom line is that the enrichment of the records in the repository, the
original repository collection, is performed with extra metadata information. Can we go forward to the next slide, please?

Okay, this is the slide where I try to sketch the main changes that we are introducing in the broker service architecture. The moment that OpenAIRE updates the information in the aggregated graph, a set of algorithms that we designed identifies the events that represent the enrichments we want to deliver back to repositories. In the past these events were built completely, and I must say we underestimated the amount of information that we were enriching for repositories across the different topics, that is, the different kinds of enrichment that we could synthesize from the graph. In the previous implementation we built this full set of enrichments, and we observed after some time that the backend as we designed it, Elasticsearch in this case, could not cope with that amount of information. So we decided to limit the events built by default to the top 100 events per topic per repository. This gives a repository manager the possibility to preview the set of events that OpenAIRE can build for them, while still indicating, for every topic, the total number of events that can potentially be built.

In phase two, a repository manager who is interested in some enrichment aspects can create a subscription, and from that moment onwards the OpenAIRE algorithms will build the full set of events for them. These will then become notifications, because we now have someone who has expressed interest in some enrichment topic. This is actually phase three, where we match the events with the subscriptions. This information will be accessible, now for the first time, through notifications that can be explored on the Content Provider Dashboard user interface but also
consumed through a public API that will be available under api.openaire.eu/broker. We are finalizing the deployment in the beta environment during these weeks. Then we will open a, let's say, semi-public session with some pilot use cases, designing clients that automatically integrate these notifications into their repositories. This will close the cycle, giving events back to the repository collections in a semi-automated way. Can we go to the next slide, please?

Yes, this slide is a bit of explanation of how the enrichments are synthesized from the OpenAIRE graph. As you might have already heard in other presentations, among the processes that we run on top of the OpenAIRE content there is one named deduplication, which aims to identify multiple instances of the same scientific product deposited either in different repositories or even inside the same repository. It is based on different criteria that take persistent identifiers, as well as publication titles and authors, into account to decide whether two scientific products are the same or not. Different instances of these bibliographic records can expose different pieces of information, and the algorithm produces as output a group of publications, so we can identify which of them have some piece of information that the others do not. Thanks to the provenance information that accompanies every record, we can then synthesize the enrichments for every repository that takes part in a group of duplicate records in the graph. This is the baseline for the methodology that we implemented to derive the enrichments.

Then, how do we categorize this data? Given that some records in repositories might already have persistent identifiers or open access versions, we decided to categorize the enrichments into "more" and "missing", giving the flavor that we are either suggesting more subject classifications or more open access
versions, or information that was missing in the original collection. For example: your publication in this repository does not have an abstract, so we can provide one because we got it from, I don't know, Crossref.

There is still a set of topic categories that have been available for some time now only in the beta version of the broker. As Pedro highlighted in the last community call, if you remember, an evaluation was carried out for links to software. Perhaps we can spend some words on it, Pedro: whether something has changed in what we learned from this process, and what the next steps are.

Yes, we need to take some decisions about that, exactly. The software links are the ones related to mentions, because in some cases they are not really supplementary software, just a kind of mention that we found in the publication.

Yes, perhaps we could continue that discussion, because many of the decisions taken by the algorithms that we implemented depend on the semantics of the information that we find in the graph. And since the semantics depend on the precision of the mapping layers that filter the information from repositories and from large collections like Crossref, something may have changed since the last time, so now it is perhaps more likely that we can deliver more precise information.

Yes, if people are interested, at the end we can share some results just for people to see, and then we can decide what to do.

Thank you. I don't remember if there was more. Oh yes, last but not least, the concept of trust. It is important because, for example, references to projects are indeed acquired from repositories, but a considerable number are also produced by automated algorithms. These algorithms are not 100% precise, so there is a margin of uncertainty in every inference process that implements heuristics. So it is important that the data that OpenAIRE produces reflects this degree of
uncertainty. That is why, in the Content Provider Dashboard, before creating a subscription, repository managers can play with a slider that filters according to this trust information: the more you slide towards one, the more trustworthy the information should be, according to the confidence level generated by the algorithm.

And yes, this is the example with the numbers. Actually I don't remember when we updated this table for the last time, but we have probably updated the events in production at least a couple of times since we built this table together, Pedro, so perhaps we can revise it.

Yes, but the numbers here are not relevant. This is just to list all the types of events that we have. People will have the sample of the top 100 enrichment events for each, and then they can decide to subscribe to the different events. ORCID is already available, and then we have the links to software that we need to address. Maybe we can address that part.

Okay, yes. These were still open questions from the last community call, if you remember. Some of them are still only ideas, as the developer team behind the broker service has been working in the past months on the re-implementation of the algorithms that generate the events, following the release of the extended OpenAIRE graph that is now available in production. Since that task has been quite an important leap ahead in terms of technological upgrade, it has drawn essentially all of our resources for many months. So the ideas that we had on the table are still just ideas for the moment.

Yes, but we will share them with our community, which is important so that they are aware of them. We want to have a kind of metadata alert. Apart from the notifications that result from enrichment event subscriptions, we also want to have a kind of
alerts, metadata alerts, sent to all repository managers. We know what we want to do; we need the time to do it. Okay, so thank you, Claudio.

I will put this slide up again, because this is what I ask you all to understand and to check whether it is clear for you. Please put your comments in the chat or just turn on your microphone and share them with us. So we have these different phases. First we generate, let's say, one hundred events per enrichment topic and share them with you, and we also show the total number that you can expect for that specific topic. Then it is on your side: you decide whether to subscribe or not, and if you subscribe, we will generate those events. With this approach, as Claudio said, we can better cope with the scalability of our infrastructure and with the resources available for this service. Do you have any comment?

In fact, we now have this need, but we are also aware that not all repository managers want to receive, or are interested in, all the events. In some cases they are interested in three or four, in other cases in more, and in some cases they just want to receive links to projects and that is it. So once we put this change in production, you will need to subscribe to receive this information; this is what is important to say. But Claudio, do you have any clear information for our colleagues here? Are we going to move the current subscriptions to this new approach?

Yes, subscriptions that have already been created will be migrated to the new implementation, absolutely. We will try not to lose anything that was available from the past.

Okay. So do you have any comment? Is it clear? You can turn on your microphone if you want, or just write here in
the chat if you need to say something. I understand that this is something that you are also waiting to see in practice in the dashboard, but at least we wanted to inform you, as this is a big change in the way that we deliver this service to you. Thank you, Sandra and John, for your feedback.

We also have some questions in the meeting notes, three questions not related to the broker, that you can address.

Okay. So if we do not have any more questions on this, in the coming meeting we will highlight this again and maybe demonstrate it. I think we will certainly see an increase in quality in terms of service delivery; you just need to perform the actions required to subscribe. Thank you very much, Claudio, for supporting us with this.

So, questions. We have some minutes for questions, and if we are not sure about an answer we can address it later. The first question is that the statistics seen in one place are not aligned with the statistics in OpenAIRE. I think you can also help me with this question, Claudio, because you are aware of the timeline of the change that our colleagues in Athens are making.

Yes, let me open the link in the meeting notes so that I can better understand what the question is about.

It is a difference between what they see in Provide and what they see in...

Yes. The thing is that Monitor, the service where the statistics are exposed, is currently a bit behind in terms of updates. The backend of the graph in Explore was entirely rewritten, and the same thing is still being finalized for the statistics that are synthesized from the graph; that part is a couple of months behind. We are having meetings at least a couple of times a week to coordinate
the effort to finalize that work, so I think next Friday we are going to decide when the full set of statistics will be available again through that portal. It should then start to receive updates in line with the content updates that we push to the Explore portal, so it should be back on track soon.

Great. We will make sure that our colleagues Antonis and Dimitris, who are in charge of that, are aware of your question, and they can reply with any additional information if needed.

And from Brianna: adding a new list of projects for a local funder is a very rare task and I have forgotten how to do it, please help. Okay. I think we now also have better workflows in place, so you just need to contact us and send the list. Right now the contact persons for the list are Ari and Miriam, and we can put their emails here in the notes. Miriam, from CNR, is the person who will check the list of projects and put it in our information space, and Ari will manage the text mining and things like that. Maybe you already know Ari, but I will put the emails here. So we have a new local funder from Serbia, great.

No, no, the same funder, but a new call.

Ah, new projects, okay. So just send it to Miriam and it is done. I mentioned Ari because Miriam will interact with Ari.

Thank you.

Good. We are also doing the same for FCT; we did it last week. So you see, we are aligned.

Okay. What needs to be done to activate the Scholix API, to test that the linking service is working? Here I am not sure if I can help. Claudio, maybe you can help?

I am not sure either, because Scholix is a format, and for the moment it is implemented only by the ScholeXplorer service,
which is a sibling aggregation system with a set of data sources, different from OpenAIRE's, that contribute to building its aggregated information space. So in principle, to use the Scholix API you have to be a provider to the ScholeXplorer aggregator, or be part of one of the collections that ScholeXplorer aggregates. If this is the case, perhaps some queries can be performed directly on the ScholeXplorer APIs. At least this is my take on the question; I am not sure if I can go deeper.

If you want, turn on your microphone and we can discuss. If, for example, you want to identify links between different resources, between projects and publications and things like that, then, as you are mentioning Scholix, I think you can benefit from what we have in the bulk content and also from the DOIBoost dataset.

Well, ScholeXplorer only has links between publications and datasets for the moment.

Yes, but I am saying that Jill can benefit from all the links that we have in OpenAIRE.

Yes, but not yet in Scholix format. Implementing that is among our goals, but we are not there yet.

So you need to work programmatically with the way that we expose this. Okay, not sure if this was helpful for you, but feel free to open your microphone.

There is another question, from Jordan: looking at OpenAIRE Open Innovation, I see that there is a work in progress on broker service integration in institutional repositories. So, Claudio, what can we say about this? You are in fact the point of contact for one of these projects from the innovation tender. I know you are in contact with 4Science. Are you, or is it someone else?

No, I participated in the last calls, but they were organized by Alessia
with the support of Michelle. Anyway, I know that they were interested, as a pilot case, in integrating into the DSpace-CRIS platform a client capable of automatically getting the enrichments produced by the broker, by consuming the information from the API that we are working on. So basically they participated in the design of the metadata format and the methods implemented by this API.

Yes, and this is being done in this project by 4Science. So yes, Jordan, you will be able to benefit from that when it is done. The projects need to be finalized in December, so let's say that by January next year you will certainly have information. I think we will also have open presentations of some of the results of this tender. So it is good. Thank you, Jordan.

Okay, so we are coming to the end. As I have the page open, be aware of the public OpenAIRE sessions next week. We will have a set of different meetings next week: some internal sessions, but every afternoon we have public sessions. You can check the agenda and register. We will start with a public session on the implementation of open science and on how OpenAIRE, within EOSC, can work with other players in the global scholarly communication landscape. On the second day we have a public session on European, national and international alignment, with different people from OpenAIRE and from those we are collaborating with in other regions of the world: the work that we do together with COAR to engage in South Korea, Canada, Africa and Latin America. Then there is a session on Provide-related topics, let's say: one part about the graph, the OpenAIRE Research Graph, a kind of status overview
and the way forward, and also a session on the OpenAIRE Guidelines. This is on day three, the 14th of October. Then on day four there is "OpenAIRE for researchers and beyond", where we are highlighting all the RDM services that we have for researchers, such as Amnesia and, as you know, Argos. And the last public session, on Friday, is on another service, Connect: building open science gateways to open and link research outcomes, where we have different use cases. For all the General Assembly public sessions we will have several use cases presented, so feel free to join. These are open sessions targeting different areas of the work of OpenAIRE and the collaborations we have within Europe and in different parts of the world, so be sure that you are welcome to this week with five sessions.

Not sure if there is anything here in the chat. No. So, next call: November 4th, same time, half past two Central European Time, on the first Wednesday of November. And do not forget to follow the newsletter, to subscribe, or to invite others to subscribe. We send it every month, in the first week of the month, with the different news about the OpenAIRE Provide services and related services.

So many thanks for your participation. I hope that this was useful. If you have questions, put them in the notes; we will check them during this week, and if we get more input from other colleagues that helps to answer your questions, we will send you a message to highlight that we have answers in the notes. Thank you very much, and see you next week, it seems, for some of you, or in one month at the next Provide community call. Thank you very much for your participation. Bye bye.
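
The broker workflow presented in this call (a preview limited to the top 100 events per topic per repository, then the full set of events for subscribed topics, filtered by trust) can be sketched roughly in Python. This is only an illustration under stated assumptions, not the broker's actual implementation: the `Event` fields, the function names, the example topic label and the choice to rank the preview by trust are all hypothetical. The call only states the limit of 100, the reporting of the total per topic, and the trust slider that goes towards one.

```python
from collections import defaultdict
from dataclasses import dataclass

# Limit stated in the call: top 100 events per topic per repository.
PREVIEW_LIMIT = 100


@dataclass(frozen=True)
class Event:
    """Hypothetical shape of an enrichment event (field names are assumptions)."""
    repository: str  # identifier of the repository the enrichment is offered to
    topic: str       # enrichment topic, e.g. "missing abstract" (illustrative label)
    record_id: str   # record in the repository that would be enriched
    trust: float     # confidence in [0, 1] assigned by the inference algorithm


def build_preview(events):
    """Phase 1: keep at most PREVIEW_LIMIT events per (repository, topic).

    Also returns the total number of events that could be built per key.
    Ranking by trust is an assumption; the call does not say how "top" is defined.
    """
    grouped = defaultdict(list)
    for event in events:
        grouped[(event.repository, event.topic)].append(event)

    preview, totals = {}, {}
    for key, group in grouped.items():
        ranked = sorted(group, key=lambda e: e.trust, reverse=True)
        preview[key] = ranked[:PREVIEW_LIMIT]
        totals[key] = len(group)
    return preview, totals


def build_notifications(events, subscriptions):
    """Phases 2 and 3: match the full set of events against subscriptions.

    Each subscription is a (repository, topic, min_trust) tuple, where
    min_trust plays the role of the trust slider in the dashboard.
    """
    notifications = []
    for repository, topic, min_trust in subscriptions:
        notifications.extend(
            e for e in events
            if e.repository == repository
            and e.topic == topic
            and e.trust >= min_trust
        )
    return notifications
```

A repository manager would first look at `preview` and `totals` to decide which topics are worth subscribing to; only subscribed topics are then expanded into the full list of notifications, which is what keeps the volume of stored events manageable.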