 Thank you very much, Gwen. And hello again from our side. Yes, welcome to our webinar. Open APC, Transparency of Open Access Publishing today. And as Gwen said, we have some rules. If you have questions, we have a Q&A section. So in the last couple of minutes of this webinar, we will answer some questions from your side. The speaker today is my colleague, Christoph Brzezinski. He is a member of the DFG Funding Project Intact and this starts in 2015. And Christoph shares his knowledge about APCs, Open APCs, Gold Open Access Publishing and he is an expert in the field of processing charges. So my name is Andreas Czerniak. I'm the project officer of Open AIR at Bielefeld University and responsible for the work package for Open AIR. Short words about Open AIR. Open AIR stands for Open Access Infrastructure for Research in Europe and was funded in 2009 from the European Commission and has around about 55 institutions in the European countries and beyond. Open AIR collaborates with projects like Open APC also ERC, e-infra-central and many, many more organizations. So I will take the floor over to Christoph and here's the link for your floor. Well, yes thank you very much Andreas and also for me a very warm welcome to this small webinar we are going to hold. A small warning in advance. I have been feeling a bit sick in the last week and well sorry in advance for any sneezings or cut things. I'm probably getting on your nerves within the future but I'm trying to mute them away as good as I can. All right, my part of this presentation is named Transfelsi of Publication Peace and the Open APC project and I'm going to talk about our part of the Open APC project. First for a little introduction, what is Open APC in a nutshell? Well, if you want to put it on one line, Open APC is an open data project and it starts with the collection, processing and dissemination of data on APC expenditures. I'm sure that most of you are already familiar with this but I think it can't be too bad to for short reminder on what APCs are. APC is an abbreviation for article processing charge and an APC is a one-time fee which is to be paid to a publisher if you want to have an article published in an open access scientific journal and this fee can either be mandatory in case it's a fully open access journal or it can be optional in case it's a hybrid open access journal. In the lettercase you can decide as an author if you want to pay an APC and if you do your article will be accessible as open access and if you don't the article will stay closed and it will only be accessible in a traditional way via journal subscription. And if we talk about how high is the average APC, well it's usually in the 1,000 to 3,000 euro range. The average value we caught in our Open APC dataset is slightly below 2,000 euros and this brings us right to the second questions, why are APCs important? Well I think you all know that we are right in the middle of a great transition process. The scientific publishing is slowly but certainly shifting to open access and that means at the same time that the payments which are done in this whole field, the payments from institutions and libraries to publishers, they are also transforming mostly from subscription fees to APC based models and this is why APCs are important because they are really big, big amounts of money at stake. I think some of you who have a library background will be familiar with how big the sums are which are changing hands year to year in subscription fees and these big amounts of money they are now transforming to APC based models. And what are the benefits of making these APC transactions accessible as open data, which is what we are doing in the Open APC project? There are two arguments there. The first one is some kind of a large scale argument, it's policy making because there's a lot of research and education policy going around this transition process. For example, you might have heard of the big deal project we are doing in Germany here right at the moment. It's a deal which the institutions are having negotiated with the publisher Wiley and all these things revolving around this transition transition transition process. They need a solid data background so monitoring and steering of this transition process requires data on these APC payments and there's also a kind of a small scale argument because in practice these APC amounts they are often negotiable which means if you as an author go ahead and have to pay an APC because you want to publish an open access article you can often go to the publisher and say oh well I really can't pay that much can't we just negotiate the APC amount and surprisingly this often works and as it is with all negotiations if you have information, information is power and it can really strengthen your position in this kind of publisher negotiations if you know what other institutions have paid for their APCs. All right now for some facts and numbers. As Andreas already said our project was established in mid-2014 here at Bielefeld University Library. Our first participants were German universities which took part in the open access publishing program of the DFG that's the Deutsche Forschungsgemeinschaft German Research Society and our project also was officially funded by the DFG from 2015 to 2018 under the name Intakt and since 2018 we are funded by the BMBF the Federal Ministry of Education and Research here in Germany under the project name COA and well what are our roots our basic principles? Our first and foremost principle is that we want to apply a maximum of transparency this this word open in our mind it should not only apply to the data but also to things like the code the workflow or the whole project history we see in in a moment how we are trying to achieve that. We also employ a high degree of automation for example we use lots of scripting and programming languages for the data enrichment for testing and for building and we also try to lay the entry barriers very low for new participants for example there's no such thing as a formal membership in open APC you just have to submit data to us and basically then you remember of our project. Let's have a look at the current state of open APC as of October 2019 our metadata collection now consists of almost 85,000 articles with associated APC costs they have been contributed by 247 institutions from 16 different countries and the total reported APC amount if you sum together all the articles present in our data set amounts to 160 million euros so you can easily see that it's correct what I said before there's really a lot a big amount of money at stake in all this scholarly publishing and this money is now transferring or transforming from subscription fees to APCs. Now to go a bit more into the details about our data format and our data processing all the data we have collected in open APC is stored in a very simple table format it's simple csv file comma separated value and what's interesting is that our data file consists of 80 metadata fields but only five of them are mandatory when you're reporting data to open APC that's because the remaining metadata is automatically imported from external sources let's have a more close look at that what we can see here this table um that's what our metadata format looks like and only this five fields on the left hand side we can see here in orange um these are the fields you have to report to as an institution if you want to take part in open APC um what we can see here is of course well the institution name that's Bamberg University in this case the period the APC in question was paid then of course the most important metadata is the dual amount that's the APC then uh also very important the DOI um every article you submit to us should have a DOI because a DOI is very important for the following metadata enrichment process and the last one is an identifier a boolean value is hybrid this indicates if the um if the journal this article was published in it's a hybrid or a fully open access journal and all the remaining metadata fields on the right hand side those are inferred automatically from external sources for example the blue fields we see here we import them from crossref we have the DOI for example the publisher name the the journal title several ISSN values and a license uh a license identifier then we have two values we are importing from your public central department ID and the public central ID then the ISSN L that's the so-called linking ISSN it's a general identifier issued also by the ISSN organization which links together so to say the first three ISSN values it's important for certain bibliographical analysis then we have the UT that's the web of science article identifier we are important obviously from the web of science and the last one is this DOIJ identifier it's also boolean value and it tells if the journal in question is listed in the DOIJ the directory of open access journals and this approach has a lot of advantages the first one is of course it's means that there's less work for participants you really have to submit only these five first fields to us and all the other ones you don't have to care about because we do that for you importing them from external sources the second advantage is that this process leads to unified publisher and journal designations because as you can see we import the publisher name and the journal full title from crossref so it means that those designations are unified if every institution would fill out these fields for themselves of course you would can easily see that we would import a lot of heterogeneity in the data and that's what we don't want so importing this from crossref is very advantageous and the third one is of course that corrections can be applied retroactively from time to time we'll repeat this enrichment process also for the articles we already have in our data set and if any corrections for example there's another isis n number for example because the first one was wrong in crossref then we will get these corrections automatically into our data set okay let's talk a bit more about our open apc infrastructure what's important to note is that we are using git for maintaining our data and we are publishing it on github some of you might be familiar with this git is a well a code revision system it's usually used for software development but as it turned out it's also very useful for maintaining an open data project we can take a look at our github repository what we can see here this is actually all the data we are currently maintaining an open apc and we are using this git repository to share it with other people and for maintaining this open data approach yeah we can have a look at it we have this this readme file this index front plate here as we can see there's a lot of participants already in open apc we're listed in here and we also have some tables and plots here about our data but what's really interesting is this folder structure here because that's really where the data is we have this data folder here and what we can see here is a folder for all our participating institutions and that's where you can find the csv tables they have submitted to us and what we can also see here i'm scrolling down here the apc de file that's our data collection so the apc de file that's where all our data is stored i'm showing it you in a text editor because if i would open it here it would take some moments because it is it at least it has reached the size of about i think 12 megabytes so easier to view it here well as you can see well csv file very easy file format and it's a really large file of almost 85 000 articles large large large okay and what's also very interesting about about git and github is that it automatically keeps a history every action which changes or submits new data is the so-called commit in git and we can see all the commits listed here so there were 4 000 almost 4 300 points of time where data was changed or added in our repository and what's really cool is that we can list all these commits here and we can also see what has happened in these commits so for example we can just let's click onto this one and now we see what was changed in this commit for example here this was a this was a small correction this door here was with an uppercase and in open apc we only want lowercase letters in the doors that's convention we apply so this was it was before in the red line the green line tells us how it was after the commit so and this commit they form a sort of a commit chain if we want we can go back really to the to the start of the project in 2014 when this repository was founded and you can see all the changes which were adequately traced them back and see what has happened over time in the open apc project so that's what i meant when i talked about that we are also applying the open data principle to our project history and what we also do we have this python directory the the programming language python and that's where we store all the scripts we use for example for testing or enriching our data it's all also open data in this case it's software so it's more called open source and that's what we mean when we are applying this open data principle also to our software and scripts all right then we are also operating our open apc project block we can also look at this one whenever an institution makes a new data submission to us we are preparing a blog post let's have a look at the last one this was virginia tech from the us reporting their 2017 and 2018 apc expenditures to us and well there's some some general information and for example the last commit by virginia tech they submitted 145 articles to us with a certain lump sum and average fee and you can see a table of all the publishers which were involved in this in this apc summary and some plots also a nice tree map and some plots about the average cost for the virginia tech for example that's a nice thing because it's it's um it disseminates better the the data about opposite participants what they're doing and and what data they are submitting to us and the last thing i want to show you is our interactive tree map graphics that's actually what perhaps open apc is most famous for because these tree map graphics they're often referenced the site we're operating is tree maps.interproject.org and we have small tree map visualizations first for all our participants but there's also an aggregated view that's what i'm opening now and this is actually an interactive tree map so we can use it to to view and to to so to so to say to delve into our data for example we can change the hierarchy at the moment we are looking at the institution branch so the tree map is is sorted by institutions but we can also change the view now we are looking at the publishers for example for Elsevier we have 32 a million euros in apc in our dataset for Springer nature we have the class for Wiley for for plus and so on and we can also change this for example to the the journal or we get a really a really colorful tree map that's the that's the distribution over all the different journals and as I said before we can really use this to to explore and to delve into the data because for example let's change back to the institutional view if we let's say we want to examine the University of Cambridge then we can just click on this on this Cambridge rectangle and then we go down another level and now we see how this lump sum for Cambridge distributes over all the journals over all the journals Cambridge has published and paid apc for and reported to us and we can do the same again for example we want to examine the journal nature communications then we click on to this rectangle and then we get down to the last level to the level of given of individual articles now we see all the articles the apc was paid for them and we can also if we want we can even click on this on this link here it's a doi link so we can even get to the article text there are some other options we can also filter the tree map for example we can filter it to certain periods or let's say we all we only want to observe a certain hybrid status for example only the journals published in open access all the articles published in open access journals for all the articles published in hybrid journals and we can also filter for example for different countries and if we have found a view we want to examine more deeply we can also download the current view we are looking at via this button we can download either as a csv or a jason file all right um now um becoming part of open apc because um for that at least the part what this webinar is about um we want to encourage you to also send data to us if you're working for example at a university or library which is also paying apcs so let's say your institution wants to participate you know maybe see what steps do we have to undertake well um the first one is of course you would have to prepare an additional data set according to our submission guidelines it's also defined here in our guitar wiki and it's important to note that this initial data set doesn't have to be large um we have participants which are only well contributing perhaps five articles per year and that's perfectly in order so you don't have to be Cambridge University and and submit hundreds and hundreds of apc apc transactions per year there's an option to create a small read me file which would provide some background information to you and your institution you can do that but you don't have to um you would have to designate a contact person at your institution just the name and the mail address so uh we know who to contact if there is anything going on with your data if there are any problems then you would mail everything to our project address open apc at univillified.de and well that's it and uh we would then go ahead check your data enrich it publish it in our open apc github and then you're basically a member of open apc if you're feeling a bit more adventurous or you have people which are more technically inclined at your institution you might also want to consider doing a pull request on github we won't go into this in any more detail and that's really a bit for the more for the tech people and there's another way of contributing which i want to mention here because um if i see this correctly in our audience there might be some people which are dealing with institutional repositories so this might be of some interest to you we have two participants which have done this already um they went ahead and modified the institutional repository so that the apc costs were directly attached as metadata to their article records um and then they went ahead and told us the relevant parameters and now we are going to them and harvest the metadata automatically in regular intervals via the oai pmh protocol and um that's very interesting because it effectively reverses the flow of communication it's not longer you which are sending mails with csv files to us but it's us coming to you and obtaining the data from your institutional repository so it lessens the burden on you so to say and uh well that's very convenient for both sides and um i want to show you a small example because uh Regensburg university the the uh aprons people they are they have already done this and um we can have a small look at the um oai response from uh from the epub server at Regensburg as we can see here um that's response format and um here that's the that's the important point of course they have attached the um the apc which was paid for this article in question directly to the metadata uh which is delivered by their institutional repository server all right but uh really it's only a small thing i wanted to show you um it's a very very advanced of course and um if you're just or every day participant you would just go ahead and send us an email with the attached csv file so um now we want to talk a bit of course about the benefits of taking part in open apc um because um we talked a lot about uh what you have to do if you um want to be participant but um you might of course go ahead and ask well what what are our benefits what what we we gain as an institution if we um participate and send you our apc transaction data um of course the first one i can offer is a warm cozy feeling of doing the right thing because it's an open data project and open data it's of course cool and the good side and so on and so on but um well i i can understand if you want something more than that so um the first benefit i can offer you is our testing routines because i already mentioned that um all data which is which is contributed to open apc is automatically checked by a test routine um for example we do um of course if we say we it's of course a program script we don't do this manually um we do consistency consistency tests um if two records share the same is as n they must also have identical publisher and journal names because same is and same journal um there are things like logical tests for example if a journal is listed in the doj then this uh this this hybrid metadata field must be false because it is known that the doj only lists fully open access journals um we also do some content checking we take the doi and check if it's intactively correct and if it does resolve which is of course very important for doi and we also do uh duplicate checks we check if a doi does a few more than once in our open apc data set um if it does it's an error because um that's per convention our open apc set data set must be completely duplicate free and um well if we encounter any errors during this testing um we go ahead and report them back to you as an institution and um this can be helpful in several ways for you for example if we find any malformed dois we will report this to you and then you can correct them because if we report malformed dois to us it's likely that they are also malformed in your institutional accounting system back and repository whatsoever because usually doys aren't touched anywhere in the in the data processing um if we encounter any non-resolving dois during our tests we also report them to you and then you would usually go ahead and uh report this to the publishers because um they are usually responsible for properly registering their dois with the uh with the agencies and um if we encounter any doi duplicates um this may point to actually quite serious data errors um if they appear within the same institution and sometimes we even have cases where these uh duplicates occur with um within different institutions and uh this might actually point to accounting errors or even to fraud and uh that's actually not great theory because we already had those cases um we had cases of publishers double charging so um they wrote two receipts for the same institution or um we had a case of an author applying for full apc funding at more than one institution and putting the rest of the money in its own pocket which was of course a case of fraud and these articles were these cases were discovered because of apc did this uh duplicate checking within their framework and um the second benefit for you is our system of report generation um you will receive an independent data report from us upon every contribution this is an automatically generated pdf document and um it is meant to discover pointers to possibly fraud apc amounts because um this uh first section here the testing that's made more on the more on hard errors you like to call it because um um for example malformed your eyes or wrong eyes' ends these are all uh hard metadata errors they are easy to spot and easy to correct but um uh looking at the apc amounts and telling if an apc amount is possibly wrong or unusually that's much more difficult and um what we do is that we try to apply statistical measures and we report back those apcs to you which diverge more than two standard deviations from the calculated means standard deviation uh you might know that it's a standard statistical measure to um well to to identify outliers in the data set or unusual amounts and um well this is so uh more the soft testing because um the results are not necessarily errors they're statistical outliers as I just said so um it's up to you to look at them and decide if there's really something wrong with the apc amount and um I can show you one of these reports I've just generated for University of Bielefeld this is what it looks like as I said it's a really easy apc PDF document uh there's some general data here some structural data and um what we can see here that's the important section this is where we list all the articles where we discovered um an outlier or um well where we discovered that the that the apc amount reported to us diverges more than two standard deviations from the mean um for example this is a article here um it was published in the journal cell reports by Elsevier and um well it was in 2018 here's the doi and Bielefeld reported an apc cost of 2100 euros and as you can see here the mv the mean value so that's the mean value for uh this journal cell reports where from our open apc data set is 4400 euros so um there's a difference of 2300 euros the University of Bielefeld played less for this articles for this article and of course that's a very interesting case and um I mentioned this right at the start of my presentation that um apcs are negotiable and we know this because we have actually talked to the author of this paper and this was really a case of negotiation because um here at Bielefeld University we have our institutional publication fund operating and this fund um has an upper limit of what it pays for individual apcs and our upper limit is 2000 euros and what this particularly author did he just when he just went got in contact with Elsevier and just said well I want to publish this article open access but uh 4400 euros that's just too much for me I can only pay 2000 because that's what our what our institutional fund will be pay for me well and some negotiation went ahead and well in the end they agreed to an apc amount of 2000 euros so that's actually a good example of how um of of auto margin there is with with apc amount there is really a margin for for negotiation and well there are other cases here of articles which are uh outliers or which are somewhat unusual but as I said um we cannot really say that these are errors these are just statistical outliers and we are just reporting them back to you as an institution so you may have a second look at them and well this overview here as you can see most of the articles of course um are not deviating much they are not significant in statistical terms with Bielefeld we had 491 articles which were um not deviating much and we had only significantly deviating six articles so uh this report is something you as a document you will get also on every data contribution to open apc all right um we are nearing the end of my presentation now I want to talk shortly about some applications because um open apc is uh maintaining an open data set and with all data um data gets stronger the more you combine it with more data and um there are also already some project where the open apc data set could be used by combining it with other data for example we had a very interesting um analysis here at Bielefeld University we are also operating the national open access contact point here for Germany and the people at the contact point they conducted an analysis by combining the open apc data set with a data set on journal impact metrics the so-called SNIP or slip factor and what I wanted to examine is if there is a relation between impact factors and apc amounts and you might guess there is um journal with a higher impact factor which are more prestigious um they also live by higher apcs who would have guessed that um we also combined um our open apc data set with um publisher provided data in that case it was the open access publisher uh frontiers uh with that we could measure the effectiveness of the uh mentioned dfg publication funds by uh by comparing the publisher data with the reported data so we could see um what share of articles um which are funded by this publication funds um is reported by the by the uh by the publisher itself and we also conducted some biomedical analysis by combining data we obtained from the Springer web portal Springer link to estimate the effects of certain transitional agreements um I'm not going to more detail here if you're interested if you're interested in that um you might have a look at these slides after this presentation and then you can click the links all for yourself so um yeah I think that's uh for my part now Andreas will go ahead and I think he will demonstrate um another uh kind of um open apc data combination um a combination between open air and open apc data so now I am not summary that's for later I am giving the mic to back to Andreas very much Chris off um yeah and your great overview about the open apc project and the benefits of the institutions of the open apc project so um I will talk about uh another possible reuse or use of these data in open air um open air has a lot of funder informations and funding informations so in the first case we integrate these open apc data set um in open air and found in coverage of 98% um based on the DOIs uh from the open apc data set in the open air graph research graph so um there's an average of less of 2000 euros and as Chris said more than 160 million euros in open apc data set we have taken a look into the open apc data set and found um and related to the funders and here you can show uh you can see the major funders that we found um in our data set it's the Velcro trust has uh more than 10,000 articles and an average of 2500 euros and so on this can also be culminated to the funder programs um here's the three major programs um the fp7 uh was actually more than 5000 articles we found an average of more than 2000 euros in this case so there are many many more relations between the open apc data sets and the open air research graph and um we actually integrate the open apc data sets uh on the release dates from github into the open air and research graph and we'll present these information in our community dashboards or in the explore dashboards you will see uh you can see um the open air portal these information was also present in the development um ap api for um integrating these informations into your applications so you can go to api.openair.u and um we will also present the apc or the processing charges in this case um at on the level of the programming interface as uh because of mentioned so we also suggest that repository developers include these apcs or bpcs these processing charges into the repository software so it could be how was it automatically via our opm edge interface and this is uh very nice to have um use case at the moment but um it could be possible to have this in the future so I would like to come to the end of this webinar um we have a question during the webinar I think this was before uh Christoph come to the benefits of the open apc project and we have another question here um Thomas um how do you register apcs with or without fat um okay I think I can step in here and because I think all these questions are more or less directed to me um and we ask the new yeah thank you okay um well I think the first question was um what are the main reasons institutions refuse to join your initiative or provide the data um well um what I can say from my point of view is that um this might come as a surprise but most often it's not for political reasons um you might think that institutions would be uh hesitating to um publish apcs because it's their financial data it's from their from their wholly internal uh workings and accounting systems but uh actually that's that's not the case often um it's uh more often that um institutions uh want to participate and contribute data but um they aren't able to and uh the reason is that for um compiling these um open apc tables for us um they have to so to say they have to um bridge a gap between between two different worlds and the two different worlds I'm talking to here about here is um the world of uh of of libraries and of of metadata that's world of your eyes and on uh publisher titles and journal titles and eyes is ends you know on the other hand it's the world of uh of financial and institutional accounting where the um where the invoices for example they receive from the publishers they are stored in for example in accounting systems and uh bridging the gap between these two worlds and they have to do this to to bring them together because they have to assign a doi on the other hand to an apc amount on the other hand and that's that's actually the the um the most important obstacle um from our point of view the institutions are many institutions are really willing to contribute to open apc but um it's this this gap that uh that holds them up um bringing together apcs uh and bibliographical data that's the that's really um that's the obstacle which is hindering them um okay I just uh I hope this uh answers your question then we have a question from Thomas Messerschmitt hello I'd like to ask uh how do you register apcs with or without uh vat uh yeah very good question um in general in open apc we are interesting in the final sum um this means that we want apcs with uh vat where you add a text and also with any discounts for example so for example there were institutions which are receiving discounts from certain publishers because they are uh or because they are um they're operating a membership for example with uh with research with certain research research societies and we also like to that these kind of discounts include into the final price I'm just someone else is sharing um could you give me back the screen perhaps then I can show you thank you um I can show you our data submission and how I manage I showed you briefly before um we also have a section on definition of costs that's actually how we liked this um the sum in the in the euro metadata field how we would like it to have calculated and as you can see here um it really starts with all report publication fees across values modifiers like taxes or discounts should be included into the amount and um I said that it's a good question because um the initial definition what is an apc um as it turned out during the the course of our project um it's not so clear as it might be at first glance because um if you look at the other bullet points here um for example what is uh the matter with with uh page or color charges because even in the age of mostly electronic publishing there are still uh publishers uh which um lately uh levi page and color charges because uh well I don't know because colorful electrons are more expensive than gray electrons or something like that and uh what about for example corrigenda when you uh add um when you have to make a correction or addition to your article then uh there's also a fee to pay um is this fee part of the initial of the uh initial apc we say it isn't because um the apc is it's not part really of the apc it's it's an it's an addition which comes later and um you really have to have a look at this definition of costs if you um provide data to us because uh that's actually the that's the most important thing that we um get a common definition of apc in our dataset because otherwise it wouldn't be amount to be indeed hard to uh they would be hard to compare um okay then we have another question do you include or collect information on the special agreement institutions have with publishers it's for example like mbti um no we don't do that well the the short answer is that we um do not want our our apc dataset to be cluttered with metadata fields so to say um of course you could argue that we should uh introduce additional fields to well to to mark certain circumstances this article was published was published under for example uh as you say in your question because there was a special agreement with mbti with which um gives a certain discount to the institution for example and um but we decided against it because we want to keep our our metadata compatible over the course of time and uh we do not want to introduce fields which would be uh which would have no value for most of the entries so um if there is um let's say an agreement with mbti um we encourage our institutions to um give information about the circumstances in the read me file um i mentioned this briefly in my presentations institutions can add a read me file to their institutional folder um i think we can have a look at that um let's see if i find the institution where that's the case um i think dresden university is doing that yes um well yeah as you can see here um dresden university they have uh an agreement with mbti and they are giving some information on that in this read me file which is attached to their institutional data directory so um it's not a really gold solution we are aware of that because uh this kind of information is is free text so it's not uh machine processable but um i think it's still preferable to really cluttering our metadata set with lots of additional fields uh dealing with certain circumstances which can occur when paying apc's um okay i hope this answers your question um yeah i think i don't see any more additional questions here then i'll shortly come back to our final summary as a presentation there it is so our summary um yeah we saw that uh how open apc works we are an open data project which collects and disseminates article-based apc data and i also showed how transparency is a really very very important point in our daily working in open apc we apply this transfer transparency principle not only to the data and the apc's serve but also to our enrichment to our history and to our software processing scripts and uh i also showed how you can contribute to open apc it's really very easy there's no red tape no complicated data formats no uh submission forms or something like that you just submit our data to us we are via an email and then attach this v file and that's about it and there's also no need for large data sets more is better of course but every bit helps if your institution uh doesn't publish that much it'll say only five or ten articles per year that's perfectly acceptable and um final hint i want to give um we talked a bit about this uh about this complicated process of of bridging the worlds between accounting and and library metadata and um there might be a useful hint if your institution operates an open access publishing fund or something like that um there are many universities and libraries doing that now at times um then it's likely that they already have the data we could use because um this publishing funds they usually usually they have an excel or csv file already floating around somewhere where the data is in the form we needed a list of articles and article identifiers do is linked to paid apc amounts um publishing funds they usually have the data so if your institution if there's an open access publishing fund operating it's at your institution um that might be a good starting point they might really have the data we could use all right um i think this concludes it wow 12 o'clock really great um if you're interested interested in our presentation you can find it at this bit.ly link here um it's uh in a form of an x8 xhtml file so um you can also go ahead and click all the links in this presentation if you want to uh we read them or i'm interested in interested in more details all right um yeah i think that's it andrius we want to say some thank you very much christoph so and thank you for attending this webinar we hope this was helpful for you and your work for your daily work um yeah because of said the presentation are as soon as possible available and if you have any further questions please contact us um at our email address and um yes thank you very much okay thank you very much andrius and christoph and thank you very much for being so good on your on your time this is a very very impressive um i will add a link to the presentation and um a link to the recordings to the open air webinar page um so just be patient for a little bit and then you will see it appear there so thank you very much everybody for attending uh i hope we'll see you again soon at one of our other webinars uh and uh you can expect one more email from me with an evaluation form but that will also be probably uh early next week so thank you very much