 Hello, everyone. Thank you ever so much. I can see a couple of more people join. Thank you so much for joining us this morning for our depositing large scale data with the UK data service. I'm Christina. I am the data collections development manager at the UK data service. So I overlook the acquisitions of all the different types of data we make available via UK data service. We do take a very proactive approach. So for example, if a critical number of users do get in touch with a specific survey, we will try our best to get in contact with the data owner to make available that study, of course, as long as legally and ethically that is possible. But I also overlook the research data management training that we provide as part of the UK data service. So it's Monday morning and let's start with a quick icebreaker. So in your opinion, what's the best take invention of the 21st century? We can see e-readers indeed. We forgot about books now. We do use quite a lot of e-readers. Streaming, yes. Streaming comes in very handy. Mobile operating systems. Yes, if we remember the Nokia 3310, I think it was. It has a calculator on it. It was working from a mobile phone perspective, but now we can do our emails. We can do Excel. We can do even remote desktop in our own phones. So the next question is, e-readers, we got another. Smartphones, yes. Smartphones indeed. They do come in very handy. But the second question, and I'm going to present this now, in a few words, why do you think archiving data is important? It can come from your own perspective, from your organization perspective. Why do you think archiving data, making data available, is important? Data sharing, yes. We know now a lot of different funding bodies are actually mandating data sharing. ESRC, we're the very first UKRI council in the UK to mandate data sharing. We are funded by the ESRC as well. So other people can make use of it, yes. That's one of the main aims. The data lifecycle is so big in collecting all this data, thus coming handy for others. Reproducibility, openness and transparency in research, increased data reuse, avoid duplication of research and manage publication bias. Provides long-term value to data and enables researchers to further increase the value of this data. Lovely examples, lovely answers. We are here at the UK data service to enable data sharing to help you as data owners to make your data available in a nice and understandable way for the secondary data users as well. So now I can go back to my presentation from the current slide. So as I was saying, the data itself has a much larger lifespan than the project that has created it. And we can see this on a daily basis at UK data service. We host data from government organization, researchers, academia, and so many data users, secondary data users, come and download the data from us. And they end up creating this amazing, beautiful project just using secondary data that they can access. It can be reused for a variety of different purposes. And we have seen lately there is a drive into especially linking current survey data with admin data, health data, etc. Now, when we're talking about who is benefiting from data sharing, as you said in the icebreaker, yes, the researchers clearly benefit from it, even from a reproducibility perspective, data citation perspective. But also the public because transparency, we open up the data, the funding bodies as well because they end up spending public money for public good. So by sharing that data, again, we are more transparent in what we do. But also the research participants themselves are actually benefiting a lot, especially if we're talking about data that has been collected from specific populations that are harder to reach. It's really great to actually provide that data. And again, as long as it's ethical and share legally possible. Now when it comes to benefits to depositing with the UK data service, the first one we get quite a lot of questions, how much does it cost to deposit with us. There are no costs whatsoever. Everyone that creates social science and population data is most welcome to deposit data with us. However, I do like to make a note here while there are no costs directly associated with the data, there are costs associated with preparing the data for deposit in the documentation and we're going to talk about that a bit later on. We do provide a fully online platform for creating, uploading and managing the deposits. And today in the webinar, I'm going to present the platform for the curated repository, we host the UK data service. We're going to see you can find the license via the platform and we do provide very robust access policy with a three tier licensing framework, which again, I'm going to talk a bit more later on. And just a note here, licensing can be quite difficult, especially if it's the very first time you're depositing. And again, we always welcome anyone getting in touch with us if you have any queries or concerns about the data. We're not quite sure what access level would be most feasible. We can even help with assessing the disclosure risk in the data and seeing what would be the best access level you can share the data. We do provide data management guidance and support. We are here to ensure that all the data that is shared is actually done in an ethical and legal way. We do provide control access to sensitive and confidential data. You might have heard about the five states framework, which is our nationally used. It was first developed by us at UK data service and the Office for National Statistics in 2017, but now more trusted research environments are adopting the five state framework in order to provide access to the identified data. So again, the identified under UK GDPR is considered personal data compared to the anonymized one, which is not considered personal data anymore. We do provide resource discovery and we help data creators showcase their impact with the data that they've created, all the data sets that are all the data collections. It could be called data as well. They do get a data site DIY, which is a persistent identifier. So anyone using your collection for secondary use would have to cite that DIY, increasing the visibility of your data assets. We do promote all the deposits. We are part of the consortium of data archives, the European consortium of data archives, and we actually feed into their data catalog. So everything that is promoted via our own data service catalog, we promote via the CESDA data catalog as well. We do have newsletters. Some of you might already be subscribed to it. We also have partner archives and research organizations that do promote our data assets. We can also provide the identified study level metrics for your data sets and data collections. So it's for those that are made available under safeguarded or controlled access. Now, at the UK Data Service, we have a collections development policy in place and an appraisal process to ensure that all the new data offers are appraised accordingly. And our decision is based on what you will deposit. We can either accept the data into the curated repository, accept the data in the self-deposit repository richer, or advise you to use an institutional repository or alternative place of deposit. If very rarely, if we advise you to use an institutional repository or another responsible repository is because we think the user community of that specific repository would make much more use of the data than our user community at the UK Data Service would. For the curated repository, here we're talking about large-scale data. And when I'm saying large-scale data, yes, we're talking about thousands of people taking part in the survey or in the QAL collection, but also sometimes you might have a population that is quite small. So it might be just three, four hundred people. Maybe that suffer of a specific condition that is very rare that would still fit the curated repository. Now, how do you deposit in the curated repository? You do need to have an account and to bear in mind if these are institutional accounts, so individual accounts, not institutional accounts. Everyone that registered needs to sign our End User License Agreement, which is on an individual basis. So while you might have other colleagues based at your institution that already deposited data or wish to deposit data in the future, you would have to create your own account. I'll do a bit of a demo as well because with the online account, you can offer the data that you want to offer by completing a short offer form. The offer form contains minimal metadata for us in order to make a decision whether should we accept the data in the curated repository, is it better suited for vShare or is it better suited for another repository? To bear in mind is sometimes we might get in touch if there is not enough information or we have any queries about specific things in the offer form. We might get in touch before conducting the appraisal. You will be able to edit the metadata and submit the deposit form as well as sign the license electronically. We have gone full electronic scene September 2018. There is no paperwork to complete and everything can be done online. You will also be able to track the process of your data deposit from your account and let's share now our quick demo. I can see Q&As are coming. Yes, please put any Q&As that you might have any questions and I will get to them at the end of the session. I hope everyone can see now the UK Data Service login webpage. Again, you will have to have an account logged in with us in order to offer the data. You might see here the link is a bit different. I am doing all the testing on staging, but staging is exactly as the live version would be. So in order to deposit the data, we will first create an offer form. So we do an offer form here. Very important with the title, it needs to reflect what your collection is about. For example, survey of people on low income 2021. Ideally at the end, we put the year as well. We do tweak the title for our curation team, tweak the title in order to make it more describing if that is needed. In the abstract, and here is very important, please do provide information about the background of the survey or of the QAL collection of the data collection that you have gathered. Why did you gather the data? What were the main aims of the project? Did you accomplish the main aims? What are the main topics in your collection? What does exactly cover? Please do note if there is any funding. So for example, funding might come from ESRC or MRC. It's very important, especially for ESRC ground numbers to mention the ground number here. Because we, at our end, we signed the grants off as compliant. But to bear in mind is if you have a smaller data collection, let's say a couple of interviews, your data might be better suited for our Visha repository. And we're going to guide you to deposit via Visha rather than the curated one. As I specified, we do have the five safe framework in place to provide access to data, which is considered personal under data protection legislation. If that is the case, we do ask data owners to mention here that the data is personal data. It has not been anonymized. It has only been de-identified and to mention whether the person depositing the data is the data controller or authorised on behalf of the data controller. Let's say our survey is not controlled access. Under contacts, and I can't stress this enough, it's very, very important to note who is the data creator. Because most of the cases, the data depositor is not the data creator. It's just an individual that can create all the offer form and deposit form and send us the data, but it's not actually the data creator. The data creator can be an individual or an organisation. It depends how the project has been set up, especially for the large scale surveys and data collections. It's usually more an organisation rather than an individual, but it can happen. But please do note all the data creators in the contact information. The data collector as well. So for example, if you've hired any research agency that conducts collection of data, please do mention who the data collector is. Are there any contributors? And if you have any partner projects, any projects in the project, any partners that have helped, you can name them as contributors for acknowledgement purposes. Now, the geographical coverage, we gather information in the offer form about the countries. We do get quite a lot of data that covers multiple countries. We do get quite a lot of data that covers only the United Kingdom. We will see here, this is something we call a controlled vocabulary. If we start typing, I will start typing another country because that gives call chister, as this is where I am based, Lumbia for example, and we select it. You can add as many as you need. You can remove them. The same applies for regions and if you include towns and villages as well. Please note when we're talking about towns and villages to bear in mind is our data de-identified rather than anonymized. Is there any other geography covered in your data collection? And the following one, the population information. Again, this is very important when it comes to the appraisal process that our collections development team does, because we need to have information about the population your sample has been drawn from. It can be, for example, all 16 to 64 aged individual based in private households in the UK. If you want to mention exactly the achieved sample as well, that's better, but at least the very basic information about the population of your data collection is needed. Now we're getting into consent and confidentiality, two very important fields to bear in mind. His consent is being given for the sharing of the data to take place and is the data anonymized. Under anonymization, we do ask all the data owners to specify if they have any concerns, especially with the UK GDPR, we have the special category data. Is special category data included? What is the level of anonymization? Have you done top and bottom coding, banding, etc. So as much information as you can include in this section, the better. If you want to include some references and publications or a lot of data owners might have already published their technical report or even a user guide, you can provide the URL or the DIY of that specific resource here. And at the appraisal stage, we actually verify the information there if information in the offer form is not sufficient. Again, we might have to come back if we have any additional questions. Under file description, ideally for the description of the data files, it would be great to specify, for example, two data files of 1,500 variables each or 50 transcripts of approximate 20 to 30 pages each. The more details, the better. Now the documentation files as well are very important. And in the presentation today, I am going a bit on what documentation we are expecting when we receive data from data owners. So please include, for example, if we're talking about one data, you might say, well, I will be able to upload the questionnaire, a data dictionary. I have a tech report as well. If we're talking about quality, oh, I have the topic guides that could be of help to secondary data users. Now, once we have completed this, we're happy with it. Do take a look over again. You might need help from different colleagues as well to complete the form. So it might be best to complete it in a Word document and then put it online. That is totally fine. But once we're happy with it, we can save the form at any given time. Save the form, go talk with a colleague, come back and edit it. Once we're happy, we can submit the form. And we'll see now. So under here, we have under deposit or appraisal survey of people of low income 2021. So it's with our team for review and we will be in touch in due course. We also see that we get quite a lot of automated emails and instead of showing one that I've got from the testing, I will show one here. I'm just making the text a bit bigger. You will receive the moment you press submit, you will receive an automated email that you will know that we've received your offer and we aim to conduct all appraisals within the five working days. If there any queries or concerns with the offer form, please do get in touch. You can get in touch directly by replying to this email. It comes to the shared mailbox. So please do feel free to get in touch by replying to collections at UK data service. Now we'll go back to the presentation. As I said, it's a lot of back and forth, but just to demonstrate everything as clearly as possible. So we've seen the offer form. We've seen how we completed. Please do make sure to offer as much information as possible, but bearing in mind this is the minimal metadata that we need to make the appraisal process happen. Now, what is this appraisal process I keep talking about? We do have a data appraisal group that consists of people from the collections development team, from the curation team, from the user support team with different expertise. And when we do need to refer different offers, a lot of different people do come to help with that specific appraisal of the data. What do we look when we get an offer form? What is the strategic value of that specific data collection? Is there potentially secondary use and analysis for research? Can the data be used for teaching and learning purposes? Would it help with replication and validation of existing research? And to bear in mind the validation and replication of existing research, sometimes we do need that the identified data to be able to provide the replication. Is it relevant to the UK data strategy, UK data service strategy forum? So what type of data is there deposited? So we have longitudinal data addressing different challenges such as linked data, data about organization, cross-sectional data. We look at whether the data owner has a mandate to share with the UK data service. And as mentioned with the ESRC funded data sets, the data must be deposited in the responsible repository, which can be UK data service, the curated or the reshare depending on the type of data. But it can also be deposited in the institutional repository, but in order for us to sign off that ground market as compliant, we would need metadata records then available via the reshare platform. We do have a separate webinar about reshare, and I welcome all of you to attend that. It's recorded as well, so you can see it at your own pace, you don't need to attend a new one. But any questions about reshare can be addressed to our reshare team. We do look, is the data useful for supporting decision making and policy formation? So again, the curation repository at the UK data service is about population representative data, large scale data. Is it useful for international and longitudinal research? Is the micro data otherwise not available to the research community? Either it's more explosive and it could have been made available via another repository, or we do look if that specific data set has been published with another responsible repository. We are very keen on not duplicating persistent identifiers, so we wouldn't want to create another persistent identifier when there's already one available. Of course, is the data unique? Is it already related to what we have in the collection? And you might have seen in our catalog, we have a lot of key theories. The annual population survey, the labour force survey, crime survey for England and Wales, et cetera, and every year a new survey appears, so that's when that might be ticked. We do make sure that the data can be made available, as I said, legally and ethically, and they do not prohibit the use. There are a number of, I wouldn't say very common cases, but it happens when the participant information sheet or the consent form have precluded data sharing. So we're always in touch with the data owners to discuss those aspects. Here's enough information about the documentation being provided. So do we know we're going to get a PDF user guide and questionnaire or topics guide, et cetera. Can we convert them very easily? And we're going to talk about what our curation team does when it comes to converting data. Does the data exceed five gigs? And you might wonder, it's 2021, why is five gigs so important? It's actually downloading via the web browser. So for example, if you save guarded data, if we download via Chrome, sometimes if it's more than five gigs, it becomes unstable. So we try to take all the different approaches there, maybe bundling the different types of data. You might have a collection with quant and qual and having the quant separate from the qual so that researchers can download it quite easily from their web browser. Is the third party reliable and is it likely for them to change the data throughout time? That's not a problem. And at the very end of the webinar today, I'm going to cover new editions. So you've already published a survey. What happens if there changes to it? And it is very straightforward. Now, once we have accepted your offer in the curated repository, you will get a nice email informing you that we have completed our appraisal. We are very happy to let you know that the data has been accepted into the curated repository. And we kindly ask the data owners to review the license agreement and get in touch with us at collections in regards to the access conditions of that specific data collection. To bear in mind is we can't accept any data files before having the license agreement in place. The license agreement is the one that enables us to look at the data, process the data, make the data available. Now, because we're talking about the license and the access framework, just a quick overview. We are very keen on ensuring that proper access safeguards are put depending on what is in the data. So we provide the options of open, safeguarded and controlled data. When we're talking about open data, there's no real disclosure risk in that data. Most of the time we're talking about aggregate data. And we provide all the Creative Commons licenses, why it says almost no restrictions on reuse. We do provide, for example, Creative Commons non-derivatives as well. If it would be a problem if derivatives would be created. However, when it comes to open, we do encourage data owners to think of a very less restrictive license such as Creative Commons attribution. Depending it might be non-commercial purposes if the funded project required the data not to be used for commercial purposes. Now the safeguarded data is available to all our registered users. So again, users that have to sign our end user license agreement, clearly stating they will maintain the confidentiality if they find any problems, they will be in touch with us as soon as possible. And here we're talking about data that has this zero to low disclosure risk. So we're talking about data that makes available the occupation of the individual, their age, their location as in their government office region. And while putting all these elements together, there might be a residual disclosure risk in there. So we're taking the safeguarded approach in order to protect our participants. The safeguarded option also provides the option for any data owners to apply additional conditions to their specific data collection. And we've seen in the email it did say, if you have any additional conditions, please do get in touch with us and mention it when you mention about the access levels. These additional conditions could range from, for example, you might want to be informed 10 days before a publication that a secondary user is conducting or it might even be depositor permission. For specific studies where there might be a bit more sensitivity, we do have the depositor permission only option which the data owner has to complete the depositor permission. It's a very short form, a couple of pages where they actually specify why they want to use the data and then the form is sent to the data owner. We can't approve it unless we have the data owner approval and only once approved the data owner, the data user will be able to download the data via the same interface. We just tick a box in our backend and they can download the data. Now when we're talking about control data, this is well the real disclosure risk lies. We're talking about data that has been de-identified, so we have a lot of granular information. We might have full-digit post-codes, for example, socio-economic classification for digits, so very detailed as well. And by putting all that information together, the risk of identifying someone is quite high. Because the risk of re-identification is quite high, we made this data available in our secure lab, the UK Data Service Secure Lab. It's a secure research environment which is accessed only by approved researchers, so they will need to complete an approved researcher application. They will need to complete a project approval, which needs to be approved by the data owner. They will need to provide us information about their tech setup. We only allow certain IP addresses. They would basically need their IP address to be listed, white listed, for them to gain access. They do need to have training and pass actually the training to be able to access the data and even their outputs are actually checked for secondary disclosure. The threshold of the UK Data Service is 10, so nothing below 10 can be taken out of the secure lab environment. However, if a data owner considers maybe eight, it's feasible. Again, that is a discussion with the collections development team saying within thresholds of eight are acceptable for our data, and we can then set up a meeting to discuss this further. The deposit for. Okay, so we have been invited to submit the deposit form and get in touch with the collections development team. I will have to share again the screens. I cannot access the staging on my own personal machine. So that's why I keep going back into my remote desktop for work. What I have done yesterday, I have created a test offer form, which is at the edit and submit the deposit form. Please bear in mind while you are doing this. We're under the assumption you have already contacted the collections development team with information about the license. We're going to show the license in just a minute. We make the license available for you for signature. Once you submit this form, if the license is not in place, please do not send the data because we're unable to pick it up until the license is signed. If the license is signed, everything can be sent to us. The difference between the offer and the deposit form is the deposit form contains all the information that will be available to data users in the data catalog. So it's a lot more information than in the offer form. We separated the main topics in here because you might want to provide further information. Sometimes even further information about specific variables which might have not been included in a specific access category. Everything that you've entered before will be copied to this so you don't need to edit that information unless, for example, you wish to do so because you might have forgotten to add something that's totally fine. You can edit it but everything has moved across. We will see under coverage. We now have to specify the spatial unit. Do you have something like government, office region in the subject category? What does your data cover exactly? Again, this is what I've said we call a controlled vocabulary and it offers different options. You can tick as many applicable as they are. We usually advise not to tick more than six. What are the dates of the fieldwork and what is the time period that the collection covers? Sometimes they're different. If they are the same, please do just complete the fieldwork. And we understand that the time period is the same as the fieldwork time period. So that's not a problem. We do have an additional section here about methodology where we clearly ask what is the number of variables. This is very important for a consistency perspective. For example, if you mentioned there are 300 variables and we get a file with 1,300 variables, we will come back asking if you send the files which we shouldn't have got or is there something wrong in the deposit form and we can change this information for you. That is not a problem. How many number of units? So we hear talking about the achieved sample that you have. So while the population was all 16 to 64, your number of units were, I don't know, 25,000 individuals, for example. What was the observation unit? So we've seen we can have data about the organization, about people, about families. Again, a controlled vocabulary. For example, we have individual. Where was the individual in our case located? Is this national? We're talking about UK. FAPNational, maybe we are talking about a survey only done in Essex or is it other and you can provide information about what other means. And of course, we have cross national for the data that covers different countries as well. What were the methods of data collection? Again, we're talking about a controlled vocabulary. You just need to select how the data was collected. You can select as many as necessary. There are quite a lot of different data collections that have different types of method of data collection. So you might have a questionnaire, but you might have clinical measures in it as well. What was the sampling procedure? Was it multi-stage stratified? Is it the total universe? If it is other, you can specify other in here, other sampling procedure. And then details about how you've actually done the sampling for the collection. Is there a weighting variable or variables in your data collection? Again, curated because it's all about large-scale population representative data. Most of the time, yes, the weighting was used and the documentation covers how the weighting has been created, how people can apply it, etc. If you have the total universe, it might not be applicable. Or there might be no weighting variable in the data. And you can get in touch with us saying we might want to add the weighting variable at the later stage as well. It does happen. Or you might want to simply update the weighting variable as time passes by. Again, you can complete more references and publications if you want. If you have already completed something, you would have information here. So you just need to add other extras that you might have. We will see. Under the file description, we have a couple of more boxes that appeared. If you have provided enough information about the data files and the documentation files, don't need to worry about that. However, if, for example, you have done digitization, less, I would say, common now to happen. But if you have done transcription, we do ask data owners to provide information about how they transcribe the data. Was it done in-house? Was it done by another institution? There are quite a lot of data owners that hire professional transcribers to do. And just a couple of information. If you have a transcription plan that you have used, please always deposit that as documentation. It always comes with handy. Now, you might have also used actually different data sources to create your own data. To bear in mind here, and please mention as many as you have used. And if you have any queries, I can't stress enough, do get in touch with us. Because while data might be available via different websites, that doesn't mean you can create different things without the permission from the original data owner. So we always check when it comes to data, is permission in place if we're talking about derived data from an existing collection. If the original data has been published under, let's say, creative commons by just the attribution, the very standard, very low creative commons license, that is not a problem. But most of the times we see data created from collections that are published under stricter access conditions. And of course, we do ask for the source location in the access level for all of them. However, best to get in touch as soon as possible, if you are creating data from other sources, just to double check, can the data be shared? Or do I need permission from the original data owner before being able to share the data? Once we have completed everything, we can submit the form. It comes to us in the Collections Development Team. You will get an automated email thanking you for submitting the form, informing you that if you have signed the license, please submit the materials now. When it comes to licenses, again, if everything is done by your account, you can find the licenses under the Licenses tab just in here. And we can see the one for my test offer form. I think you can see that I've received the automated email that the deposit form has been submitted successfully. When it comes to the license, please do read the license. We're not going to go through all the different clauses here. It's a short license. We've tried to make it as human-readable as possible, but if there are any queries or concerns, again, please do get in touch. What's most important, under further condition, we clearly specify in human-readable language how we'll make the data available as you have instructed us to do. For example, the data collection is available to users registered with the UK Data Service and commercial use of the data requires approval from the data owner. This is our standard safeguarded access. Most of our collections are available under this standard safeguarded access level. So it will reflect exactly what you've told us in the email to collection. Again, we can easily sign the license via our account and to bear in mind, as the data depositor, you might not have the copyright in the data, but you might be authorized to sign the license. This happens, I would say, probably 80% in the deposit cases is that the data depositor has been authorized on behalf of the data owner to sign the license and to deposit the data with the UK Data Service. So in this case, you would think not the owner of the copyright, but I am authorized to grant the license. If you are the owner of the copyright, that's fine. Again, not a problem, but please make sure you read the declaration, take the books that applies. If you have any queries, for example, you're unsure, can I sign the license? Again, do get in touch. It's usually a discussion that you and your institution need to have. Can I sign this license? You can also download the license as a PDF and pass it by your colleagues to double check if everything is fine and they're happy for the license to be signed by you. If the license needs to be signed by one of your colleagues, that is not a problem. They will need to have an active account with us, but as long as they have the active account with us, we just assign the license to them and then they can sign the license. Just by clicking here, sign license, the license has been signed. Now, we can go back to our presentation from the current slide. We've signed the license, we've seen the license declaration. I can already hear the email, the automated email I received saying thank you for signing the license. You can now submit the materials. I've included a couple of slides and I hope this will come in handy at your own pace. Throughout the data lifecycle, it's best to check how am I going to make the data available, what documentation am I making available, and throughout the project to make sure that you're making this possible because otherwise, before the deposit, you will need quite a lot of time to prepare everything. So we start by using consistent and meaningful file names. If we have quant data, we have self-explanatory variable names which are actually nicely labeled. It doesn't necessarily need to be SPSS or STATA that come with embedded metadata. It can even be a separate CSV file that describes the variables, but we need these descriptions very nicely available. So to make sure that there's consistency across the file, remove any temporary variables, remove any repetitions of variables. Sometimes it happens, you just copy a variable to create your own derived, just making sure that the file that you're depositing is the final version. Providing an anonymized primary sampling unit information is available, making sure the level of anonymization is appropriate in getting in touch with us if you have any queries about that. We can always, always help, include weights. If available, make sure the weights are documented in the documentation. Use our recommended transcription format for all the qual files. If you yourself in-house are converting the data from, let's say, STATA to SPSS or CSV, et cetera, do make sure that there's no data loss there. It rarely happens, but it can happen. Ideally, you can just deposit one format, be it SPSS, STATA, CSV, whatever you have and whatever you use in-house, and then we do the creation of the different formats. Again, if you use data sources, make sure that copyright permission is in place where applicable. If you have any queries regarding that, we do get in touch with you and we might require a short meeting to discuss. Finally, do try to make sure that the data you are depositing is in the final version. If it's not in the final version, that's not a problem. It happens every now and then. We think it's a final version, but oh, we forgot to include this. We can create a new edition or ask our curation team to stop with the processing until you send us the revised files. Documentation. So when we're talking about quant data, I've already discussed a bit in the offer form. We are expecting things like the technical report, information leaflet and consent forms, if any. Questionnaires with the universe and routing instructions. If you show cards, please send us the show cards, coding frames and coding instructions in your specific data files. If you're talking about quality data, please do submit the participant information sheet and blank copies of the consent forms used. We just want to ensure that it is okay to share that data. Copy guides this come extremely handy. If the data is made available under safeguarded access data, the documentation that data comes with will always be available as open. So others can see Oh, this collection is about XYZ. It will actually come of help. So I'm going to deposit the collection. Also a data list and we do provide a template, which is more of a read me file for a quality collection serving as a finding aid. Submitting materials is very straightforward by the University of a six into service. If your organization has a preferred method. Again, please do get in touch that is not a problem. We can create accounts for your own specific platform and do make sure that all the data sent to us is password protected. And if you're sending confidential data that you actually mark that as confidential. We do use restricted internally we put an underscore restricted that's the that's the suffix. But if you use any other type, for example, underscore confidential that's totally fine. Just let us know that you are sending confidential data, which needs to be encrypted as well besides password protected. We do review all the materials that we get from a confidentiality and data integrity perspective. Again, if any queries arise, we will be in touch to bear in mind is while we might have some proposed changes. Sometimes most of them this are about anonymization, we might think the disclosure risk is a bit high. What we actually do is we make a plan. For example, we think age should be bandied and postcode shouldn't be included. We can include government office region or income needs to be tropium bottom coded. All of this is documented and send to you as the data depositor. Would you agree with the changes from our perspective the disclosure risk is too high. Are you happy for these changes to take place and only once we have your approval, we will then apply the changes. The curation process you sent everything. Okay, so what do you do with my data and documentation files. We do carry out enhancement. So if the labels in the files, for example, are not complete, we would complete the labels in the in the quant files for quality that we might add headers as standard for our collection. We do generate multiple data formats for dissemination and also for preservation purposes. So researchers can download the data in SPSS data or tap delimited us to format. But we also have preservation formats for one data, which made sure that if IBM decide not to release SPSS anymore. We can actually reconstruct the data and use a different software. The same applies for for quality data. We save it as RTF rather than dot X or dot for Word documents. We collect and prepare user documentation for the documentation that you send. We nicely put headers on it. We make it more user friendly to say we do create the DDI compliant catalog metadata which is available to you all the users. So sometimes we might tweak a little bit the abstract to make it more concise. But everything is so that the secondary data user gets information that is very easy to read. We do assign as mentioned digital object identifier data say data site DIYs in within 30 working days of receiving the data set. We publish it via our data catalog for a call collection. It might take 35 days. Now, as I said, I'm sorry and keep checking the clock, but I can see we only have two questions or that should be fine. If your studies have already been published and you wish to publish a new edition, you have updated weights, weighting variables in the data. You have made some changes. You want rather new wave of data, for example, do get in touch with our collections development team. We can start the process for you and you don't need to complete the offer form again because this is an existing study. You would only need to complete the deposit form and mentioning any changes to the data in the abstract information. A new edition December 2021 with updated weighting variables has been uploaded. As mentioned, I have included further resources and pleased to make use of them once we made the slides available. More than welcome to get in touch with us with any questions you might have. I will include before making the slides available because I kept mentioning Richard as well for more academic base data. I will include the Richard mailbox as well for any specific questions about depositing in the Richard repository. Thank you so much for coming. I know have tried to cover as much as possible in just an hour. I'm glad to see people are saying that the session has been informative. Please don't hesitate to contact us with any questions. We are here to help. That's the very bottom line. Thank you so much. Thank you. And I'm sure I'll speak with at least some of you soon.