 I'm Oluwashi Omobashala, a librarian from Nigeria working at the British Library in London on a project tag, big data and libraries. The video you're about to watch focuses on digital archiving. The aim of the video is to provide basic knowledge about digital archiving for librarians and archivists in training, especially in Africa. Digital archiving starts with digitization of archival materials which involve the use of technology, both hardware and software in managing valuable records that are born digital and those converted into digital form. Materials for digital archiving should be original, authentic, reliable and usable. Digital archiving involves planning, creation of digital objects, acquisition or injection, cataloging, preservation and storage, access for use and reuse and evaluation. However, the processes can vary depending on the type of material. To plan well for digitization one needs to spend quite a bit of time on scoping for projects and this would involve answering quite a few questions such as what are we to digitize and why is this important, who is the audience, we're digitizing this for, how are we going to achieve the digitization, what access would look like and what do we know about the collection, do we have metadata available and of course to consider what the budget will be, what the condition of the collection is and are there any rights issues with the collection. Archiving ensures preservation of valuable records for long-term use, efficient retrieval of important records for use and reuse. It prevents loss of valuable records. From an archiving point of view the main things we're worried about is web resources rot disappear very quickly so we did some studies on the stuff we've collected over the years and whether it's still on the web now and we found that even within one or two years more than half of the content we'd archived had already disappeared from the live web so the risk, so it's not just about collecting a snapshot of how the stuff looked at a given time, it's also the fact that most of this content simply disappears within a year or two of being published and we found that when you're 10 years out there's less than about 5% of content is still on the web, you know unchanged as it was 10 years ago. Archiving ensures the provision of records that can tell stories about events, individuals, people or organizations and our calver materials can be used as evidence during legal proceedings to ensure justice. During preparation for digital archiving there is the need for a careful consideration of both hardware and software that will be used to avoid digital obsolescence. When you have a book you can put a book on a shelf very safely for a long term, long time without anything happening to it but if you have a digital object and you don't check it regularly it can become inaccessible very quickly and that's because the world around it changes so quickly so the means that you needed to have in order to access it may not be available anymore but we may not be aware of it so what we need to do is we need to what we call characterize it so we need to make an inventory of all the very important properties of this digital object and then we need to keep a watch and see how the world changes and whether there's a certain risk arising that may affect this digital object so when we find out that for example file formats no longer supported or a license for a software product expires and we have no right anymore to access it then we need to think about what are the preservation actions that we need to take in order to ensure the long-term access or the continued access of this and so what we have is we have constant watchfulness and we have as I said earlier the provenance where we're going from possibly one migration to another so we keep the thing accessible or alternatively if we don't want to migrate for example we could say we emulate the platform on which the digital object was used so if I have an old computer system and this computer system is just not used anymore but there are lots of files that all use to run on it what I can do instead I can take a modern computer system and I can emulate that old system on it and that means then on the emulation platform I can still use all those digital objects hardware used for digital archiving include computers scanners cameras and storage devices the scanners the main scanners we use the small the social 1200 scanners there is kind of that has a glass platen but that platen doesn't have to be used we can scan without the glass on those machines so for more fragile items stuff that we have to be particularly careful with those machines are very good for that sort of work also the software will correct out the curvature in the pages to a to a large degree the beds are highly adjustable so regardless of the size of the item we can we can get them to sit on there nicely the larger machines like the one behind me although we can scan without the glass it's a little more restrictive on the machine like this however on these machines the beds are highly adjustable the pressure of both the glass and the bed itself can be adjusted to very high degree to make sure that there's no damage to the material but some material is sent to us and we are told you cannot touch the surface with the glass so this will dictate which machines we use for which projects as I was speaking about the collection care issues earlier also within one project we might have different types of material this is the machine we would use for any cut sheet items or loose sheet items a lot of projects that we get people sending could be anything from card files we might have material that's actually been cut so we can feed it through as individual sheets and this automatically scans both sides at the same time it cannot detect colour and black and white or grey scale and then we can output to all the different formats from this as well the operator only has to select whichever job with the predetermined job settings in so they select them it defaults to open we size the tray at the bottom it has ultrasonic sensors which detect this paper there and will also detect two sheets feeding at once as well and will give us a warning so when we're happy it's loaded okay we can then start the batch scan which I've just done and as you can see it's very fast it's I haven't actually got the auto rotate on but we can turn the auto rotate on we don't always have it on because sometimes you have images that aren't meant to be rotated we do that manually afterwards but as you can see it's captured front and back if it's identified some colour it will capture in colour if it hasn't identified colour it'll capture in the black and white or grey scale card files it's actually got ignore blank page on so it's not captured the back because there's nothing on there and it's just captured the front of the card we would then once it's finished just click on the finalised button and that actually if you can see this it's starting to process the batch and that will be creating from the job setting either we've asked it to just send a tiff or a multi-tiff to a particular folder on the network it might be creating a PDF and it might be doing the OCRing of that PDF as well and delivering it to a folder we've specified already for that job or someone to them check the work and post-process it further the dates the archival object was born should be in the ISO 601 2004 standard the ISO 8601 2004 standard formats provide a consistent method for fashion tracking over the years it is also important to state the context in which a digital object was created for an estimation of the file size for an image the formula displayed can be used for easy access either globally or locally archival materials can be deployed by adapting the open archive information system ISO standard 147 121 2003 see ICS standards for more information the key I think first step if you want people to access your information is think about who are the like the user groups that may want to come to you and where they're likely to find the information that you want to get them to access so having your own library catalog or archival catalog or sort of a page of information of the collections that you have is quite important but equally you have to make sure the other services can pick up the information so for example when you design your website it's quite important to think about how can Google and other search engines index that website and have enough information on there that sort of set up in the way that Google can find the information the keyword here is search engine optimization and there are quite a lot of websites that sort of provide basic information on how to set this up so it's effectively thinking outside the existing library catalog and say what are my users using are there social networks that are perhaps prominent in the country that you are in or are there sort of tools that are provided by publishers Digital objects DOI should be obtained for a cover material A DOI is a special persistent tag for an uncover material DOIs link the user to three things the object its metadata and the current provider's commit met statement we use as a as an overall framework met the format that was developed at the Library of Congress for maintaining information about digital objects we use that as a framework and within that we may plug in then other metadata elements relating to items like elements like preservation and description so for preservation we would use the premise standard for descriptive metadata we'd use the descriptive standard which is appropriate to the type of material so in the case of books and serials traditional material that is you that is cataloged by the library we would use something like the Mark 21 or Mark XML descriptive framework use the new RDA resource description access standard for cataloging rules for that material but for something like manuscript material we would use a different set of descriptive standards which are appropriate to that archival world so for example the ISADG standard for general materials we might in the case of doing some detailed digitization of a manuscript particular asian materials at the moment we're experimenting with the text encoding initiative metadata standard for enriching the descriptive information around the manuscript itself and also creating metadata we're holding that in our integrated archive and manuscript system and we are then exporting that to users as required with commercial material often you can use standard identifiers such as in the case of an ebook an ISBN or e-journal an ISSN to sometimes derive additional data to enhance the description but in the case of the unique manuscript and older digital material that's much harder to do due to the lack of widely available unique identifiers that type of material but one thing we are also looking at doing is where possible enhancing certain elements within the descriptive metadata itself through the addition of things like the new international standard name identifier the effective at the ISBN for authors effective a unique identifier that enables us to assign identities incredibly wide range of material to deal with a huge amount of challenge to to handle and a lot of expect expectations to to deal with as well and that's always perhaps the the greatest challenge of all because people are always seeing the most cutting-edge developments in the world I hope you've enjoyed this video and you found it both useful and informative many thanks to Shabnam British Library and British Library staff for supporting the project