 What is the importance of file formats in research data management? A file format is a way of encoding information within a computer file. When a program or an application wants to use a file, it needs to recognize the file format so that it can access the content within the file. One of the most common ways to recognize a file format is to look at the file name extension. Usually, this is represented by three or four characters at the end of the file name, after a full stop. When you start planning your project, it is important to consider the file formats that you will use throughout your research. Sometimes, the choice of a format is influenced by norms within your research discipline, for instance by commonly used software programs or file formats that you and your colleagues have used in the past. Or, the choice might also depend on the type of hardware or instrument you will use. In other situations, choosing a specific format might limit the possibility to collaborate with other scientists who don't have access to the same software tools. And last but not least, certain file formats are less future-proof than others. We often refer to this problem as file format obsolescence, and there are several reasons for this to happen. For example, sometimes older versions of a file format are no longer supported by newer versions of the software. Or, the software supporting the format is no longer available or cannot be used in newer operating systems. So, choosing the appropriate file format has implications during your research project, but also for the long-term usability of your data. All file formats are vulnerable to obsolescence to a certain degree. However, the sustainability of a file format can increase when you use what we call open and standard formats. Remember that a file format describes the way in which information is stored and organized within a computer file. If this description, known as file format specification, is available for anyone to see and free of charge, then we talk about open file formats. When the specification is not publicly available, or there are limitations on how it can be reused, we talk about closed formats. Let's have a closer look. Closed formats, often called proprietary formats, are usually developed for commercial software applications. These files might only be readable with the same software used to create them, for which a license is needed. Files produced with one version of the software might not be compatible with older or newer versions. Because the use of closed file formats depends on a specific software package, or even a specific version of the software, they are more vulnerable to obsolescence. In other words, closed formats are less sustainable. Open file formats can be both proprietary and non-proprietary. Sometimes they are developed and maintained by a commercial company, but most often they are released by a standardization body or a community without commercial interest. In any case, the file format specifications are open, which means that anyone could potentially develop software packages or applications that can use these formats, maximizing the interoperability and reuse of information they contain. Besides, compatibility with older versions of the format is a priority, making these types of formats less subject to obsolescence issues. So, in general, open and non-proprietary formats are more sustainable when it comes to preserving data. But, sometimes, we don't have a choice but to use closed formats. In such cases, it is good practice to also create a copy of your files in an open format to increase the chances of your data being accessible in the long term. Actually, file format conversion might be needed at any moment of your research, for example to use the data with a different software package, or to share it with others. However, you should realize that converting files from one format to another might result in the loss of content, metadata, or quality. Before you migrate files to another format, it is important to be aware of the risks and to test what can go wrong. It is also good practice to keep the original files, so you could always return to them and repair any errors or changes in your data if something goes wrong during file conversion. When choosing a file format, it might also be important to check what kind of compression is used. Compression is the process of encoding information using fewer bits than the original representation. There are two types of compression. Lossless compression allows the original data to be perfectly reconstructed from the compressed data. When a file is compressed using this method, uncompressing it again results in a file that is identical to the original file. But there are also lossy types of compression, where some of the content is lost. Before performing a lossy compression, it is advisable to do some research to understand what compression parameters should be used. The objective should be to retain the critical information needed to make your files reusable. As you can see, there are a few things to consider when it comes to file formats. To get started, you can access a list of recommended formats via our website. Take a look.