Finding and reusing your data will be easier, both for you and your fellow researchers, if you plan early in the process to how you will name your data files and what file formats you will use to store your data. If you are planning to archive or share your data, you will also want to consider best practices for describing data.
The format of the electronic data files you work with during your research may be determined by the research equipment and computer hardware and software that you utilize. However, for long-term preservation and ease of sharing, best practices may dictate that the files be converted to a different format after your project has ended. Planning for this eventuality at the outset will save time later.
Consider the following:
Will your data be in a format that requires proprietary software for access?
If you will be depositing your data in a repository at the end of your project, does the repository have specific guidelines or requirements regarding file format?
What features of your data might be lost or modified in the conversion to another file format?
Stanford University Libraries - Data Management Services provides a useful overview of preferred file formats:
Additional helpful guidelines for selecting file formats can be found at these websites:
Choosing formats (Cambridge University Libraries - Data Management)
File formats (Cornell University - Research Data Management Service Group)
ResearchWorks Archive List of Preferred File Formats (University of Washington - University Libraries)
How you organize and name files will impact your ability to find the files later and to understand what they contain. You should be consistent and descriptive in naming and organizing files so that it is clear where to find specific data and what the files contain.
A good idea is to set up a clear directory structure that includes the project title, a date, and some type of unique identifier. Individual directories may be set up by date, the researcher, experimental run, or what makes sense for you and the research.
File names should allow you to identify a precise experiment from the name. Choose a format for naming files and use it consistently.
You many consider including some of the following information in file names, but be sure to include any information that allows you to distinguish the files from one another:
Another best practice is to include in the directory a readme.txt file that explains the naming format with any abbreviations or codes used. (More information on readme files on this page.)
You may already have a lot of data collected for your project and wish to organize and rename the files for easier data management. If you have too many files to rename them all manually, try one of the following applications for renaming the files: