Subject and Course Guides: Metadata: Data Documentation

Data Documentation

Basic elements of data documentation and best practices to follow while conducting your research.

Basic Elements of Data Documentation

Title - The name of the research project or dataset.
Creator - Names of individuals who created the dataset, including organizational affiliation.
Dates - The date range during which the data was collected, processed, and/or modified, as well as the dates to which the data pertain.
Methodology - The process by which the data was created or captured, including any code, software, equipment, or protocols used.
Subjects or Keywords - Words of phrases which describe the type or content of the data, the location in which it was collected, as well as the discipline or domain to which it pertains.
Funders - The agencies or organizations which funded the research that produced the dataset.
Rights Statement - Information regarding conditions governing access to or use of the data, as well as who holds the intellectual property rights for the data.
Unique Identifiers - Any name, number, or alpha-numeric text string used to uniquely identify the project or dataset, including grant numbers or internal reference numbers.

Levels of Metadata

For a given research project, metadata are generally created at two levels: project- and data-level. Project-level metadata describes the “who, what, where, when, how and why” of the dataset, which provides context for understanding why the data were collected and how they were used.

Examples of project-level metadata are:

Name of the project
Dataset title
Project description
Dataset abstract
Principal investigator and collaborators
Contact information
Dataset handle (DOI or URL)
Dataset citation
Data publication date
Geographic description
Time period of data collection
Subject/keywords
Project sponsor
Dataset usage rights

Dataset level metadata are more granular. They explain, in much better detail, the data and dataset.

Dataset level metadata might include:

Data origin: experimental, observational, raw or derived, physical collections, models, images, etc.
Data type: integer, Boolean, character, floating point, etc.
Specialized tools: microscopes, cameras, etc.
Data acquisition details: sensor deployment methods, experimental design, sensor calibration methods, etc.
File type: CSV, mat, xlsx, tiff, HDF, NetCDF, etc.
Data processing methods, software used
Data processing scripts or codes
Dataset parameter list, including
- Variable names
- Description of each variable
- Units

Data Documentation Good Practice

During your research, document all research data formats utilized by your project. Research data comes in many varied formats, such as:

Text - flat text files, Word, Portable Document Format (PDF), Rich Text Format (RTF), Extensible Markup Languague (XML).
Numerical - Statistical Package for the Social Sciences (SPSS), Stata, Excel.
Multimedia - jpeg, tiff, dicom, mpeg, quicktime.
Models - 3D, statistical.
Software - Java, C.
Discipline specific - Flexible Image Transport System (FITS) in astronomy, Crystallographic Information File (CIF) in chemistry.
Instrument specific - Olympus Confocal Microscope Data Format, Carl Zeiss Digital Microscopic Image Format (ZVI).

Dataset Documentation:

Variable names, and descriptions
Explanation of codes and classification schemes used
Algorithms used to transform data
File format
Software - version, OS

Additional Resources

Research Data Management
Research data management by the UK Data Service
Preparing Tabular Data
General guidelines prepared by Cornell University for preparing tabular data for inclusion in a repository or for sharing it with other researchers.

University of Illinois Chicago

University
Library

Search UIC Library Collections

Search UIC Library Website

Metadata

Data Documentation

Basic Elements of Data Documentation

Levels of Metadata

Data Documentation Good Practice

Additional Resources

Richard J. Daley Library

Library of the Health Sciences, Chicago

Library of the Health Sciences, Peoria

Crawford Library of the Health Sciences, Rockford

UIC Law Library