Workshop Description
Computational tools make it easier than ever to collect, analyze, and share data and results in transparent and reproducible ways. To achieve reproducibility, it is important to set systems in place and manage data thoroughly. Data management is the process of handling, organizing, documenting, and preserving data used in a project/research. It helps your research outcomes be transparent, maximizing your work’s practical use and value. This lesson provides an overview of all the meaningful steps in a research project and dives into how to plan for successful data management. We will also discuss tools and systems to help you organize and document your work for better reproducibility, to make your analysis robust, and to facilitate collaboration (including with yourself).
Learning Objectives
- Understand the importance and benefits of data management and data management plans.
- Apply the Data Life Cycle steps to organize and plan a research project.
- Introduce tools and techniques for establishing reproducible analytical workflows.
- Discuss why we should aim for reproducibility and its importance for collaboration.
Find the video recording of this workshop in this link.
Resources
- The Research Data Management Workbook by Kristin Briney
- Data inventory spreadsheet template
- Documenting Things: Openly for Future Us, by Julia Stewart Lowndes at posit::conf(2023). Slides & Recording
- Ocean Health Index Documentation: Methods and SOP for data management
- Arctic Data Center data policy template
- LTER Working Group README template