9 Session 9: Hands On: Exploration of Data Resources and Synthesis Development
9.1 Datasets of Interest
There is an excel spreadsheet of datasets of interest located in the sharepoint here. The vast majority of these datasets are already available online, in some form, which makes synthesis much easier. There are several other datasets that are not yet published, but will be published by week two of the workshop.
This chart shows a summary of the excel spreadsheet, with links to published datasets on the left side, and hover text with more information over each bar. The datasets have been categorized (very loosely) into groups, though many of the fish datasets also contain water quality or plankton data.
9.2 Finding more data
If, in exploring and discussing the information above, groups decide they are missing pieces of data, searching through DataONE is a good start. We created a trial data portal on DataONE that shows just datasets in our geographic area of interest. You can view the portal here.
A portal is a collection of data packages from repositories within the DataONE federated network, displayed on a unique webpage. This Open Science Synthesis for the Delta Science Program portal was constructed by defining geographic boundaries for the region of interest and it displays 9,189 data packages from within the DataONE collection of over 838K. This is a dynamic subset that will continue to grow as more data are added to repositories within the DataONE network.
Typically, a research project’s website won’t be maintained beyond the life of the project and all the information on the website that provides context for the data collection is lost. Portals can provide a means to preserve information regarding the projects’ objectives, scopes, and organization and couple this with the data files so it’s clear how to use and interpret the data for years to come. Plus, when datasets are scattered across the repositories in the DataONE network, portals can help see them all in one convenient webpage.
Portals also leverage DataONE’s metric features, which create statistics describing the project’s data packages. Information such as total size of data, proportion of data file types, and data collection periods are immediately available from the portal webpage.
Here is an example of a fully fledged data portal that contains not only searchable raw and derived data, but also pages describing the process, people, and key results of the working groups.
Step-by-step instructoin on how to create a data portal is provided within Section 16, ‘Additional Resources.’