NODE Overview

NODE (NOAA Ocean DNA Explorer) is a platform for exploring ocean DNA data. This help documentation will guide you through the various features and functionalities of the platform.

Our goal is to make marine genomic data more accessible, interoperable, and usable for researchers, policymakers, and the public.

Features Overview

NODE provides several key features to help you work with marine genomic data:

  • Explore projects, samples, analyses, features, and taxonomies
  • Search across datasets using powerful query capabilities
  • Submit your own data in standardized formats
  • Download and reuse existing datasets
  • Visualize taxonomic information with integrated visualization tools

Contact Us, Report a Bug, Request a Feature

We welcome your feedback to improve NODE. If you encounter any issues or have suggestions for new features, please let us know.

You can submit bug reports, feature requests, or general feedback through our GitHub issues page:

NODE GitHub Issues

When reporting bugs, please include:

  • A clear description of the issue
  • Steps to reproduce the problem
  • What you expected to happen
  • What actually happened
  • Screenshots if applicable

Explore

The Explore section allows you to browse through different categories of data in the NODE platform. Each category provides specific views and functionality for different types of information.

Projects

Projects represent research initiatives or sampling campaigns. Each project contains multiple samples and may have associated analyses.

Key project information includes:

  • Project name and description
  • Principal investigator and collaborators
  • Temporal and geographic scope
  • Associated samples and analyses

Note: You can remove projects you have submitted. Removing a project will also remove any associated analyses.

Samples

Samples represent physical specimens or environmental samples collected during a project. They form the basis for subsequent analyses.

Sample data typically includes:

  • Collection location and date
  • Sample type and processing method
  • Environmental context data
  • Storage information

Analyses

Analyses represent the results of processing samples through various methods such as DNA sequencing, PCR, or other molecular techniques.

Important information about analyses:

  • Analyses are linked to projects
  • You can add analyses to projects you did not submit
  • You can see who the project belongs to when adding analyses
  • You can view and remove your own analyses through the My Submissions Manager

Analysis data typically includes information about the sequencing method, bioinformatic processing, and taxonomic assignments.

Features

Features represent unique DNA sequences (e.g., Amplicon Sequence Variants or ASVs) found in samples, typically representing distinct organisms.

Each feature includes:

  • A unique identifier
  • The DNA sequence
  • Sequence length information
  • Consensus taxonomic classification
  • Prevalence across samples

Features provide the foundation for taxonomic classification and biodiversity assessment.

Taxonomies

Taxonomies show the biological classification of organisms identified in your samples, from domain to species level.

The taxonomic outline image is sourced through PhyloPic, using GBIF Suggest API to match our taxonomy with PhyloPic's database. Images on PhyloPic are contributed by scientists and artists worldwide under various Creative Commons licenses.

If no image is displayed for a taxonomy, it could be due to:

  • The taxonomy is unregistered in reference databases
  • The taxonomy is a CLADE designation
  • PhyloPic does not have an image for that taxonomy
  • GBIF Suggest API did not return a matching taxonomy

Submit

The Submit section allows you to contribute your own data to the NODE platform. You can submit projects and analyses to share with the scientific community.

Data Format Rationale

NODE follows community standards to ensure data quality, interoperability, and usability. Our data formats are designed with the following principles in mind:

  • Darwin Core (DwC) for biodiversity data
  • FAIR principles (Findable, Accessible, Interoperable, Reusable)
  • Other relevant community standards

Standardized formats ensure that your data can be easily discovered, understood, and reused by other researchers.

My Submissions Manager

The My Submissions Manager allows you to view, manage, and track your submitted data.

Note: You must be signed in to access the My Submissions Manager.

From the My Submissions Manager, you can:

  • View your submitted projects and analyses
  • Check submission status
  • Edit or update submissions
  • Remove your submissions

Project Submissions

To submit a project, you'll need to prepare a metadata file following our template format.

The project submission process includes:

  1. Preparing your project metadata
  2. Filling out the submission form
  3. Uploading your metadata file
  4. Reviewing and confirming your submission

Required fields include project name, description, principal investigator, and collection dates.

All files must be in TSV format and follow the template structure exactly.

Analysis Submissions

Analysis submissions require information about the analysis methods, results, and associated project.

The analysis submission process includes:

  1. Selecting the associated project
  2. Preparing your analysis data files
  3. Filling out the analysis metadata form
  4. Uploading your files
  5. Reviewing and confirming your submission

Required fields include analysis type, sequencing method, and bioinformatic processing details.

All files must be in TSV format and follow the template structure exactly.

FAQ

Frequently asked questions about using the NODE platform.

Q: Why are zebras striped?

A: While scientists have been debating this for years, the most widely accepted theory is that zebra stripes help deter biting flies like horse flies that can spread disease. The stripes may also help with thermoregulation and social recognition among zebras.

Q: Who can submit data to NODE?

A: NODE is open to submissions from researchers, government agencies, NGOs, and other organizations working with marine genomic data. You need to create an account to submit data.

Q: Can I download the entire database?

A: While individual datasets can be downloaded, we currently don't provide a bulk download of the entire database. For large-scale data access, please contact us to discuss your needs.

Q: How do I cite data from NODE?

A: Each project and analysis has a recommended citation format provided on its page. Please use these citations to properly acknowledge the data contributors.

Q: What browsers are supported?

A: NODE works best with modern browsers like Chrome, Firefox, Safari, and Edge. We recommend keeping your browser updated to the latest version.

Q: How secure is my submitted data?

A: All data transfer is encrypted using HTTPS. Your account information is secured, and you control the visibility of your submitted data.

Q: Why can't I see my submission immediately?

A: Submissions go through a brief validation process to ensure data quality and integrity. Most submissions are processed within 24-48 hours.