NODE

NODE Overview

NODE (NOAA Ocean DNA Explorer) is a platform for exploring ocean DNA data. This help documentation will guide you through the various features and functionalities of the platform.

Our goal is to make marine genomic data more accessible, interoperable, and usable for researchers, policymakers, and the public.

Features Overview

NODE provides several key features to help you work with marine genomic data:

Explore projects, samples, analyses, features, and taxonomies
Search across datasets using powerful query capabilities
Submit your own data in standardized formats
Download and reuse existing datasets
Visualize taxonomic information with integrated visualization tools

Contact Us, Report a Bug, Request a Feature

We welcome your feedback to improve NODE. If you encounter any issues or have suggestions for new features, please let us know.

You can submit bug reports, feature requests, or general feedback through our GitHub issues page:

NODE GitHub Issues

When reporting bugs, please include:

A clear description of the issue
Steps to reproduce the problem
What you expected to happen
What actually happened
Screenshots if applicable

Explore

The Explore section allows you to browse through different categories of data in the NODE platform. Each category provides specific views and functionality for different types of information.

Projects

Projects represent research initiatives or sampling campaigns. Each project contains multiple samples and may have associated analyses.

Key project information includes:

Project name and description
Principal investigator and collaborators
Temporal and geographic scope
Associated samples and analyses

Note: You can remove projects you have submitted. Removing a project will also remove any associated analyses.

Samples

Samples represent physical specimens or environmental samples collected during a project. They form the basis for subsequent analyses.

Sample data typically includes:

Collection location and date
Sample type and processing method
Environmental context data
Storage information

Analyses

Analyses represent the results of processing samples through various methods such as DNA sequencing, PCR, or other molecular techniques.

Important information about analyses:

Analyses are linked to projects
You can add analyses to projects you did not submit
You can see who the project belongs to when adding analyses
You can view and remove your own analyses through the My Submissions Manager

Analysis data typically includes information about the sequencing method, bioinformatic processing, and taxonomic assignments.

Features

Features represent unique DNA sequences (e.g., Amplicon Sequence Variants or ASVs) found in samples, typically representing distinct organisms.

Each feature includes:

A unique identifier
The DNA sequence
Sequence length information
Consensus taxonomic classification
Prevalence across samples

Features provide the foundation for taxonomic classification and biodiversity assessment.

Taxonomies

Taxonomies show the biological classification of organisms identified in your samples, from domain to species level.

The taxonomic outline image is sourced through PhyloPic, using GBIF Suggest API to match our taxonomy with PhyloPic's database. Images on PhyloPic are contributed by scientists and artists worldwide under various Creative Commons licenses.

If no image is displayed for a taxonomy, it could be due to:

The taxonomy is unregistered in reference databases
The taxonomy is a CLADE designation
PhyloPic does not have an image for that taxonomy
GBIF Suggest API did not return a matching taxonomy

Search

NODE's search functionality allows you to find specific data across all categories in the platform. You can search by various parameters to narrow down results.

Basic Usage

To perform a basic search:

Enter keywords in the search bar
Select the category you want to search (Projects, Samples, etc.)
Use filters to narrow down results
Review the matching results

The search function looks for matches in titles, descriptions, metadata, and other relevant fields.

Query Recipes (examples)

Here are some example queries to help you get started:

Find all samples from a specific location:

location:"Gulf of Mexico"

Find projects containing a specific species:

taxonomy:"Salmonidae"

Find analyses from a specific date range:

date:[2020-01-01 TO 2022-12-31]

Combine multiple search terms:

location:"Atlantic Ocean" AND taxonomy:"Bacteria"

Submit

The Submit section allows you to contribute your own data to the NODE platform. You can submit projects and analyses to share with the scientific community.

Data Format Rationale

NODE follows community standards to ensure data quality, interoperability, and usability. Our data formats are designed with the following principles in mind:

Darwin Core (DwC) for biodiversity data
FAIR principles (Findable, Accessible, Interoperable, Reusable)
Other relevant community standards

Standardized formats ensure that your data can be easily discovered, understood, and reused by other researchers.

My Submissions Manager

The My Submissions Manager allows you to view, manage, and track your submitted data.

Note: You must be signed in to access the My Submissions Manager.

From the My Submissions Manager, you can:

View your submitted projects and analyses
Check submission status
Edit or update submissions
Remove your submissions

Project Submissions

To submit a project, you'll need to prepare a metadata file following our template format.

The project submission process includes:

Preparing your project metadata
Filling out the submission form
Uploading your metadata file
Reviewing and confirming your submission

Required fields include project name, description, principal investigator, and collection dates.

All files must be in TSV format and follow the template structure exactly.

Analysis Submissions

Analysis submissions require information about the analysis methods, results, and associated project.

The analysis submission process includes:

Selecting the associated project
Preparing your analysis data files
Filling out the analysis metadata form
Uploading your files
Reviewing and confirming your submission

Required fields include analysis type, sequencing method, and bioinformatic processing details.

All files must be in TSV format and follow the template structure exactly.

FAQ

Frequently asked questions about using the NODE platform.

Q: Why are zebras striped?

A: While scientists have been debating this for years, the most widely accepted theory is that zebra stripes help deter biting flies like horse flies that can spread disease. The stripes may also help with thermoregulation and social recognition among zebras.

Q: Who can submit data to NODE?

A: NODE is open to submissions from researchers, government agencies, NGOs, and other organizations working with marine genomic data. You need to create an account to submit data.

Q: Can I download the entire database?

A: While individual datasets can be downloaded, we currently don't provide a bulk download of the entire database. For large-scale data access, please contact us to discuss your needs.

Q: How do I cite data from NODE?

A: Each project and analysis has a recommended citation format provided on its page. Please use these citations to properly acknowledge the data contributors.

Q: What browsers are supported?

A: NODE works best with modern browsers like Chrome, Firefox, Safari, and Edge. We recommend keeping your browser updated to the latest version.

Q: How secure is my submitted data?

A: All data transfer is encrypted using HTTPS. Your account information is secured, and you control the visibility of your submitted data.

Q: Why can't I see my submission immediately?

A: Submissions go through a brief validation process to ensure data quality and integrity. Most submissions are processed within 24-48 hours.