NODE Overview
NODE (NOAA Ocean DNA Explorer) is a platform for exploring ocean DNA data. This help documentation will guide you through the various features and functionalities of the platform.
Our goal is to make marine genomic data more accessible, interoperable, and usable for researchers, policymakers, and the public.
Features Overview
NODE provides several key features to help you work with marine genomic data:
- Explore projects, samples, analyses, features, and taxonomies
- Search across datasets using powerful query capabilities
- Submit your own data in standardized formats
- Download and reuse existing datasets
- Visualize taxonomic information with integrated visualization tools
Contact Us, Report a Bug, Request a Feature
We welcome your feedback to improve NODE. If you encounter any issues or have suggestions for new features, please let us know.
You can submit bug reports, feature requests, or general feedback through our GitHub issues page:
When reporting bugs, please include:
- A clear description of the issue
- Steps to reproduce the problem
- What you expected to happen
- What actually happened
- Screenshots if applicable
Explore
The Explore section allows you to browse through different categories of data in the NODE platform. Each category provides specific views and functionality for different types of information.
Projects
Projects represent research initiatives or sampling campaigns. Each project contains multiple samples and may have associated analyses.
Key project information includes:
- Project name and description
- Principal investigator and collaborators
- Temporal and geographic scope
- Associated samples and analyses
Note: You can remove projects you have submitted. Removing a project will also remove any associated analyses.
Samples
Samples represent physical specimens or environmental samples collected during a project. They form the basis for subsequent analyses.
Sample data typically includes:
- Collection location and date
- Sample type and processing method
- Environmental context data
- Storage information
Analyses
Analyses represent the results of processing samples through various methods such as DNA sequencing, PCR, or other molecular techniques.
Important information about analyses:
- Analyses are linked to projects
- You can add analyses to projects you did not submit
- You can see who the project belongs to when adding analyses
- You can view and remove your own analyses through the My Submissions Manager
Analysis data typically includes information about the sequencing method, bioinformatic processing, and taxonomic assignments.
Features
Features represent unique DNA sequences (e.g., Amplicon Sequence Variants or ASVs) found in samples, typically representing distinct organisms.
Each feature includes:
- A unique identifier
- The DNA sequence
- Sequence length information
- Consensus taxonomic classification
- Prevalence across samples
Features provide the foundation for taxonomic classification and biodiversity assessment.
Taxonomies
Taxonomies show the biological classification of organisms identified in your samples, from domain to species level.
The taxonomic outline image is sourced through PhyloPic, using GBIF Suggest API to match our taxonomy with PhyloPic's database. Images on PhyloPic are contributed by scientists and artists worldwide under various Creative Commons licenses.
If no image is displayed for a taxonomy, it could be due to:
- The taxonomy is unregistered in reference databases
- The taxonomy is a CLADE designation
- PhyloPic does not have an image for that taxonomy
- GBIF Suggest API did not return a matching taxonomy
Search
NODE's search functionality allows you to find specific data across all categories in the platform. You can search by various parameters to narrow down results.
Basic Usage
To perform a basic search:
- Enter keywords in the search bar
- Select the category you want to search (Projects, Samples, etc.)
- Use filters to narrow down results
- Review the matching results
The search function looks for matches in titles, descriptions, metadata, and other relevant fields.
Query Recipes (examples)
Here are some example queries to help you get started:
Find all samples from a specific location:
Find projects containing a specific species:
Find analyses from a specific date range:
Combine multiple search terms:
Submit
The Submit section allows you to contribute your own data to the NODE platform. You can submit projects and analyses to share with the scientific community.
Data Format Rationale
NODE follows community standards to ensure data quality, interoperability, and usability. Our data formats are designed with the following principles in mind:
- Darwin Core (DwC) for biodiversity data
- FAIR principles (Findable, Accessible, Interoperable, Reusable)
- Other relevant community standards
Standardized formats ensure that your data can be easily discovered, understood, and reused by other researchers.
My Submissions Manager
The My Submissions Manager allows you to view, manage, and track your submitted data.
Note: You must be signed in to access the My Submissions Manager.
From the My Submissions Manager, you can:
- View your submitted projects and analyses
- Check submission status
- Edit or update submissions
- Remove your submissions
Project Submissions
To submit a project, you'll need to prepare a metadata file following our template format.
The project submission process includes:
- Preparing your project metadata
- Filling out the submission form
- Uploading your metadata file
- Reviewing and confirming your submission
Required fields include project name, description, principal investigator, and collection dates.
All files must be in TSV format and follow the template structure exactly.
Analysis Submissions
Analysis submissions require information about the analysis methods, results, and associated project.
The analysis submission process includes:
- Selecting the associated project
- Preparing your analysis data files
- Filling out the analysis metadata form
- Uploading your files
- Reviewing and confirming your submission
Required fields include analysis type, sequencing method, and bioinformatic processing details.
All files must be in TSV format and follow the template structure exactly.
FAQ
Frequently asked questions about using the NODE platform.
Q: Why are zebras striped?
A: While scientists have been debating this for years, the most widely accepted theory is that zebra stripes help deter biting flies like horse flies that can spread disease. The stripes may also help with thermoregulation and social recognition among zebras.
Q: Who can submit data to NODE?
A: NODE is open to submissions from researchers, government agencies, NGOs, and other organizations working with marine genomic data. You need to create an account to submit data.
Q: Can I download the entire database?
A: While individual datasets can be downloaded, we currently don't provide a bulk download of the entire database. For large-scale data access, please contact us to discuss your needs.
Q: How do I cite data from NODE?
A: Each project and analysis has a recommended citation format provided on its page. Please use these citations to properly acknowledge the data contributors.
Q: What browsers are supported?
A: NODE works best with modern browsers like Chrome, Firefox, Safari, and Edge. We recommend keeping your browser updated to the latest version.
Q: How secure is my submitted data?
A: All data transfer is encrypted using HTTPS. Your account information is secured, and you control the visibility of your submitted data.
Q: Why can't I see my submission immediately?
A: Submissions go through a brief validation process to ensure data quality and integrity. Most submissions are processed within 24-48 hours.