
Current Projects

Each project has a corresponding YAML configuration file that defines its settings. Tools such as aidata use these files to load and download data, and argo-workflows uses them to run processing pipelines. Use the links below directly in the tools, or download the files to your local machine if you prefer. Contact the project lead for more information before using the data.
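For readers who prefer a local copy, the download step can be sketched with the Python standard library alone. This is a minimal sketch, not part of aidata or argo-workflows: `fetch_config` is a hypothetical helper, and the URL in the usage note is a placeholder rather than a real project link.

```python
import urllib.request
from pathlib import Path


def fetch_config(url: str, dest_dir: str = ".") -> Path:
    """Download a project configuration file and return its local path.

    Hypothetical helper for illustration; it is not provided by aidata.
    """
    # Use the last path segment of the URL as the local file name.
    name = url.rstrip("/").rsplit("/", 1)[-1] or "config.yml"
    dest = Path(dest_dir) / name
    # urlopen handles http(s):// URLs as well as local file:// URLs.
    with urllib.request.urlopen(url) as response:
        dest.write_bytes(response.read())
    return dest
```

A call such as `fetch_config("https://example.org/path/to/config.yml", "configs")` would save the file as `configs/config.yml`, provided the `configs` directory already exists.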

UAV Production Pipeline

This is the current production pipeline for the UAV project. It is used to process high-resolution images collected by UAVs. Data from this pipeline is provided externally to drone.mbari.org.

Figure: uav_prod_pipeline.png

UAV Labeling Pipeline

This was the processing pipeline used to label the UAV images. Its output was used to train the two UAV models: one for detecting targets in the high-resolution UAV images and another for classifying them.

Figure: uav_label_pipeline.png

Data Mining Pipeline

This pipeline was used to mine data from the MBARI video archive.

Figure: bio_data_mine_pipeline.png

I2MAP Labeling Pipeline

This pipeline was used for initial I2MAP labeling. Images were extracted from ProRes videos and run through an accurate single-class "Megadetector" to find potential targets, which were then assigned to the closest match by exemplar.

Figure: i2map_label_pipeline.png
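The exemplar-matching step can be illustrated with a small sketch. It assumes each detected target and each labeled exemplar is represented by an embedding vector and that "closest" means highest cosine similarity. Neither assumption is stated above, and the class names are made up for the example.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def assign_by_exemplar(embedding, exemplars):
    """Return (label, similarity) of the exemplar closest to `embedding`.

    `exemplars` is a list of (label, vector) pairs; the representation is
    an assumption made for this sketch.
    """
    best_label, best_score = None, -math.inf
    for label, vector in exemplars:
        score = cosine_similarity(embedding, vector)
        if score > best_score:
            best_label, best_score = label, score
    return best_label, best_score


# Hypothetical exemplars and one detection embedding.
exemplars = [("jelly", [1.0, 0.0, 0.0]), ("fish", [0.0, 1.0, 0.0])]
label, score = assign_by_exemplar([0.9, 0.1, 0.0], exemplars)
```

In practice a similarity threshold is often added so that detections far from every exemplar are left unlabeled instead of being forced into a class.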

I2MAP Video

This is a short example of I2MAP video processed by a model trained on pseudo-labeled I2MAP data. The RF-DETR model was used to detect and classify the targets in the video, and a simple ByteTrack tracker was used for tracking.

Figure: video_processed.png
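The core idea behind ByteTrack is a two-stage association: high-confidence detections are matched to existing tracks first, and low-confidence detections are then offered to the tracks that remain unmatched. The sketch below illustrates only that idea with greedy IoU matching. It is not the tracker used for this video: ByteTrack itself also uses Kalman-filter motion prediction and optimal (Hungarian) assignment, and the thresholds and data layout here are assumptions.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def greedy_match(tracks, detections, min_iou):
    """Greedily pair track ids with detection indices, highest IoU first."""
    pairs = sorted(
        ((iou(box, det["box"]), tid, j)
         for tid, box in tracks.items()
         for j, det in enumerate(detections)),
        key=lambda p: p[0],
        reverse=True,
    )
    matches, used_tracks, used_dets = {}, set(), set()
    for score, tid, j in pairs:
        if score < min_iou:
            break  # every remaining pair overlaps even less
        if tid in used_tracks or j in used_dets:
            continue
        matches[tid] = j
        used_tracks.add(tid)
        used_dets.add(j)
    return matches


def associate(tracks, detections, high=0.5, min_iou=0.3):
    """Two-stage association: high-score detections, then low-score ones."""
    high_dets = [d for d in detections if d["score"] >= high]
    low_dets = [d for d in detections if d["score"] < high]
    first = greedy_match(tracks, high_dets, min_iou)
    # Only tracks left unmatched in stage one see the low-score detections.
    leftover = {tid: box for tid, box in tracks.items() if tid not in first}
    second = greedy_match(leftover, low_dets, min_iou)
    assigned = {tid: high_dets[j] for tid, j in first.items()}
    assigned.update({tid: low_dets[j] for tid, j in second.items()})
    return assigned


# Hypothetical previous-frame tracks and current-frame detections.
tracks = {1: (0, 0, 10, 10), 2: (20, 20, 30, 30)}
detections = [
    {"box": (1, 1, 11, 11), "score": 0.9},
    {"box": (21, 21, 31, 31), "score": 0.2},
]
assigned = associate(tracks, detections)
```

Here track 1 is matched in the first stage and track 2 is recovered in the second stage by the low-score detection, which a tracker that discards low-confidence boxes would have missed.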

🔗 📖 Kudela Plankton Reference

Hawaii ISIIS Clustering Pipeline

This pipeline was used to discover data in videos collected in Hawaii with the ISIIS camera. It is the stage after images were extracted from the videos and then run through a detector to find potential regions of interest (ROIs).

Figure: hawaii_cluster_pipeline.png

High Magnification Clustering Pipeline

This pipeline was used to seed data for the Planktivore model to classify particles. Here, a classification model closer to the domain was trained on the publicly available IFCB dataset and then used to cluster the particles.

Figure: ptvr_cluster_pipeline.png


🗓️ Updated: 2026-02-16