Developing with aidata

aidata is a command-line tool for basic extract, transform, load (ETL) and download operations on AI data. It supports a number of projects at MBARI that require detection, clustering, or classification.

You will need to install Python, Docker, and Just for development.
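
Before starting, it can help to confirm that the prerequisites are on your PATH. The sketch below is one way to do that. The helper name `check_tools` is hypothetical, the executable names (`python`, `docker`, `just`) are assumed to match your installation, and `conda` is included only because step 1 uses it.

```shell
# check_tools is a hypothetical helper, not part of aidata.
# It reports each tool as found or missing and returns non-zero if any is missing.
check_tools() {
    missing=0
    for tool in "$@"; do
        if command -v "$tool" >/dev/null 2>&1; then
            echo "found: $tool"
        else
            echo "missing: $tool" >&2
            missing=1
        fi
    done
    return "$missing"
}

# Executable names are assumptions; adjust them (e.g. python3) for your system.
check_tools python docker just conda || echo "Install the missing tools before continuing."
```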

1. Check out the repository and install the Anaconda environment

git clone http://github.com/mbari-org/aidata.git
cd aidata
conda env create 
conda activate aidata
export PYTHONPATH=$PWD

2. Launch a Tator stack

Instructions are in database_setup.md. Tator uses a PostgreSQL database to store metadata and a Redis database for caching.
This is the same database used in the FathomNet project.

3. Set up the remaining services

This sets up the remaining services needed for development, including Redis and NGINX servers.
```shell
cd aidata
just setup-docker-dev  # or follow the instructions in the Justfile
```

When you navigate to http://localhost:8082/data, you should see a list of images served from the tests/data directory. This is useful for testing the module's image-serving capabilities.
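
The same check can be run from the command line. This is a minimal sketch that assumes `curl` is installed and that the development stack is listening on the URL above. The helper name `check_image_server` is hypothetical and not part of aidata.

```shell
# check_image_server is a hypothetical helper, not part of aidata.
# It succeeds only if the URL answers without an HTTP error (redirects are followed).
check_image_server() {
    curl --fail --silent --show-error --location --max-time 5 --output /dev/null "$1"
}

if check_image_server "http://localhost:8082/data"; then
    echo "image server is up"
else
    echo "image server is not reachable; is the Docker stack running?"
fi
```

If the check fails, re-running `just setup-docker-dev` and inspecting the container logs is a reasonable first step.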

Set your Tator token in the TATOR_TOKEN environment variable, or pass it as an option with --token.

export TATOR_TOKEN=15afoobaryouraccesstoken
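
To catch a missing token early, a script can test the variable before calling the tool. The sketch below assumes the token is supplied through `TATOR_TOKEN` as described above. The helper name `require_tator_token` is hypothetical, and it prints only the token length so the secret stays out of logs.

```shell
# require_tator_token is a hypothetical helper, not part of aidata.
# It returns non-zero with a message on stderr when TATOR_TOKEN is empty or unset.
require_tator_token() {
    if [ -z "${TATOR_TOKEN:-}" ]; then
        echo "TATOR_TOKEN is not set; export it or pass --token" >&2
        return 1
    fi
    echo "TATOR_TOKEN is set (${#TATOR_TOKEN} characters)"
}

require_tator_token || echo "Set the token before running aidata commands."
```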


🗓️ Updated: 2026-06-03