Download this notebook by clicking on the download icon logo or find it in the repository

PacificSound16kHz

Distributed under the terms of the GPL License
Maintainer: dcline@mbari.org
Authors: Danelle Cline dcline@mbari.org, John Ryan ryjo@mbari.org

Basic Exploration of the 16 kHz Pacific Ocean Audio Data in the AWS Open Data Registry¶

An extensive (5+ years and growing) archive of sound recordings from a deep-sea location along the eastern margin of the North Pacific Ocean has been made available through AWS Open data. Temporal coverage of the recording archive has been 95% since project inception in July 2015. The original recordings have a sample rate of 256 kHz. For many research applications it is convenient to work with data having a lower sample rate. This notebook illustrates basic methods to access and process a calibrated spectrogram from the decimated 16 kHz audio archive.

If you use this data set, please cite our project.

Data Overview¶

The decimated audio data are in WAV format in an s3 bucket named pacific-sound-16khz. They are further organized by year and month. Buckets are stored as objects, so the data isn't physically stored in folders or directories as you may be famaliar with, but you can think of it conceptually as follows:

pacific-sound-16khz
      |
      ----2020
        |
        |----01
        ...
        |----12

Install required dependencies¶

First, let's install the required software dependencies.

If you are using this notebook in a cloud environment, select a Python3 compatible kernel and run this next section. This only needs to be done once for the duration of this notebook.

If you are working on local computer, you can skip this next cell. Change your kernel to pacific-sound-notebooks, which you installed according to the instructions in the README - this has all the dependencies that are needed.

In [1]:

Copied!





!pip install -q boto3 --quiet
!pip install -q soundfile --quiet
!pip install -q scipy --quiet
!pip install -q numpy --quiet
!pip install -q matplotlib --quiet
!pip install -q boto3 --quiet
!pip install -q soundfile --quiet
!pip install -q scipy --quiet
!pip install -q numpy --quiet
!pip install -q matplotlib --quiet

     |██▌                             | 10 kB 25.4 MB/s eta 0:00:01
     |█████                           | 20 kB 21.5 MB/s eta 0:00:01
     |███████▍                        | 30 kB 28.3 MB/s eta 0:00:01
     |██████████                      | 40 kB 13.4 MB/s eta 0:00:01
     |████████████▍                   | 51 kB 12.3 MB/s eta 0:00:01
     |██████████████▉                 | 61 kB 13.9 MB/s eta 0:00:01
     |█████████████████▎              | 71 kB 13.6 MB/s eta 0:00:01
     |███████████████████▉            | 81 kB 14.6 MB/s eta 0:00:01
     |██████████████████████▎         | 92 kB 15.9 MB/s eta 0:00:01
     |████████████████████████▊       | 102 kB 15.0 MB/s eta 0:00:01
     |███████████████████████████▏    | 112 kB 15.0 MB/s eta 0:00:01
     |█████████████████████████████▊  | 122 kB 15.0 MB/s eta 0:00:01
     |████████████████████████████████| 132 kB 15.0 MB/s 
     |████████████████████████████████| 79 kB 8.3 MB/s 
     |████████████████████████████████| 9.2 MB 60.8 MB/s 
     |████████████████████████████████| 140 kB 84.4 MB/s 
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
requests 2.23.0 requires urllib3!=1.25.0,!=1.25.1,<1.26,>=1.21.1, but you have urllib3 1.26.12 which is incompatible.

Import all packages¶

In [2]:

Copied!





import boto3
from botocore import UNSIGNED
from botocore.client import Config
from six.moves.urllib.request import urlopen
from pathlib import Path
import io
import os
import scipy
from scipy import signal
import numpy as np
import soundfile as sf
import matplotlib.pyplot as plt
import boto3
from botocore import UNSIGNED
from botocore.client import Config
from six.moves.urllib.request import urlopen
from pathlib import Path
import io
import os
import scipy
from scipy import signal
import numpy as np
import soundfile as sf
import matplotlib.pyplot as plt

List the contents of a monthly directory¶

In [3]:

Copied!





s3_client = boto3.client('s3',
    aws_access_key_id='',
    aws_secret_access_key='', 
    config=Config(signature_version=UNSIGNED))
s3_client = boto3.client('s3',
    aws_access_key_id='',
    aws_secret_access_key='', 
    config=Config(signature_version=UNSIGNED))

In [4]:

Copied!





year = "2018"
month = "01"
bucket = 'pacific-sound-16khz'

for obj in s3_client.list_objects_v2(Bucket=bucket, Prefix=f'{year}/{month}')['Contents']:
    print(obj['Key'])
year = "2018"
month = "01"
bucket = 'pacific-sound-16khz'

for obj in s3_client.list_objects_v2(Bucket=bucket, Prefix=f'{year}/{month}')['Contents']:
    print(obj['Key'])

2018/01/MARS-20180101T000000Z-16kHz.wav
2018/01/MARS-20180102T000000Z-16kHz.wav
2018/01/MARS-20180103T000000Z-16kHz.wav
2018/01/MARS-20180104T000000Z-16kHz.wav
2018/01/MARS-20180105T000000Z-16kHz.wav
2018/01/MARS-20180106T000000Z-16kHz.wav
2018/01/MARS-20180107T000000Z-16kHz.wav
2018/01/MARS-20180108T000000Z-16kHz.wav
2018/01/MARS-20180109T000000Z-16kHz.wav
2018/01/MARS-20180110T000000Z-16kHz.wav
2018/01/MARS-20180111T000000Z-16kHz.wav
2018/01/MARS-20180112T000000Z-16kHz.wav
2018/01/MARS-20180113T000000Z-16kHz.wav
2018/01/MARS-20180114T000000Z-16kHz.wav
2018/01/MARS-20180115T000000Z-16kHz.wav
2018/01/MARS-20180116T000000Z-16kHz.wav
2018/01/MARS-20180117T000000Z-16kHz.wav
2018/01/MARS-20180118T000000Z-16kHz.wav
2018/01/MARS-20180119T000000Z-16kHz.wav
2018/01/MARS-20180123T000000Z-16kHz.wav
2018/01/MARS-20180124T000000Z-16kHz.wav
2018/01/MARS-20180125T000000Z-16kHz.wav
2018/01/MARS-20180126T000000Z-16kHz.wav
2018/01/MARS-20180127T000000Z-16kHz.wav
2018/01/MARS-20180128T000000Z-16kHz.wav
2018/01/MARS-20180129T000000Z-16kHz.wav
2018/01/MARS-20180130T000000Z-16kHz.wav
2018/01/MARS-20180131T000000Z-16kHz.wav

Retrieve metadata for a file¶

In [5]:

Copied!





year = 2018
month = 1
filename = 'MARS-20180101T000000Z-16kHz.wav'
bucket = 'pacific-sound-16khz'
key = f'{year:04d}/{month:02d}/{filename}'

url = f'https://{bucket}.s3.amazonaws.com/{key}'

sf.info(io.BytesIO(urlopen(url).read(20_000)), verbose=True)
year = 2018
month = 1
filename = 'MARS-20180101T000000Z-16kHz.wav'
bucket = 'pacific-sound-16khz'
key = f'{year:04d}/{month:02d}/{filename}'

url = f'https://{bucket}.s3.amazonaws.com/{key}'

sf.info(io.BytesIO(urlopen(url).read(20_000)), verbose=True) 

Out[5]:

<_io.BytesIO object at 0x7fc35b8faef0>
samplerate: 16000 Hz
channels: 1
duration: 6556 samples
format: WAV (Microsoft) [WAV]
subtype: Signed 24 bit PCM [PCM_24]
endian: FILE
sections: 1
frames: 6556
extra_info: """
    Length : 20000
    RIFF : 4147200324 (should be 19992)
    WAVE
    fmt  : 16
      Format        : 0x1 => WAVE_FORMAT_PCM
      Channels      : 1
      Sample Rate   : 16000
      Block Align   : 3
      Bit Width     : 24
      Bytes/sec     : 48000
    LIST : 280
      INFO
        INAM : MBARI ocean audio data, start 20180101T000000 UTC
        ICMT : If you use these data, please cite https://doi.org/10.1109/OCEANS.2016.7761363. Recording metadata can be found at https://bitbucket.org/mbari/pacific-sound/src/master/MBARI_MARS_Hydrophone_Deployment02.json.
    data : 4147200000 (should be 19668)
    End
    """

Calibrated Spectrum Levels¶

The file metadata retrieved above includes a link to a json file containing complete deployment metadata. For convenience, those links (one for each hydrophone deployment) are:

Frequency-dependent hydrophone sensitivity data are included in these json files. Considering the 16 kHz data, hydrophone sensitivity at only the two lowest frequencies are relevant. For the example file identified above, which is from the second hydrophone deployment, these sensitivities at 250 Hz and 10 kHz are -177.90 and -176.80 dB V re $\mu$Pa, respectively. For simplicity in this example, we can apply the average sensitivity, -177.35 dB V re $\mu$Pa.

Save the daily file¶

In [6]:

Copied!





# The 16 kHz daily file may be too large to fit into memory for some instances, so let's download it first
# then seek into the file to read a chunk later
print(f'Copying s3://{bucket}/{key}')
  
# only download if needed
if not Path(filename).exists():
    s3 = boto3.resource('s3',
        aws_access_key_id='',
        aws_secret_access_key='',
        config=Config(signature_version=UNSIGNED))

    # Alternatively, it can be downloaded directly in SageMaker with
    # !aws s3 cp s3://{bucket}/{key} . 

    print('Downloading') 
    s3.Bucket(bucket).download_file(key, filename)
    print('Done')
# The 16 kHz daily file may be too large to fit into memory for some instances, so let's download it first
# then seek into the file to read a chunk later
print(f'Copying s3://{bucket}/{key}')
  
# only download if needed
if not Path(filename).exists():
    s3 = boto3.resource('s3',
        aws_access_key_id='',
        aws_secret_access_key='',
        config=Config(signature_version=UNSIGNED))

    # Alternatively, it can be downloaded directly in SageMaker with
    # !aws s3 cp s3://{bucket}/{key} . 

    print('Downloading') 
    s3.Bucket(bucket).download_file(key, filename)
    print('Done')

Copying s3://pacific-sound-16khz/2018/01/MARS-20180101T000000Z-16kHz.wav
Downloading
Done

Read hours 9-12 of the day¶

In [7]:

Copied!





sample_rate = int(16e3)
start_frame = int(sample_rate*9*3600)
duration_frames =  int(sample_rate*3*3600)

pacsound_file = sf.SoundFile(filename)
pacsound_file.seek(start_frame)
x = pacsound_file.read(duration_frames, dtype='float32')
sample_rate = int(16e3)
start_frame = int(sample_rate*9*3600)
duration_frames =  int(sample_rate*3*3600)

pacsound_file = sf.SoundFile(filename)
pacsound_file.seek(start_frame)
x = pacsound_file.read(duration_frames, dtype='float32')    

In [8]:

Copied!





v = x*3   # convert scaled voltage to volts
v.shape, v.size, sample_rate
a = np.arange(v.size)+1
# define segment processing
nsec = (v.size)/sample_rate # number of seconds in vector
spa = 60  # seconds per average
nseg = int(nsec/spa)
print(nseg,'segments of length',spa,'seconds','in',nsec,'seconds of audio')
v = x*3   # convert scaled voltage to volts
v.shape, v.size, sample_rate
a = np.arange(v.size)+1
# define segment processing
nsec = (v.size)/sample_rate # number of seconds in vector
spa = 60  # seconds per average
nseg = int(nsec/spa)
print(nseg,'segments of length',spa,'seconds','in',nsec,'seconds of audio')

180 segments of length 60 seconds in 10800.0 seconds of audio

In [9]:

Copied!





# initialize empty spectrogram matrix
nfreq = int(sample_rate/2+1)
nfreq,nseg
sg = np.empty((nfreq, nseg), float)
sg.shape
# initialize empty spectrogram matrix
nfreq = int(sample_rate/2+1)
nfreq,nseg
sg = np.empty((nfreq, nseg), float)
sg.shape

Out[9]:

(8001, 180)

In [10]:

Copied!





# get window for welch
w = scipy.signal.get_window('hann',sample_rate)

# process spectrogram
for x in range(0,nseg):
  cstart = x*spa*sample_rate
  cend = (x+1)*spa*sample_rate
  f,psd = scipy.signal.welch(v[cstart:cend],fs=sample_rate,window=w,nfft=sample_rate)
  psd = 10*np.log10(psd) + 177.35
  sg[:,x] = psd
# get window for welch
w = scipy.signal.get_window('hann',sample_rate)

# process spectrogram
for x in range(0,nseg):
  cstart = x*spa*sample_rate
  cend = (x+1)*spa*sample_rate
  f,psd = scipy.signal.welch(v[cstart:cend],fs=sample_rate,window=w,nfft=sample_rate)
  psd = 10*np.log10(psd) + 177.35
  sg[:,x] = psd

Plot the spectrogram of the 6-hour window¶

Note: The sharp drop in signal approaching 8 kHz reflects the attributes of the decimation filter applied to produce the 16 kHz data from the original 256 kHz data.

In [11]:

Copied!





plt.figure(dpi=300)
im = plt.imshow(sg,aspect='auto',origin='lower',vmin=30,vmax=100)
plt.yscale('log')
plt.ylim(10,8000)
plt.colorbar(im)
plt.annotate("humpback whale song (above ~70 Hz)",(0,180))
plt.annotate("blue whale song (B-call 3rd harmonic)",(100,40))
plt.annotate("fin whale song (20 Hz pulses)",(100,18))
plt.xlabel('Minute of day')
plt.ylabel('Frequency (Hz)')
plt.title('Calibrated spectrum levels')
plt.figure(dpi=300)
im = plt.imshow(sg,aspect='auto',origin='lower',vmin=30,vmax=100)
plt.yscale('log')
plt.ylim(10,8000)
plt.colorbar(im)
plt.annotate("humpback whale song (above ~70 Hz)",(0,180))
plt.annotate("blue whale song (B-call 3rd harmonic)",(100,40))
plt.annotate("fin whale song (20 Hz pulses)",(100,18))
plt.xlabel('Minute of day')
plt.ylabel('Frequency (Hz)')
plt.title('Calibrated spectrum levels')

Out[11]:

Text(0.5, 1.0, 'Calibrated spectrum levels')

No description has been provided for this image

Zoom-in using a 1-second resolution to see a more detailed of whale song¶

The temporal resolution of the full-day spectrogram above is too coarse to see the structure of individual whale songs. So, let's produce and view a calibrated spectrogram with more detail.

In [12]:

Copied!





# examine the 9th hour of the day at 1 second resolution
spa = 1  # seconds per average
nseg = 3600
sg2 = np.empty((nfreq, nseg), float)
start_sample = int(3600*sample_rate+1)
sg2.shape
# examine the 9th hour of the day at 1 second resolution
spa = 1  # seconds per average
nseg = 3600
sg2 = np.empty((nfreq, nseg), float)
start_sample = int(3600*sample_rate+1)
sg2.shape

Out[12]:

(8001, 3600)

In [13]:

Copied!





# process spectrogram
for x in range(0,nseg):
  cstart = start_sample + x*spa*sample_rate
  cend = cstart+spa*sample_rate
  f,psd = scipy.signal.welch(v[cstart:cend],fs=sample_rate,window=w,nfft=sample_rate)
  psd = 10*np.log10(psd) + 177.35
  sg2[:,x] = psd
# process spectrogram
for x in range(0,nseg):
  cstart = start_sample + x*spa*sample_rate
  cend = cstart+spa*sample_rate
  f,psd = scipy.signal.welch(v[cstart:cend],fs=sample_rate,window=w,nfft=sample_rate)
  psd = 10*np.log10(psd) + 177.35
  sg2[:,x] = psd

/usr/local/lib/python3.7/dist-packages/ipykernel_launcher.py:6: RuntimeWarning: divide by zero encountered in log10

In [14]:

Copied!





# plot it
plt.figure(dpi=300)
im = plt.imshow(sg2,aspect='auto',origin='lower',vmin=30,vmax=100)
plt.yscale('log')
plt.ylim(10,8000)
plt.colorbar(im)
plt.annotate("humpback whale song",(0,180))
plt.annotate("blue whale song",(1200,40))
plt.annotate("fin whale song",(1400,18))
plt.annotate("earthquake",(150,11))
plt.xlabel('Second of hour')
plt.ylabel('Frequency (Hz)')
plt.title('Calibrated spectrum levels')
# plot it
plt.figure(dpi=300)
im = plt.imshow(sg2,aspect='auto',origin='lower',vmin=30,vmax=100)
plt.yscale('log')
plt.ylim(10,8000)
plt.colorbar(im)
plt.annotate("humpback whale song",(0,180))
plt.annotate("blue whale song",(1200,40))
plt.annotate("fin whale song",(1400,18))
plt.annotate("earthquake",(150,11))
plt.xlabel('Second of hour')
plt.ylabel('Frequency (Hz)')
plt.title('Calibrated spectrum levels')

Out[14]:

Text(0.5, 1.0, 'Calibrated spectrum levels')