How to configure audio preprocessing

Use this guide to set sample-rate and waveform-level preprocessing behaviour.

1) Set audio loader settings

The audio loader config controls resampling.

samplerate: 256000
resample:
  enabled: true
  method: poly

If your recordings are already at the expected sample rate, you can disable resampling.

samplerate: 256000
resample:
  enabled: false

Waveform transforms are configured in preprocess.audio_transforms.

preprocess:
  audio_transforms:
    - name: center_audio
    - name: scale_audio
    - name: fix_duration
      duration: 0.5

Available built-ins:

For CLI inference/evaluation, use --audio-config.

batdetect2 predict directory \
  path/to/model.ckpt \
  path/to/audio_dir \
  path/to/outputs \
  --audio-config path/to/audio.yaml

Run on a small folder first and confirm that outputs and runtime are as expected before full-batch runs.