lionel/batdetect2

mirror of https://github.com/macaodha/batdetect2.git synced 2026-04-04 15:20:19 +02:00

mbsantiago 67bb66db3c Incorporate previous docs into new structure

2026-03-28 19:42:09 +00:00

1.4 KiB

Raw Blame History

Target encoding and decoding

batdetect2 turns annotated sound events into training targets, then maps model outputs back into interpretable predictions.

Encoding path (annotations -> model targets)

At training time, the target system:

checks whether an event belongs to the configured detection target,
assigns a classification label (or none for non-specific class matches),
maps event geometry into position and size targets.

This behaviour is configured through TargetConfig, TargetClassConfig, and ROI mapper settings.

Decoding path (model outputs -> tags and geometry)

At inference time, class labels and ROI parameters are decoded back into annotation tags and geometry.

This makes outputs interpretable in the same conceptual space as your original annotations.

Why this matters

Target definitions are not just metadata. They directly shape:

what events are treated as positive examples,
which class names the model learns,
how geometry is represented and reconstructed.

Small changes here can alter both training outcomes and prediction semantics.

Configure detection target logic: {doc}../how_to/configure-target-definitions
Configure class mapping: {doc}../how_to/define-target-classes
Configure ROI mapping: {doc}../how_to/configure-roi-mapping
Target config reference: {doc}../reference/targets-config-workflow