Audio data annotation with NEAL (Nature + Energy Audio labeler).

Overview
Creative Commons License: CC-BY Questions:
Objectives:
  • Learning the use of a Galaxy interactive tool

  • Learning the use of NEAL interactive tool

  • Introduction to Audio annotation

Requirements:
Time estimation: 1 hour
Supporting Materials:
Published: Oct 14, 2025
Last modification: Oct 14, 2025
License: Tutorial Content is licensed under Creative Commons Attribution 4.0 International License. The GTN Framework is licensed under MIT
version Revision: 1

This tutorial is designed to guide you through the annotation of audio data via the NEAL interactive tool.

NEAL (Nature+Energy Audio Labeller) A et al. 2023 is an open-source interactive audio data annotation tool. Built using R and the associated R Shiny framework, the tool provides a reactive environment where users can quickly annotate audio files and adjust settings which automatically change the corresponding elements of the user interface. The app has been designed with the goal of having both expert birders and citizen scientists contribute to acoustic annotation projects GIBBONS.

The data we will work with in this tutorial is a 0:43min recording of a common blackbird (Turdus merula) recorded in Armenia above Alaverdi town. For more information, you can see metadata on the xeno-canto site.

In this tutorial, you will learn how to import an audio file into Galaxy, run the NEAL interactive tool, annotate the spectrogram, and export the resulting annotation file.

Many of you may be new to Galaxy or to audio annotation, so we will go step by step. If you’re new to Galaxy, feel free to check out other tutorials like A short introduction to Galaxy or Galaxy Basics for everyone.

Agenda

In this tutorial, we will cover:

  1. Get data
  2. Run NEAL interactive tool
  3. Inside the NEAL interactive tool
    1. Start session
    2. Listen to the audio
    3. Annotate the audio file
    4. Download the label file
    5. Exit the interactive tool
    6. Shutdown the tool
  4. Rerun NEAL interactive tool with a previous labels file
  5. Conclusion

Get data

So, the first step is to your audio file into Galaxy. In this tutorial, we will use the recording mentioned earlier, but feel free to use your own file(s). The NEAL interactive tool allows working on multiple files, so you can upload the number of files you want.

Hands On: Data Upload
  1. Create a new history for this tutorial and give it a name, e.g., “NEAL tutorial” for you to find it again later if needed.

    To create a new history simply click the new-history icon at the top of the history panel:

    UI for creating new history

    1. Click on galaxy-pencil (Edit) next to the history name (which by default is “Unnamed history”)
    2. Type the new name
    3. Click on Save
    4. To cancel renaming, click the galaxy-undo “Cancel” button

    If you do not have the galaxy-pencil (Edit) next to the history name (which can be the case if you are using an older version of Galaxy) do the following:

    1. Click on Unnamed history (or the current name of the history) (Click to rename history) at the top of your history panel
    2. Type the new name
    3. Press Enter

  2. Import the example audio file from xeno-canto

    Audio file :

    https://xeno-canto.org/1008421/download
    
    • Copy the link location
    • Click galaxy-upload Upload Data at the top of the tool panel

    • Select galaxy-wf-edit Paste/Fetch Data
    • Paste the link(s) into the text field

    • Press Start

    • Close the window

    As an alternative to uploading the data from a URL or your computer, the files may also have been made available from a Choose remote files:

    1. Click on Upload Data on the top of the left panel
    2. Click on Choose remote files and scroll down to find your data folder or type the folder name in the search box on the top.

    3. click on OK
    4. Click on Start
    5. Click on Close
    6. You can find the dataset has begun loading in you history.

  3. Rename the dataset (default name: Download) to something more explicit, e.g., 1008421.wav to have a name linked to the xeno-canto identifier.

    • Click on the galaxy-pencil pencil icon for the dataset to edit its attributes
    • In the central panel, change the Name field
    • Click the Save button

Note that you also can provide the 1008421.wav filename on the “Paste/Fetch data” upload module to avoid renaming after importing the file.

  1. Make sure the dataset’s datatype is .wav, as the tool only supports this format.

    • Click on the galaxy-pencil pencil icon for the dataset to edit its attributes
    • In the central panel, click galaxy-chart-select-data Datatypes tab on the top
    • In the galaxy-chart-select-data Assign Datatype, select your desired datatype from “New Type” dropdown
      • Tip: you can start typing the datatype into the field to filter the dropdown menu
    • Click the Save button

  2. You can add a tag to your dataset if you need to distinguish multiple datasets in your history.

    Datasets can be tagged. This simplifies the tracking of datasets across the Galaxy interface. Tags can contain any combination of letters or numbers but cannot contain spaces.

    To tag a dataset:

    1. Click on the dataset to expand it
    2. Click on Add Tags galaxy-tags
    3. Add tag text. Tags starting with # will be automatically propagated to the outputs of tools using this dataset (see below).
    4. Press Enter
    5. Check that the tag appears below the dataset name

    Tags beginning with # are special!

    They are called Name tags. The unique feature of these tags is that they propagate: if a dataset is labelled with a name tag, all derivatives (children) of this dataset will automatically inherit this tag (see below). The figure below explains why this is so useful. Consider the following analysis (numbers in parenthesis correspond to dataset numbers in the figure below):

    1. a set of forward and reverse reads (datasets 1 and 2) is mapped against a reference using Bowtie2 generating dataset 3;
    2. dataset 3 is used to calculate read coverage using BedTools Genome Coverage separately for + and - strands. This generates two datasets (4 and 5 for plus and minus, respectively);
    3. datasets 4 and 5 are used as inputs to Macs2 broadCall datasets generating datasets 6 and 8;
    4. datasets 6 and 8 are intersected with coordinates of genes (dataset 9) using BedTools Intersect generating datasets 10 and 11.

    A history without name tags versus history with name tags

    Now consider that this analysis is done without name tags. This is shown on the left side of the figure. It is hard to trace which datasets contain “plus” data versus “minus” data. For example, does dataset 10 contain “plus” data or “minus” data? Probably “minus” but are you sure? In the case of a small history like the one shown here, it is possible to trace this manually but as the size of a history grows it will become very challenging.

    The right side of the figure shows exactly the same analysis, but using name tags. When the analysis was conducted datasets 4 and 5 were tagged with #plus and #minus, respectively. When they were used as inputs to Macs2 resulting datasets 6 and 8 automatically inherited them and so on… As a result it is straightforward to trace both branches (plus and minus) of this analysis.

    More information is in a dedicated #nametag tutorial.

Run NEAL interactive tool

All the work will be done inside the NEAL interactive tool, so let’s run it.

Hands On: Run NEAL interactive tool
  1. NEAL with the following parameters:
    • param-file “Input sound data file(s) in wav format”: 1008421.wav
    • param-file “Do you already have a labels file?”: Start with a fresh labels file
  2. Wait for the tool to deploy it. It may take few a minutes, but you will see a message like this:

    Wait status. Open image in new tab

    Figure 1: Wait status example
  3. You can now display the interactive interface. You can display it with different methods:

    1. You can open it clicking ont the status message “Open”
    Open status. Open image in new tab

    Figure 2: Open status example
    1. You can open it by clicking on the link next to the red square in the “Launch and manage interactive tool” panel.
    Manage interactive tool. Open image in new tab

    Figure 3: Manage interactive tools panel
    1. You can also click on the eye icon on the “yellow” “NEAL on dataX” history dataset.
    History status. Open image in new tab

    Figure 4: History status

Inside the NEAL interactive tool

Congratulations 🎉 You are now inside the NEAL interface. Let’s explore the main features.

Start session

To start annotating your audio file, you will need to start the session.

Hands On: Start session
  1. Click on the user icon (top left).

  2. Click the green Start labelling button.
    Green button. Open image in new tab

    Figure 5: Green button example
  3. The sonogram of the audio file will appear.
    Sonogram. Open image in new tab

    Figure 6: Sonogram example

Listen to the audio

Hands On

Use the play button below the sonogram to listen to the audio recording.
You can pause, stop, or replay as needed. And, adjust playback speed if necessary.

Warning

As you can see, regardless of the total duration of your file, NEAL lets you listen to the audio in 15-second segments.

Annotate the audio file

Add a label

If this is your first annotation, you will need to create a label (e.g. “Blackbird song”, “Noise”, etc.).

Hands On: Add a label
  1. Type the label name (e.g. Turdus merula).
  2. Click the Add button in the Category buttons section.

    Add labels. Open image in new tab

    Figure 7: Add labels

For this tutorial, you will only need the Turdus merula label you just added and the Insect Noise label, already present in the Class List: Richfield.

Class list. Open image in new tab

Figure 8: Class list

Select a time/frequency region

Hands On
  1. Click and drag on the sonogram to select a time–frequency region.

    The selected area will be highlighted. Here we can try to two different frequencies ranges :

    • One selection between 1 kHz and 10 kHz:
    Selection on sonogram. Open image in new tab

    Figure 9: Selection on sonogram
    • One selection between 0kHz and 1kHz:
    Selection on sonogram second example. Open image in new tab

    Figure 10: Selection on sonogram second example
Question
  1. Do you hear different sounds between the two frequencies we select?
  1. There are two sounds:
    • In the higher frequencies, you can hear Turdus merula sounds
    • In the lower frequencies, you can hear a flying insect noise.

Annotate the selection

After selection of an area, you can assign it a label.

Hands On: Add a label to the selected area
  1. After selecting a region, choose the label that matches what you hear in the Class list.

  2. Click on the Save selection button

    • This will link the time–frequency selection to a label.
    • You can now annotate the Turdus merula sound…
Merula noise. Open image in new tab

Figure 11: Merula Noise
  - ... and the insect noise we selected in the previous step. 
Insect noise. Open image in new tab

Figure 12: Insect Noise

Download the label file

Once you have finished annotating your file you can save and download your annotations.

Hands On
  1. Export the labels by downloading the CSV file containing your annotations. Simply click the download button in the Label buttons area:

    Labels save. Open image in new tab

    Figure 13: Labels save

    This CSV file can be re-used later to resume annotation.

Exit the interactive tool

After saving your file you can exit the NEAL interactive tool.

Hands On
  1. Click on the End Labelling button to end labelling

    Your sonogram will disappear.

  2. Click on the log out button before leaving the NEAL interface.

    Log out example. Open image in new tab

    Figure 14: Log out example

Shutdown the tool

Back on Galaxy, you can now Shutdown the NEAL interactive tool to free resources.

Hands On

To do so you can :

  1. Click on the square on the tool running in your galaxy history.
Shutdown example. Open image in new tab

Figure 15: Shutdown example
  1. Click on the red square in the “launch and manage interactive tools” panel.
Shutdown example 2. Open image in new tab

Figure 16: Shutdown example 2
Warning

Don’t worry if the NEAL tool appears in red in your history after shutting it down. This is due to how Galaxy handles the termination of interactive tools and won’t impact your work.

Rerun NEAL interactive tool with a previous labels file

If you need to continue your work on this audio file later you can restart the NEAL interactive tool using the CSV file containing your previous labels.

Hands On: Rerun NEAL interactive tool
  1. NEAL with the following parameters:
    • param-file “Input sound data file(s) in wav format”: 1008421.wav
    • param-file “Do you already have a labels file?”: Load a previous labels file
    • param-file “Input sound data labels file in csv format”: my_labels_file.csv

You can now continue your work from where you left off.

Warning

Using it into Galaxy, NEAL does not automatically save labels in its Class List.
Labels are stored only inside the sonogram annotations.
This means you may need to recreate/re-add labels before adding new annotations.

Conclusion

👏 Congratulations! You have completed the NEAL tutorial.

You have learned how to:

  • Import audio data into Galaxy
  • Launch and use the NEAL interactive tool
  • Annotate audio files
  • Export and re-use annotation files

If you have questions or encounter issues, please check the documentation or contact the Galaxy community for support.