BirdSynthData

This is a holding page:...

The synthesised birds database contains the following:

1. Recordings of bird song taken from xeno-canto.org (converted to .wav).

2. Synthesised versions of these recordings using SMS tools (http://mtg.upf.edu/technologies/sms), a python implementation music synthesis toolkit, in .wav format. These audio files were synthesised using a sinusoid plus residual (SpR) system.

3. A ground truth pitch vector, used by the synthesis system, in .mat format (loadable to matlab).

4. Sine model and residual .wav files are also included.

5. Matlab file "a1_plot_all.m" that plots the original recording's spectrogram, the synthesised recording's spectrogram and the ground truth pitch.

The bird song recordings were grouped into whistles (107 examples, 40m 09s), trills (65 examples, 13m 02s) and nasal (63 examples, 12m 32s) sounds. This data set was used in experiments for work currently submitted to Interspeech 2016.

Page last modified on May 24, 2016