The Ushichka Dataset: waiting to be analysed!

Left: The Uschichka setup with three cameras (small blue lights), a large array and many smaller microphones (not visible). Right: The bats flying around in the cave!

As part of my PhD looking at how bats manage to echolocate in groups, I collected a multichannel audio-video dataset consisting of between 12-22 microphones and three thermal cameras. I call this combined recording system 'Ushichka' (Ушичка) (OO-shi-ch-kaa), the Bulgarian dimunitive for something with multiple ears. The Ushichka system was built with Holger R Goerlitz to record bat groups as they echolocate in their natural habitat. The dataset can provide direct glimpses into how bats change their flight and echolocation behaviours as the group size around them increases.

Why is it interesting?

  • Reconstructing the sensory umwelt of a bat- sensory input to flight behaviour in bat groups:

    The nice part about having so many cameras and microphones is that the sensorimotor decisions of each bat in a group can be tracked very well. How close did a bat fly to its neighbour, when did it swerve away, what did its call look like (duration, spectrum, etc). The cave that they're flying in has also been LiDAR scanned (check out the fly-through video from the Kamburov et al 2018 paper here.

    The combination of 3D video and acoustic tracking along with the cave scans basically provide us a rather 'controlled' environment in which we can infer the sensory inputs and outputs of each bat! This 'multi-modal' dataset allows a detailed study of the what, how and why bats are able to fly and echolocate in groups.

  • Exploiting the natural variation in group size

    The cool thing about studying animals in their natural environment is that you end up observing a larger variety of their behaviours than in a lab setting. For instance in the Orlova Chuka cave, (the field site), bat aggregations grow from a few bats to many bats over the course of an hour or two after sunset. This allows a comparison of how flight and echolocation behaviours change with group size over the course of an evening and across multiple evenings in the same space. Even though the bats leave at more or less the same time in the evening - they return as individuals just before sunrise. The fact that they arrive as individuals is a nice natural 'control' to understand how bats will fly and echolocate as individuals, in comparison to how they do it in groups!.

    The Ushichka dataset consists of short automatically triggered recordings (10-15s length) starting at sunset and going all the way till sun rise. This means each 'night' of recording captures the entire variation in bat behaviour across group sizes in a constant environment.

  • Comparing observed data with predictions

    hello hello

Left: schematic based on our 2019 model showing the different types of sounds in echolocating groups, Right: schematic showing the same masking sounds in the field, and how they can be estimated/measured in the field

I've been working on how bats manage to echolocate in groups over the course of my PhD, and have a series of predictions outlined in this paper. The computational model was built to simulate the acoustical and sensory phenomena bats may be experiencing while echolocating in groups, and was parametrised with experimentally observed values typical of an 'average' bat. The idea would be to parametrise this computational model for the behaviour of the study species in the recordings (Myotis myotis) and run new predictions to see whether the observed changes they show actually lead to decreased masking in the group. The simulations in the paper above were 'static' simulations with no dynamic nature. Real bats of course sense their world dynamically and alter/update their sensorimotor decisions according to the inputs they receive.

The technical details:

  • Frame synchronised and 3D calibrated video recordings with 3 thermal cameras

    That's me doing a calibration run with a common object at the end of the pole. Note: The 2 GIFS here are sometimes displayed with a bit of delay - the actual video data is frame synchronised!

    Three thermal cameras running at 25Hz were automatically triggered to start recording whenever the sound levels increased beyond a threshold. The cameras were TeAx ThermalCapture 2.0 models running with custom trigger circuits.

  • Multi-channel audio recordings

    Close-ups of the audio equipment. Left: Audio jacks, because, cables are cool :). Middle: Tiny mics pinned onto the cave walls and their trailing cables. Right: The actual audio interfaces + amplifiers controlling the audio and video data generation

    I started out with 12 microphones and had 22 mics by the end of the field season! All audio was recorded by a combination of Fireface USB devices running at 192 kHz. I used only one 'standard' array with four microphones (the inverted T-shaped thing visible in the GIF on top) and the rest of the microphones were placed on the walls of the cave to keep the whole recording system as unobtrusive as possible to the bats.

  • Tracking bat aggregations with the combined power of acoustic and video tracking

    The data from the recording system is in the form 10-15 second files with multichannel video at 25Hz and audio recorded at 192 kHz. For every recording in the Uschichka dataset a set of acoustic and video trajectories can be produced from the mics and cameras respectively. The microphones capture bat locations whenever bats emit calls at ~10Hz, while the video trajectory generates bat locations even when the bats are not calling. In addition, the acoustic and video trajectories can be used complementarily when bats occlude each other on camera, or an acoustic localisation is ambiguous. The challenge in analysing the Uschichka dataset is in the audio. Standard acoustic tracking setups have no difficuly tracking a few bats (<5) in open air conditions. The challenge arises in call tracking and trajectory assignment in limited volumes with reverberations. Each of the 12-22 microphones in the Uschichka datset provide a unique 'glimpse' of where a call could be located and thus allow for robust tracking despite the complex audio scene.

Supporting datasets/methods around Uschichka:

  • LiDAR scans of the cave

    The whole recording volume and a sizeable area of the cave around it has been scanned with a LiDAR scanner by Asparuh Kamburov. The details of the scanning and data collection are described in our 2018 report. The dataset serves as a nice basis to investigate how bats flying within the volume make their sensorimotor decisions. Through tracking we can estimate when the bats emitted their calls, and the LiDAR scans allow a calculation of the time of arrival and intensity of the sounds as they reflect off the walls. Some cool 'room acoustics' could be done using basic ray tracing perhaps to simulate sound propagation (with a package like pyroomacoustics?

  • Automatic microphone position calibration

    Working with many microphones that are not on arrays means constantly having to measure their positions and inter-mic distances. While this is feasible for a few mics, the number of measurements to be made with more mice increases horribly fast. While looking for automated solutions to this problem I reached out to Kalle Åström and with example audio playbacks made in the cave, we were able to show the automatic inference of mic positions just from playbacks. If you haven't worked with acoustic localisation this may seem uninteresting, but if you have - you'll be delighted hearing that such things are in development :P!. A part of the Uschichka dataset was used for the paper published on this method. See Batstone et al 2019 for more details.

Taking it forward

This is only a teaser of the whole dataset and the things that could be done with it. It may already be too much information. I'd love to talk with you about potential ways to take the analysis of this dataset forward as a postdoc!


Of course, even though I said that 'I' collected the dataset, the data collection was only truly possible thanks to a whole group of people backing it at every point along the way. Aditya Krishna and Neetash Mysuru helped with data collection in the caves, Antoniya Hubancheva for sparking the whole idea of doing the field work at the Siemers field station itself, and the rest of the Tabachka crew for the awesome times. Благодаря, danke schon, tak, and thanks to all of you (named, and not named) in this section.

The valley below the Tabachka field station!