In this demo we present an audio-driven interface which allows a user to vocalize the sound they want to select and an automatic process matches that input to the most appropriate sound.
This is bonkers (Jump to the demo movie here)  A user isolates portions of audio, not by using a typical UI but by simply mimicking the sound by singing, whistling or grunting(?).