A patcher to better link descriptors and perception

amundsen · December 29, 2022, 8:29pm

Despite the great documentation provided around FluCoMa about sound descriptors, most of them still appear very abstract to me - and I think it’s much worse for my students. So I wanted to have a tool to help figuring out how a specific descriptor can be related to perception.

I have made a patcher based on the corpus explorer patch. Instead of organizing the sounds according to the 91 dimensions of the analysis, each dimension of the analysis can be selected individually in a menu. Then, all the segments are sorted according to this single descriptor and can be played sequentially.

Moreover, as segmentation can be a bit difficult to understand and master, I excluded it from the patcher and rather require the user to provide a folder containing sounds, each sound being considered as a segment.

Also, as the patcher was intended for my students at first, the original version is in French but I translated it into an English version - you get both in the .zip. Les deux versions sont largement commentées.

Please note that the player uses a poly~ object because I needed a polyphonic player for another patch - don’t forget to put it along the patcher. Also, a short 10ms fade-in and fade-out are applied on each sound in the player.

Please let me know of any issue.

I might program some other versions in a near future to help testing other sound descriptors.

Enjoy!

Play sounds sorted according to a single descriptor - MFCC+stats - V1 - EN+FR.zip (26.0 KB)

tremblap · May 16, 2023, 1:33pm

Hello!

Thanks for sharing and sorry for the slow reply. I have 2 comments:

it is a very good way of sorting files and trying to get an intuition of what the statistical reduction of the time series of the feature compared.
but MFCCs as single dimensions are not super useful. It is like comparing a single FFT band. The explanation here is quite good.

So the same patch could be used with fluid.bufspectralshape - each feature is independent there - and they could be compared. I would probably run them as MIDI (log) to get more perceptually relevant values. And I would add the first derivative to the stats because it is interesting to see and hear the impact of the rate of change.

Nice work, I hope this helps, and thanks for sharing!