Video for my students about MLP and autoencoder

tutschku · May 24, 2021, 1:20pm

This is certainly not a beginners video as it requires some basic understanding of machine learning and the FluComa tools. It summarizes the concepts around MLP and autoencoder in Max MSP.
The patches are based on the FluComa tutorials.

jamesbradbury · May 24, 2021, 1:29pm

I think the link is to your accounts management of the video rather than the public facing one

tutschku · May 24, 2021, 1:38pm

Is it fixed now?

jamesbradbury · May 24, 2021, 1:45pm

Yes! Its embedded in the post as well which is handy.

jamesbradbury · May 24, 2021, 1:57pm

7:14 is interesting when you bring up the point about the differences to linear interpolation. I know @a.harker is sceptical about the behaviour of neural networks for this sort of thing offering beneifts over a linear interpolation approach but I think you are right - if the presets were more complex to warp between the nn would more likely be able to figure out a non-linear pathway. Theres no guarantee that will sound ‘better’ but it certainly is a difference.

Probably worth investigating with some kind of a/b comparison imo.

a.harker · May 24, 2021, 2:12pm

I should clarify that I didn’t necessarily say linear interpolation. The question for some cases of controller mapping is how different it would be to some form of interpolation, which might be (for instance) cubic or something else. The level of non-linearity is important.

And - yes - as with other debates of this nature - a true test would be better than speculation.

jamesbradbury · May 24, 2021, 2:17pm

In that regard the point to make would be that you don’t have to go and design your function, rather the computer invents it for you and that process itself can be fruitful…

a.harker · May 24, 2021, 3:23pm

When you make an interpolator you choose a function or approach, you don’t necessarily design it. I have no doubt that the exact numerical results will be different from an NN, but the question is how nonlinear is it (or more generally how far away from what an interpolator would do). Whether that difference is fruitful is the question for me. If it’s not nonlinear in a complex way other methods may produce pretty similar results.

Anecdotally, when I’ve watched the outputs of some examples people have given in the past I think I see the outputs moving towards the training points, much like they might with an interpolator. But:

1 - do I know if the difference is meaningful? No - that would need to be tested (by the person who was defining what meaningful would be).

2 - is the NN a valid way to reach the result - of course from a musical point of view, but I’m wary of the potential to imagine it is doing something more magical than building an interpolator for you - my hunch is that for a low number of training points there might be other less complex ways to reach a very similar result, but that is just a hunch and I might be wrong, perhaps very much so. I think the test might be worth doing though…

a.harker · May 24, 2021, 3:30pm

I’ve now watched the segment of the video in question and the shapes I see look very very similar to some kind of interpolator (in fact they look quite linear), but we’d have to decide which kind - a simple model might be to add something from all training points based on distance to the query point (such that they also sum to one).

Most importantly what I see is all the points moving towards the training point that the query is moving towards in the direction we would expect, so we aren’t seeing complex non-linearity emerge between two points, which might lead to something quite different from a straightforward interpolation.

jamesbradbury · May 24, 2021, 3:33pm

I think you’re splitting hairs here. Although I agree you don’t design an interpolation function (although you could) it is definitely a component of designing some patch/program/module that performs interpolation. It is undeniably a choice that one makes knowingly or unknowingly, and whether or not you implement it from a set of well known interpolation functions or have a nn approximate the function are two different things.

I think this is largely a result of the example being quite basic and illustrative. The canonical XOR example is a better demonstration of how non-linearity can be captured and where something linear would fail.

a.harker · May 24, 2021, 3:41pm

Technically and in terms of process yes - but the question for me is whether they are produce perceptually different results or not. [As a less important aside - for the NN one makes other choices (like activation function), so seeing the need to make a choice of interpolator as a downside in comparison to an NN is for me a false comparison.]

I’m not sure how that relates to this situation. My claim is not that NNs never provide solutions to non-linear problems that are hard to solve in other ways. My claim is simply that for some limited scenarios (controller mapping with limited training points being potentially one) they may be essentially a means to design an interpolator that may not be particularly perceptually different to something that would be much simpler to implement in a different way.

jamesbradbury · May 24, 2021, 3:56pm

Sorry, your claims were never ratified in text so I’m going off my memory of you being quite underwhelmed by them (mlp’s) I do agree with you in this case. I would be interested to see more examples where the function the nn learns could be quite challenging or unexpected in very simple mapping problems.

spluta · May 24, 2021, 4:00pm

I would love someone much smarter than me to be able to explain this definitively, as, intuitively I agree with what Hans has said - especially once your control space is more than 2 dimensions! I mean, once it is 3 or 4 or 5, etc, I don’t even know how you would design that interpolation. And that is part of the magic here. 5 dims is just as easy as 2 dims.

I was looking at this book today for some kind of clarity:

I didn’t get it. But for now I have drunk the cool-aid. I just don’t really understand what is in the cool-aid. But I know I like it.

a.harker · May 24, 2021, 4:06pm

There are two obvious approaches.

One creates only linear combinations of control points and thus the number of dimensions is irrelevant, you simply find the contribution of each control point to the result and multiply and take the sum.
The second is to assume that one can decouple the dimensions for the purpose of interpolation.

A really important distinction to make here is NN used to model systems that are known to be non-linear and the assumption that the non-linearity of a NN might “invent” or “create” some kind of odd interpolation path that a simpler approach wouldn’t and that is the bit I’m less sure about

jamesbradbury · May 24, 2021, 4:08pm

I

To add to this, I think that the process of working with the NN can confirm or deny suspicions about whether or not something was non-linear as one thinks. The XOR to me is quite simple, but is non-linear by definition and so in a way our intuitive understanding can betray the reality of the problem space.

tremblap · May 24, 2021, 5:32pm

My approach to this comes from the very cool assumption that I don’t need to care about linearity or not. Rebecca Fiebrink is quite clear about this, especially for small dataset like ours. In effect, I point at a few points in my arbitrary mapping space that might be imprecise and incomplete and the machine tries to devise a mapping that will be approximately good. I even get extrapolation, aka guestimates of things outside my space. So I’m happy.

As for rigorous comparison, we could randomise the test but again, if it works and is musically expressive, I really want to make more music, not try to maybe find a system that might work better for certain cases…

Speaking of complicated systems: time to practice the fretless scales - linear I’m told, but behold, it does not sound like it when i skip a day

spluta · May 24, 2021, 5:53pm

A key thing that I keep tripping myself up on is the nonlinearity of the parameter space and the nonlinearity of the sonic space - and though related, these are not the same thing.

tremblap · May 24, 2021, 6:15pm

Agreed, and there is the non-linearity of your preset space too - you might create ‘illogical’ mappings in the space you attribute positions in, yet MLP will try to make sense of it all in fascinating smooth ways.

tutschku · May 25, 2021, 12:11am

Thank you all for chiming in on this discussion. There are very valid details captured here which will continue to make me think. This group is just amazing. What are we going to do when the formal part of FluComa is coming to an end? I will certainly miss our weekly encounters VERY MUCH.

tremblap · May 25, 2021, 7:07am

They don’t need to stop