MLPRegressor training versus validation loss

thank you very much @tremblap! great!
and yes, my bad, the loop keeps it running, so the question would be how to know when it stops (some indication output from the MLPRegressor object that sets ~continuously_train to false would be great?!)

one could also implement the early-stopping test in the train loop (compare the last 10 test-loss values; if they are all going up, set ~cont-train to false)
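
something like this minimal sclang sketch could do that check (the names here are just placeholders; feed it whatever test loss your train loop gets back each time it fits):

```
(
~lossWindow = Array.new;        // last 10 test losses
~continuously_train = true;

~checkLoss = { |loss|
	~lossWindow = (~lossWindow ++ [loss]).keep(-10);   // keep only the last 10
	// if we have 10 values and every step went up, stop the training loop
	if((~lossWindow.size == 10) and: {
		~lossWindow.differentiate.drop(1).every { |d| d > 0 }
	}) {
		"test loss went up 10 times in a row, stopping".postln;
		~continuously_train = false;
	};
};
)
```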
i need to save a model today so i'll keep what i have for now! but for upcoming work…

and i will also test your high batches approach very soon!! ; )

this looks like a very elegant solution, i am curious!!
merci to all

I think a combination of the 2 solutions you propose is a rich way forward here. for instance:

run my long slow settings once, note the reported error
run it again (it’ll stop early if it needs to) and compare the error
continue until the error is low or the running time is short twice in a row? (a rough sketch of that loop below)
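
just to sketch that loop in sclang: ~trainOnce below is a hypothetical stand-in for “run one long fit and hand back the final reported error”, the stopping rule is the part being illustrated.

```
(
~runUntilSettled = { |errorTarget = 0.001, shortRunTime = 5|
	var lastError = inf, shortRuns = 0, done = false;
	while { done.not } {
		var t0 = Main.elapsedTime, error, runTime;
		error = ~trainOnce.value;      // hypothetical: one long fit, returns the final error
		runTime = Main.elapsedTime - t0;
		("error % (previous %), ran for % s".format(error, lastError, runTime.round(0.1))).postln;
		// count how many times in a row the run finished quickly
		if(runTime < shortRunTime) { shortRuns = shortRuns + 1 } { shortRuns = 0 };
		done = (error < errorTarget) or: { shortRuns >= 2 };
		lastError = error;
	};
};
)
```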

but then, there are many many more ways of exploring that. for instance you could generate many random MLPs and see which one gets to the best error sooner… or you could do a few my way then test your way, keeping MLP states in between in a Dictionary so you can revert when it starts to overfit.
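
for the Dictionary idea, something like this (assuming the object lets you get and restore its state as a dictionary with .dump / .load, check the FluidMLPRegressor helpfile; .write / .read to files would do the same job otherwise):

```
(
~snapshots = Dictionary.new;

// store the current network state under a label (e.g. the epoch count)
~saveState = { |label|
	~mlp.dump({ |dict| ~snapshots[label] = dict });
};

// roll back to a stored state when the test loss starts climbing
~revertTo = { |label|
	~mlp.load(~snapshots[label]);
};
)
```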

this is fun, you can explore all of these until you get the music you want and/or the mega neural net structure and/or your understanding is maximal and/or you are bored and/or …

so much fun!


yes that is what i want, i mean it’s fun sitting the whole day and watching things get drawn, haha… i mean it, but i will build something automatic soon (a friend pointed me at https://docs.wandb.ai/). i want to input, let’s say, a list of hyperparams and some model structures etc., then let it run for some days and have it give me the best thing out of 100s for my data…

this is really fun, indeed, i enjoy it a lot also - still lots of ml things to discover and understand at the same time…


Greetings @doett and @tremblap ,

Just popping back in to say that I’m glad more people are talking about validation in the FluCoMa-verse and if/when you settle on a workflow please do share! https://docs.wandb.ai/ looks great. I’m curious if/how it jives with FluCoMa tools!

hi @tedmoore,
what i’ll try soon is to automate the process a bit more: let the thing run, then check many models and their error plots at the end

let’s say: take a dataset, specify some different model params (layers, activations etc.) and hyperparams to train,
then train models on all possibilities, plot each outcome and save all models
this is where wandb can be useful (i have no idea yet how to bring this into flucomaland, but as a general working-pipeline idea it is maybe interesting)
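
a rough sketch of the “train on all possibilities” part in sclang; the parameter lists are made up, the activation symbols would need mapping to the real activation constants, and ~trainAndLog is a hypothetical placeholder for “make the network, train it, plot the loss curve, save the model”:

```
(
~hiddenLayers = [[6], [12], [12, 6]];
~activations  = [\sigmoid, \tanh];     // map to the actual activation constants
~learnRates   = [0.01, 0.1];

// every combination of the three parameter lists
~configs = [~hiddenLayers, ~activations, ~learnRates].allTuples;

~configs.do { |cfg, i|
	var layers, activation, learnRate;
	#layers, activation, learnRate = cfg;
	("run %: layers %, activation %, learnRate %".format(i, layers, activation, learnRate)).postln;
	// ~trainAndLog.value(layers, activation, learnRate, i);   // hypothetical: train, plot, save
};
)
```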

to do this in a loop/routine, the exit has to work, so early stopping needs to be considered, and i am doing some tests using
@tremblap’s ideas on this from above…

i also tried to build the algo in sclang (add every test loss to a list (size 10) until it goes up 10 in a row → kill), but it hardly ever stops (because of the noisy test loss),
and smoothing it (using a moving avg) can cause stopping too early (local min) or too late (see the pic attached as an example…)
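
a sketch of that logic (just an illustration, the names and the exact fit call are placeholders); windowSize and patience are the knobs that trade stopping too early against stopping too late:

```
(
// returns a function you call with each new test loss; it answers true
// once the moving-average-smoothed loss has gone up `patience` times in a row
~makeStopper = { |windowSize = 10, patience = 10|
	var raw = Array.new, smoothed = Array.new;
	{ |loss|
		raw = (raw ++ [loss]).keep(windowSize.neg);                    // moving window
		smoothed = (smoothed ++ [raw.mean]).keep((patience + 1).neg);
		(smoothed.size > patience) and: {
			smoothed.differentiate.drop(1).every { |d| d > 0 }
		}
	}
};

// usage inside the train loop (sketch):
// ~shouldStop = ~makeStopper.value(10, 10);
// if(~shouldStop.value(testLoss)) { ~continuously_train = false };
)
```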

well…
if i come up with something useful i’ll let you know! thanks
