Is there an application of normalize and standardise that happens outside of a dataset~
object? I think that it would be so much easier to just call these as methods on part of the fluid.dataset~ object rather than having to do a copy of the data to a new place. It becomes a bit gymnastic-al keeping tabs on where everything is passed particularly if you have multiple stages of preprocessing before you train an mlp for example. I tend to go with a naming structure like input
, input.norm
, input.norm.std
just to keep tabs on where everything goes but for every stage of this I need to do more objects and have to route the outputs to get an automatic one button pipeline going.
Instead it could be nice that you just call these methods directly on the object or you can copy to a new dataset if you are going to transform new inputs based on the fit of the norm/std process. Just my 2c