I’m working on something where it would be useful to pull out the median entry in a fluid.dataset~
and I’m wondering if there’s a better (and/or more accurate) way to do this.
So this will be a fluid.dataset~
filled with a bunch of rows of descriptor data (so generally 100+ rows, and somewhere between 8-100d of descriptors).
What I’m thinking at the moment is to dump out the dataset and find the median of each column independently, then create a synthetic entry out of them, which I would then feed into a KDTree fit’d with the original dataset to find the nearest point to that.
I think that would “work”, but I’m not entirely sure that that would actually be the median value, and it does feel super faffy.
Is there a more elegant way to do this?
This is an example of the kind of data I’m working with:
fluidobject.json.zip (120.2 KB)