I know I’m quite late to the party, but based on the stuff that @tedmoore was showing in the thread about SVM stuff, I’m building a
fluid.kmeans~ thing to automatically cluster a corpus to then further experiment/test on.
Other than manually making labels/classes before I’ve not done much at all with
Now, because the visualizer I built takes a single
fluid.dataset~ as its input, I want to make a 3D dataset that contains 2Ds of umap output, with the 3rd D being the normalized clusters created by
Seems simple enough.
But unless I’m overlooking something, do I need to have
fluid.kmeans~ populate a
fluid.labelset~ (with integers (as symbols)), which I then output to a
uzi/iterate out the contents via a
get message into a list, which I then
peek~ into a
buffer~, which I then have to manually add back into another
addpoint before finally concatenating the two datasets back together. Is that right?
I understand that, fundamentally, labelsets are capable of holding symbols/names/whatever, but something like
fluid.kmeans~ spits out ints anyways.
Perhaps this is just my greenness with pairing datasets/labelsets, but for the purposes of visualize at least, I kind of want all of those things in a single thing (a
fluid.dataset~ in my case).