Continuous Training in the Cloud

Dear FluCoMa community,

I’m posting to describe a project I’m working on in collaboration with @amgum. We want to “productionize” an ML workflow using the FluCoMa libraries, i.e. create a continuous training pipeline that runs in the cloud. The goal is to provide a recipe, including the infrastructure, that others can share and re-use for research purposes. It’s early, but I think this workflow will involve analyzing a corpus of sound in Google Colab and then providing a proof-of-concept for “shipping it” on a platform like Vertex AI (uploading new sounds triggers the training workflow). The full pipeline is still somewhat speculative, and we’re reducing its scope to keep it realistic and feasible: a small-data version of something that could scale to big data. We would probably use @jamesbradbury’s Python bindings to bring FluCoMa into the more traditional Python-based data science ecosystem. We could use some help with the technical scoping.
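To make the corpus-analysis stage concrete, here is a minimal sketch of what the Colab step might look like. Everything in it is an assumption, not part of the actual project: it uses a numpy-based spectral centroid as a stand-in where the real pipeline would call FluCoMa descriptors through the Python bindings, and it writes the result in the JSON shape that, as I understand it, `fluid.dataset~` uses (`{"cols": n, "data": {id: [...]}}`).

```python
# Sketch of the corpus-analysis stage. Assumptions: in-memory mono signals
# stand in for audio files, and an FFT-based spectral centroid stands in
# for a FluCoMa descriptor computed via the python-flucoma bindings.
import json
import numpy as np

def spectral_centroid(signal, sr=44100):
    """Mean spectral centroid in Hz (stand-in for a FluCoMa descriptor)."""
    mags = np.abs(np.fft.rfft(signal))
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sr)
    if mags.sum() == 0:
        return 0.0
    return float((freqs * mags).sum() / mags.sum())

def analyse_corpus(sounds, sr=44100):
    """sounds: dict of {identifier: numpy array}. Returns a dataset dict
    in the JSON shape used by fluid.dataset~."""
    data = {name: [spectral_centroid(sig, sr)] for name, sig in sounds.items()}
    return {"cols": 1, "data": data}

# Example: two synthetic sine tones instead of real recordings.
sr = 44100
t = np.arange(sr) / sr
corpus = {
    "low.wav": np.sin(2 * np.pi * 220 * t),
    "high.wav": np.sin(2 * np.pi * 2200 * t),
}
dataset = analyse_corpus(corpus, sr)
print(json.dumps(dataset)[:60])
```

In a real run, the loop body would be replaced by FluCoMa analysis and the dict keys by file paths in a bucket; the JSON export is the hand-off point to the training stage.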

@amgum is a data scientist while I work in DevOps by day, and we’re collaborating within the structure of a professional peer mentorship. We want our work to serve as an example to others. In addition to sharing the notebooks and infra code, part of this project is to reflect on and showcase what makes, for us, a successful collaboration between Data Science and DevOps. @weefuzzy shared this video with me, which might serve as orientation.

This is a learning experiment for both of us. I’ve never done full-blown MLOps and @amgum has mainly worked with financial data and things like that, not spectral time series data. The specific inspiration for this was Alice Eldridge’s demonstration of her analysis of rainforest sounds.

I hope that’s enough context. I’m eager to share this with the FluCoMa community and hope we can get your support and encouragement.

Right now, specifically, we’re looking for public datasets (on Kaggle or elsewhere) of both sounds and derived spectral data from the natural environment, to get our bearings on the data itself. If anybody has pointers, would like to help, or wants to know more, please get in touch in the thread or by DM.

3 Likes

This is great news. @tedmoore has done some back-and-forth between Python and FluCoMa, and @rodrigo.constanzo and Jordi Shier have something very cool coming very soon: using PyTorch to optimise and train networks, from FluCoMa-made datasets and towards FluCoMa MLPs.

Looking forward to hearing/seeing what is going to happen and thanks for sharing!

2 Likes

I remember, towards the start of FluCoMa, that either you (@tremblap) or maybe Hans (@tutschku) was really keen on the idea of something like this that kept an up-to-date analysis of all your audio samples. Each time you added to it, everything would get re-analyzed automatically overnight and be available the next day.

I was reminded of the usefulness of that again with this post.

2 Likes

Would Node bindings be useful? Or would you rather keep it in Python? I ask because I am working on native bindings for both, and I am learning a lot about pybind / napi in the process, but it’s probably too much effort to maintain both. I wonder which would be most useful, because it would be cool to support this project.

1 Like

See my comment here:

Could be a fun “proof of concept” for making lower-level language bindings. A small daemon that runs and is pointed to various folders, constantly updating a database which you can export to a dataset at any time.
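The daemon idea above could be sketched in miniature: a polling indexer that keeps a SQLite table of per-file features and re-analyses only files whose modification time has changed since the last scan. The `analyse()` stub here is purely hypothetical (it just records file size); a real version would call FluCoMa through the CLI tools or the Python bindings, and the exported dict follows the `fluid.dataset~`-style JSON shape as I understand it.

```python
# Minimal sketch of the folder-watching daemon idea: index files in SQLite,
# re-analyse only new or modified ones, export a dataset at any time.
import json
import os
import sqlite3

def analyse(path):
    # Hypothetical placeholder feature (file size in bytes); a real daemon
    # would run FluCoMa analysis here instead.
    return [float(os.path.getsize(path))]

class CorpusIndex:
    def __init__(self, db_path=":memory:"):
        self.db = sqlite3.connect(db_path)
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS files "
            "(path TEXT PRIMARY KEY, mtime REAL, features TEXT)")

    def scan(self, folder):
        """Analyse new or modified files; return how many were (re)analysed."""
        updated = 0
        for name in os.listdir(folder):
            path = os.path.join(folder, name)
            if not os.path.isfile(path):
                continue
            mtime = os.path.getmtime(path)
            row = self.db.execute(
                "SELECT mtime FROM files WHERE path = ?", (path,)).fetchone()
            if row is None or row[0] != mtime:
                self.db.execute(
                    "INSERT OR REPLACE INTO files VALUES (?, ?, ?)",
                    (path, mtime, json.dumps(analyse(path))))
                updated += 1
        self.db.commit()
        return updated

    def export(self):
        """Export the index as a fluid.dataset~-style JSON dict."""
        rows = self.db.execute("SELECT path, features FROM files").fetchall()
        data = {os.path.basename(p): json.loads(f) for p, f in rows}
        cols = len(next(iter(data.values()))) if data else 0
        return {"cols": cols, "data": data}
```

Running `scan()` on a schedule (or from a filesystem watcher) gives the overnight re-analysis behaviour described earlier, with `export()` as the on-demand hand-off to training.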

2 Likes

At the moment I would only use the Python bindings, and I think there would be more users of those. What do you have in mind for the Node bindings? Embedding FluCoMa stuff in websites?

Is what you’re working on a rework of the CLI tools, or also FluCoMa core?

I really like your project suggestion.