In relation to the conversation about mfccs
Here’s a video that shows what happens to the overall envelope when the pitch of the voice changes on one vowel (the spectral envelope stays roughly the same, but the harmonics move) as opposed to doing a varispeed playback