Hi,
You can extract PCM data without playing the sound; see FMOD Engine | Advanced Core API Topics - Extracting PCM Data From a Sound. Would this be a valid option?
Is this similar to the issue you were experiencing before in "Visualizing both classic Loudness x Time and Spectrogram"?
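
In case it helps, here is a minimal sketch of what you could do with the PCM data once it is in a buffer, for a loudness-over-time style plot. It assumes the sound was decoded to interleaved signed 16-bit PCM; as far as I understand the linked docs section, that means opening the sound with FMOD_OPENONLY and filling the buffer with Sound::readData, but please check the docs for the exact steps. `rmsPerWindow` is a hypothetical helper name, not part of the FMOD API.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not an FMOD function): RMS level per fixed-size window.
// Assumption: `pcm` holds signed 16-bit samples that were already extracted
// from the sound (for example with Sound::readData after FMOD_OPENONLY).
// Interleaved channels are treated as one stream, so the result is a combined
// level rather than a per-channel level.
std::vector<float> rmsPerWindow(const std::vector<int16_t>& pcm,
                                std::size_t windowSamples)
{
    std::vector<float> loudness;
    if (windowSamples == 0) {
        return loudness; // nothing sensible to compute
    }

    for (std::size_t start = 0; start < pcm.size(); start += windowSamples) {
        // The last window may be shorter than windowSamples.
        const std::size_t end = std::min(start + windowSamples, pcm.size());

        double sumSquares = 0.0;
        for (std::size_t i = start; i < end; ++i) {
            const double s = pcm[i] / 32768.0; // normalize to roughly [-1, 1)
            sumSquares += s * s;
        }
        loudness.push_back(
            static_cast<float>(std::sqrt(sumSquares / static_cast<double>(end - start))));
    }
    return loudness;
}
```

Each value in the returned vector is one point on the loudness curve, so a window of 1024 samples on a 48 kHz mono sound would give roughly 47 points per second.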