I think this might be related:
Audio processing is definitely an issue on FP4. Unless you choose raw/unprocessed audio stream, it’s full of these artifacts.