Get an "incorrect audio shape" assert when I process the whisper models

When trying to convert the whisper models I get an assert -

assert x.shape[1:] == self.positional_embedding.shape, "incorrect audio shape"

It doesn't seem to kill the process, but the produced models don't seem to recognize much of anything (one word of the JFK sample, for example), and are different from the models others uploaded to HF. I'm running this on a Macbook Pro M2 64GB - Mac OS 13.3.