Closed
Description
When trying to convert the whisper models I get an assert -
assert x.shape[1:] == self.positional_embedding.shape, "incorrect audio shape"
It doesn't seem to kill the process, but the produced models don't seem to recognize much of anything (one word of the JFK sample, for example), and are different from the models others uploaded to HF. I'm running this on a Macbook Pro M2 64GB - Mac OS 13.3.
Metadata
Metadata
Assignees
Labels
No labels