Skip to content

Get an "incorrect audio shape" assert when I process the whisper models #1

Closed
@jasontitus

Description

@jasontitus

When trying to convert the whisper models I get an assert -

assert x.shape[1:] == self.positional_embedding.shape, "incorrect audio shape"

It doesn't seem to kill the process, but the produced models don't seem to recognize much of anything (one word of the JFK sample, for example), and are different from the models others uploaded to HF. I'm running this on a Macbook Pro M2 64GB - Mac OS 13.3.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions