Batching Behaviour

by gclose19 - opened Mar 16

Mar 16

Hello.
Thank you for providing standalone code for the speaker embeddings.
I've noticed an issue where differing lengths of padded batched inputs are not being considered by the model.
This results in different embeddings for a given audio segment depending on if the embedding was computed in a batch or not.
This is (presumably) unintentional behaviour ?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment