Skip to content

Vertical bars in audio from model output #24

Closed Answered by iver56
KimberleyJensen asked this question in Q&A
Discussion options

You must be logged in to vote

Hey Kimberley Jensen! Love your work in the source separation space.

Please try again with a different window function for the (i)stft. I think the authors of the original paper used hann, which should be much more appropriate than what you have right now. The current default is all ones, which (I think) leads to those unwanted little clicks at a regular interval. The click interval matches the hop length. It would be nice to have this fixed upstream too.

Pytorch is in the process of switching their default window function to hann, but it takes time, so until then we have to set it explicitly.

Replies: 3 comments 4 replies

Comment options

You must be logged in to vote
2 replies
@lucidrains
Comment options

@KimberleyJensen
Comment options

Answer selected by lucidrains
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
2 replies
@lucidrains
Comment options

@ZFTurbo
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants