The VCTK corpus is sampled at 48 kHz, while the model requires 16 kHz input. To match the sampling rate, I used librosa's resample function, and my code looks like this:
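import librosa
# Load at the corpus's native 48 kHz rate (wav_file is the path to a VCTK recording).
y, sr = librosa.load(wav_file, sr=48000)
# Downsample to 16 kHz; librosa 0.10+ requires orig_sr/target_sr as keyword arguments.
y_16k = librosa.resample(y, orig_sr=sr, target_sr=16000)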
Is this the same code you used to downsample the audio? I am asking because I want to make sure my data distribution matches the one the model was trained on.
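A minimal sketch of why the exact resampling call matters, assuming NumPy and SciPy are available. The synthetic 440 Hz tone and the two SciPy resamplers (`resample_poly` and `resample`) are illustrative stand-ins chosen for this example; they are not the code used for the model, and not librosa's resampler. The point is only that two reasonable 48 kHz to 16 kHz methods can produce outputs of the same length that are close but not necessarily identical sample for sample.

```python
import numpy as np
from scipy.signal import resample, resample_poly

ORIG_SR = 48000    # VCTK sampling rate
TARGET_SR = 16000  # rate the model expects

# Synthetic one-second 440 Hz tone standing in for a VCTK utterance (illustrative only).
t = np.arange(ORIG_SR) / ORIG_SR
y = np.sin(2 * np.pi * 440.0 * t)

# Method A: polyphase FIR filtering, downsampling by the integer factor 3.
y_poly = resample_poly(y, up=1, down=ORIG_SR // TARGET_SR)

# Method B: FFT-based (Fourier) resampling to the same number of samples.
y_fft = resample(y, len(y_poly))

# Compare away from the edges, where the FIR filter's zero-padding matters most.
interior = slice(200, -200)
max_diff = float(np.max(np.abs(y_poly[interior] - y_fft[interior])))
print(len(y_poly), len(y_fft), max_diff)
```

If the printed difference is small but non-zero, the two methods agree closely without being bit-exact, which is why knowing the exact resampling function and its settings helps when trying to reproduce the training data.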