how to use peoples speech dataset? #57

housebaby · 2021-12-20T08:46:51Z

I have downloaded the people speech dataset, and have two questions:

How to parse the two files?
I notice there are two options: clean / other
what's the relationship between the two options?
Do I have to download the data of both the two options?
So the total audio will be 60k hours?

xiaobobo-bilibili · 2021-12-21T01:38:02Z

I have the same problem, apart from clean/ dirty, there is also the difference between CC-BY and CC-BY-SA. I have downloaded all these files but can't decompress them because it doesn't look like zip or tar files.

will-rice · 2021-12-23T00:42:24Z

If you extract what you downloaded from the "Data" button, you can use the manifest to build text/speech pairs based on the label and name keys under the training_data key. One thing I'm confused about is how to access the multilingual data. I don't see a language key in the manifest.

housebaby · 2021-12-24T06:47:20Z

I can get label and name from the manifest, but how can I get wavs from the Big File downloaded from "Data"

If you extract what you downloaded from the "Data" button, you can use the manifest to build text/speech pairs based on the label and name keys under the training_data key. One thing I'm confused about is how to access the multilingual data. I don't see a language key in the manifest.

BuaaAlban · 2022-02-09T03:19:01Z

Hi,
I found there were 260988flac files in the released tar dataset（part-00000-07a8f0d3-6d27-4299-887a-dc12a6d72f8d-c000.tar）which only has about 1000 hours, but the metainfo of the json file （part-00000-4e132642-c01c-4db6-9db0-a1e19193f6f8-c000.json）has 4321002 files in it, which mismatch.

Do you have any suggestions or do you find the same problem??

mozoltov183 · 2022-09-22T03:08:53Z

I have downloaded the people speech dataset, and have two questions:

How to parse the two files?

I notice there are two options: clean / other
what's the relationship between the two options?
Do I have to download the data of both the two options?
So the total audio will be 60k hours?

Hello, can you share the part-00000-4e132642-c01c-4db6-9db0-a1e19193f6f8-c000.json with me;
i want download it from https://mlcommons.org/en/peoples-speech/ but meet some problems,
and i alrady download part-00000-07a8f0d3-6d27-4299-887a-dc12a6d72f8d-c000.tar, thank u very much!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

how to use peoples speech dataset? #57

how to use peoples speech dataset? #57

housebaby commented Dec 20, 2021 •

edited

Loading

xiaobobo-bilibili commented Dec 21, 2021

will-rice commented Dec 23, 2021 •

edited

Loading

housebaby commented Dec 24, 2021

BuaaAlban commented Feb 9, 2022 •

edited

Loading

mozoltov183 commented Sep 22, 2022

how to use peoples speech dataset? #57

how to use peoples speech dataset? #57

Comments

housebaby commented Dec 20, 2021 • edited Loading

xiaobobo-bilibili commented Dec 21, 2021

will-rice commented Dec 23, 2021 • edited Loading

housebaby commented Dec 24, 2021

BuaaAlban commented Feb 9, 2022 • edited Loading

mozoltov183 commented Sep 22, 2022

housebaby commented Dec 20, 2021 •

edited

Loading

will-rice commented Dec 23, 2021 •

edited

Loading

BuaaAlban commented Feb 9, 2022 •

edited

Loading