Why use different dataset for Training Ovis1.5-Gemma2-9B-S3 and Ovis1.5-Llama3-8B-S3 #39

LIRENDA621 · 2024-12-01T13:06:01Z

You used 73 datasets to train Ovis1.5-Gemma2-9B-S3, but only 71 datasets to train Ovis1.5-Llama3-8B-S3. Why is this the case? Is it a typo or is there another reason?

runninglsy · 2025-01-24T07:02:34Z

The difference in the number of datasets used for training Ovis1.5-Gemma2-9B-S3 compared to Ovis1.5-Llama3-8B-S3 is not a typo. The 9B version was trained after the 8B version, during which we constructed new data. These additional datasets were included in the training of the 9B model.

YangYang-DLUT · 2025-02-21T09:50:07Z

Great work, truly powerful models. I want to know find the open-sources part of data for train Ovis2? Is it here?https://huggingface.co/datasets/AIDC-AI/Ovis-dataset
or where i can find this list?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why use different dataset for Training Ovis1.5-Gemma2-9B-S3 and Ovis1.5-Llama3-8B-S3 #39

Why use different dataset for Training Ovis1.5-Gemma2-9B-S3 and Ovis1.5-Llama3-8B-S3 #39

LIRENDA621 commented Dec 1, 2024

runninglsy commented Jan 24, 2025

YangYang-DLUT commented Feb 21, 2025

Why use different dataset for Training Ovis1.5-Gemma2-9B-S3 and Ovis1.5-Llama3-8B-S3 #39

Why use different dataset for Training Ovis1.5-Gemma2-9B-S3 and Ovis1.5-Llama3-8B-S3 #39

Comments

LIRENDA621 commented Dec 1, 2024

runninglsy commented Jan 24, 2025

YangYang-DLUT commented Feb 21, 2025