feat: add run name support #4072

firefighter-eric · 2025-05-03T16:14:25Z

PR type

Bug Fix
New Feature
Document Updates
More Models or Datasets Support

PR information

Write the detailed information belonging to this PR.

Experiment results

Paste your experiment results here (if needed).

export WANDB_PROJECT=llm101

CUDA_VISIBLE_DEVICES=0 \
swift sft \
    --model data/models/Qwen/Qwen2.5-0.5B-Instruct \
    --train_type full \
    --dataset data/swift/Qwen3-SFT-Mixin/qwen3_32b_distill_1k.jsonl \
    --torch_dtype bfloat16 \
    --num_train_epochs 1 \
    --per_device_train_batch_size 1 \
    --per_device_eval_batch_size 1 \
    --learning_rate 1e-5 \
    --gradient_accumulation_steps 16 \
    --eval_steps 50 \
    --save_steps 50 \
    --save_total_limit 1 \
    --logging_steps 1 \
    --max_length 2048 \
    --output_dir runs/swift/qwen2.5-0.5b \
    --system 'You are a helpful assistant.' \
    --warmup_ratio 0.05 \
    --dataloader_num_workers 4 \
    --report_to wandb \
    --attn_impl flash_attn \
    --use_liger_kernel true \
    --run_name test-swift-qwen2.5-0.5b
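As a small sketch that is not part of this PR, the run name could be derived from the output directory so that the W&B run and the checkpoint folder stay aligned. The variable names `OUTPUT_DIR` and `RUN_NAME` and the `test-swift-` prefix convention are assumptions made for illustration only.

```shell
# Hypothetical helper variables; swift itself does not define them.
OUTPUT_DIR=runs/swift/qwen2.5-0.5b
# Reuse the last path component of the output directory as the run name suffix.
RUN_NAME="test-swift-$(basename "$OUTPUT_DIR")"
echo "$RUN_NAME"
```

The values can then be passed to the command above as `--output_dir "$OUTPUT_DIR"` and `--run_name "$RUN_NAME"`.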

Resolves #4046

Jintao-Huang · 2025-05-03T16:33:25Z

Please run the following code:

pip install pre-commit
pre-commit run --all-files

* main:
  fix enable_cache (modelscope#4091)
  Support ulysses for llm/mllm,dpo/sft (modelscope#4085)
  update docs (modelscope#4078)
  feat: support megatron wandb (modelscope#4074)
  feat: add run name support (modelscope#4072)
  fix padding_side left (modelscope#4069)
  bump version
  support MiMo-7B (modelscope#4067)
  fix packing eval streaming (modelscope#4066)
  Support empty think loss scale (modelscope#4065)
  support qwen3-moe awq (modelscope#4059)
  Fix grpo eval when gas > 1 (modelscope#4057)
  fix rollout(modelscope#4055)
  updates GRPOTrainer compatible with trl 0.17 (modelscope#3969)
  support Qwen2.5-Omni-3B (modelscope#4052)
  update wechat (modelscope#4047)

# Conflicts:
#	swift/llm/train/tuner.py

feat: add run name support

a8f4874

Jintao-Huang approved these changes May 3, 2025

format: format with pre-commit

13547fa

Jintao-Huang merged commit f10e914 into modelscope:main May 3, 2025
2 checks passed

firefighter-eric mentioned this pull request May 4, 2025

Add run_name argument support for wandb integration #4046

Closed
