PKU-Baichuan-MLSystemLab

Peking University ML System Lab - Baichuan Inc. Joint Laboratory

Welcome to the GitHub repository of the Peking University ML System Lab - Baichuan Inc. Joint Laboratory.

We are dedicated to advancing research in Data-Centric Machine Learning (DCML), Large Language Models (LLMs), and Machine Learning Systems (ML Systems).

Our goal is to develop effective and efficient data preparation systems and algorithms that support and enhance the performance of machine learning models.

Newly Released Papers and Code

🔥 2024/09/02 DataSculpt: Crafting Data Landscapes for LLM Post-Training through Multi-objective Partitioning 🌲 arXiv
🔥 2024/08/27 BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline 🌴 Repo 🌲 arXiv
🔥 2024/08/21 MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark 🌴 Repo 🌲 arXiv
🔥 2024/08/20 SysBench: Can Large Language Models Follow System Messages? 🌴 Repo 🌲 arXiv
🔥 2024/08/14 The Llama3-PBM-Nova-70B model is released! 🤗 Huggingface
🔥 2024/08/07 PAS: Data-Efficient Plug-and-Play Prompt Augmentation System 🤗 Huggingface 🌴 Repo 🌲 arXiv
🔥 2024/08/02 CFBench: A Comprehensive Constraints-Following Benchmark for LLMs 🌴 Repo 🌲 arXiv

Pinned

  1. CFBench Public

    CFBench: A Comprehensive Constraints-Following Benchmark for LLMs

    Python · 21 stars · 3 forks

  2. PAS Public

    Python · 39 stars

  3. SysBench Public

    SysBench: Can Large Language Models Follow System Messages?

    Python · 16 stars · 2 forks

Repositories

Showing 5 of 5 repositories
  • .github Public
    0 stars · 0 forks · 0 issues · 0 pull requests · Updated Sep 11, 2024
  • PAS Public
    Python · 39 stars · 0 forks · 0 issues · 0 pull requests · Updated Sep 11, 2024
  • SysBench Public

    SysBench: Can Large Language Models Follow System Messages?

    Python · 16 stars · 2 forks · 0 issues · 0 pull requests · Updated Sep 4, 2024
  • CFBench Public

    CFBench: A Comprehensive Constraints-Following Benchmark for LLMs

    Python · 21 stars · 3 forks · 1 issue · 1 pull request · Updated Aug 26, 2024
  • MathScape Public
    Python · 3 stars · 1 fork · 0 issues · 0 pull requests · Updated Aug 19, 2024
