Skip to content
View Ambier's full-sized avatar
  • alibaba
  • HangZhou

Block or report Ambier

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]

Python 224 36 Updated Feb 28, 2025

Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"

Python 1,202 87 Updated Apr 14, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 19,027 2,173 Updated Apr 9, 2025

The code to perform Sequence Labelling with LLMs, including T5, FLAN, LLaMA, Alpaca and more!

Python 14 3 Updated Nov 5, 2024

A curated list of awesome data labeling tools

1 Updated Jun 17, 2024

Label, clean and enrich text datasets with LLMs.

Python 2,203 155 Updated Mar 5, 2025

Adala: Autonomous DAta (Labeling) Agent framework

Python 1,139 93 Updated Apr 11, 2025

Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …

Python 8,968 1,698 Updated Mar 17, 2025

A powerful tool for creating fine-tuning datasets for LLM

JavaScript 5,390 562 Updated Apr 13, 2025

A quick guide (especially) for trending instruction finetuning datasets

3,004 195 Updated Nov 28, 2023

Curated list of datasets and tools for post-training.

2,939 254 Updated Jan 29, 2025

Data annotation toolbox supports image, audio and video data.

Python 1,149 117 Updated Apr 10, 2025

收集整理开源的数据标注工具

845 167 Updated Oct 9, 2019

The Memory layer for AI Agents

Python 27,521 2,617 Updated Apr 14, 2025

Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.

Python 15,961 1,654 Updated Apr 13, 2025

The repo for In-context Autoencoder

Jupyter Notebook 120 14 Updated May 11, 2024

心理健康大模型、LLM、The Big Model of Mental Health、Finetune、InternLM2、InternLM2.5、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral、LLama3、GLM4、Qwen2、LLama3.1

Python 1,365 179 Updated Apr 13, 2025

Prompt, run, edit, and deploy full-stack web applications

TypeScript 14,231 11,475 Updated Dec 17, 2024

An AI-powered custom node for ComfyUI designed to enhance workflow automation and provide intelligent assistance

TypeScript 1,500 123 Updated Apr 11, 2025
Python 1 Updated Feb 21, 2025

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Python 1 Updated Feb 6, 2025

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Python 2,106 71 Updated Apr 10, 2025

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Python 660 54 Updated Apr 14, 2025

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…

Python 4,133 353 Updated Apr 15, 2025

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

2,592 179 Updated Mar 21, 2025

This is a continuously updated handbook for readers to easily track the latest NL2SQL (Text-to-SQL) techniques in the literature and provide practical guidance for researchers and practitioners. If…

Python 526 35 Updated Apr 11, 2025

基于Python的开源量化交易平台开发框架

Python 28,979 9,446 Updated Apr 4, 2025

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

Jupyter Notebook 330 23 Updated Sep 30, 2024

Fully open reproduction of DeepSeek-R1

Python 23,928 2,183 Updated Apr 14, 2025
Next