Skip to content
View dqxiu's full-sized avatar

Highlights

  • Pro

Block or report dqxiu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar | 通过允许从 DBLP 和 Google Scholar 进行文章搜索和获取 BibTeX 来增强 Overleaf。

JavaScript 57 4 Updated Apr 14, 2025
Python 6 Updated Mar 4, 2025

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

418 9 Updated Jan 17, 2025

Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"

Python 298 24 Updated Dec 20, 2023

[ICLR 2025] Benchmarking Agentic Workflow Generation

Python 77 4 Updated Feb 19, 2025

Instruction Tuning with GPT-4

HTML 4,300 306 Updated Jun 11, 2023

A curated list of awesome instruction tuning datasets, models, papers and repositories.

Python 332 14 Updated Jun 12, 2023

A comprehensive benchmark for evaluating Large Multimodal Models' capacities of visual deep semantics.

Python 8 Updated Jul 29, 2024
Python 1,511 160 Updated Apr 18, 2025

The official Meta Llama 3 GitHub site

Python 28,622 3,365 Updated Jan 26, 2025

Arena-Hard-Auto: An automatic LLM benchmark.

Python 780 96 Updated Apr 19, 2025

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 2,929 490 Updated Jan 24, 2025

服务器 GPU 监控程序,当 GPU 属性满足预设条件时通过微信发送提示消息

Python 31 1 Updated Aug 10, 2021

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 252 31 Updated Apr 19, 2025

📰 Must-read papers and blogs on Speculative Decoding ⚡️

688 37 Updated Apr 18, 2025

[NeurlPS D&B 2024] Generative AI for Math: MathPile

Python 410 22 Updated Apr 4, 2025

SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?

Python 2,834 480 Updated Apr 18, 2025

A collection of useful .gitignore templates

165,834 83,071 Updated Apr 11, 2025
Jupyter Notebook 150 22 Updated Jan 4, 2024

Just a bunch of benchmark logs for different LLMs

119 2 Updated Jul 28, 2024

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

553 54 Updated Mar 18, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 5,206 543 Updated Apr 18, 2025

A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI

Python 769 80 Updated Dec 15, 2023

AllenAI's post-training codebase

Python 2,905 374 Updated Apr 19, 2025

Foundation Architecture for (M)LLMs

Python 3,069 217 Updated Apr 11, 2024

Code for EMNLP 2023 Findings paper: "Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning"

Python 12 1 Updated Oct 10, 2023
Python 15 Updated Oct 28, 2023

[NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin Liu, Lei Li, Shuhuai Ren, Rundong Gao, Shicheng Li, Sishuo Ch…

Python 53 2 Updated Mar 4, 2024

A repository for research on medium sized language models.

Python 493 69 Updated Apr 16, 2025

This repository contains everything you need to become proficient in ML/AI Research and Research Papers

576 80 Updated Mar 17, 2024
Next