Skip to content
View JiahangXu's full-sized avatar

Block or report JiahangXu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.

Python 1,186 186 Updated Sep 23, 2024

Automated Design of Agentic Systems

Python 933 138 Updated Oct 1, 2024
Python 359 35 Updated Sep 23, 2024

A validation and profiling tool for AI infrastructure

Python 261 56 Updated Oct 8, 2024

LongRoPE is a novel method that can extends the context window of pre-trained LLMs to an impressive 2048k tokens.

Python 91 10 Updated Aug 23, 2024

A library for advanced large language model reasoning

Python 1,194 98 Updated Sep 3, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 4,682 437 Updated Jun 22, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 167,345 44,208 Updated Oct 9, 2024

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

Python 9,326 844 Updated Oct 8, 2024

Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods in Large Language Models

200 9 Updated Mar 27, 2024

A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)

Python 560 29 Updated Sep 30, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 17,619 1,341 Updated Oct 9, 2024

Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.

217 18 Updated Mar 19, 2024

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 840 98 Updated Oct 7, 2024

Extract full next-token probabilities via language model APIs

Python 227 14 Updated Feb 23, 2024

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Python 773 37 Updated Jun 27, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 28,080 4,149 Updated Oct 9, 2024

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,223 717 Updated Aug 5, 2024

Interactive coding assistant for data scientists and machine learning developers, empowered by large language models.

Python 76 14 Updated Oct 8, 2024

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Python 545 42 Updated Mar 4, 2024

This is a collection of our research on efficient AI, covering hardware-aware NAS and model compression.

Python 74 7 Updated Sep 17, 2024

The repo for In-context Autoencoder

Jupyter Notebook 83 6 Updated May 11, 2024

This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.

Python 520 39 Updated Mar 10, 2024

WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)

Python 1,559 135 Updated Jul 29, 2023

A framework for few-shot evaluation of language models.

Python 6,644 1,758 Updated Oct 8, 2024

Fast and memory-efficient exact attention

Python 13,688 1,256 Updated Oct 9, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 10,456 669 Updated Aug 14, 2024

Awesome LLM compression research papers and tools.

1,104 66 Updated Oct 9, 2024

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,713 370 Updated Mar 14, 2024
Next