Skip to content
View gyin94's full-sized avatar

Block or report gyin94

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 41,610 5,418 Updated Jan 14, 2025

This Repo will provide TensorFlow libraries and extended build tutorials that require compilation to build, as well as pre-compiled wheel files.

114 9 Updated Dec 18, 2024

Arena-Hard-Auto: An automatic LLM benchmark.

Python 706 84 Updated Dec 29, 2024

[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct

Python 138 8 Updated Dec 11, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 14,169 1,444 Updated Jan 14, 2025

A benchmark for emotional intelligence in large language models

Python 212 19 Updated Jul 26, 2024

RuLES: a benchmark for evaluating rule-following in language models

Python 214 15 Updated Jan 14, 2025

Must-read Papers on LLM Agents.

2,033 111 Updated Nov 12, 2024

[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following

Python 120 8 Updated Jul 8, 2024

An Analytical Evaluation Board of Multi-turn LLM Agents

SAS 270 26 Updated May 20, 2024

Minimalistic large language model 3D-parallelism training

Python 1,384 138 Updated Jan 14, 2025
Jupyter Notebook 400 32 Updated Feb 13, 2024

[EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning

Python 12 Updated Sep 20, 2024

CRUXEval: Code Reasoning, Understanding, and Execution Evaluation

Python 119 14 Updated Oct 11, 2024

MLX: An array framework for Apple silicon

C++ 18,310 1,056 Updated Jan 14, 2025

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,387 166 Updated Jun 25, 2024

An Extensible Deep Learning Library

Python 1,920 280 Updated Jan 14, 2025

structured outputs for llms

Python 8,879 701 Updated Jan 14, 2025

The FunctionChain is a tool that simplifies and organizes the process of invoking OpenAI functions in your Node.js applications. With this toolkit, you can easily scaffold out and isolate all the O…

JavaScript 55 11 Updated Jul 10, 2023

🛠 openai function calling tools for JS/TS

TypeScript 289 26 Updated Jan 30, 2024

Python bindings for ggml

Python 136 11 Updated Sep 2, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,733 5,156 Updated Jan 14, 2025
Jupyter Notebook 1,025 100 Updated May 29, 2023

Fast inference engine for Transformer models

C++ 3,523 310 Updated Dec 18, 2024