Skip to content
View gyin94's full-sized avatar

Block or report gyin94

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
26 results for source starred repositories
Clear filter

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 41,861 5,443 Updated Jan 27, 2025

This Repo will provide TensorFlow libraries and extended build tutorials that require compilation to build, as well as pre-compiled wheel files.

114 9 Updated Jan 21, 2025

Arena-Hard-Auto: An automatic LLM benchmark.

Python 717 88 Updated Dec 29, 2024

[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct

Python 144 8 Updated Jan 16, 2025

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 14,293 1,454 Updated Jan 27, 2025

A benchmark for emotional intelligence in large language models

Python 216 19 Updated Jul 26, 2024

RuLES: a benchmark for evaluating rule-following in language models

Python 214 15 Updated Jan 22, 2025

Must-read Papers on LLM Agents.

2,064 117 Updated Nov 12, 2024

[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following

Python 119 8 Updated Jul 8, 2024

An Analytical Evaluation Board of Multi-turn LLM Agents

SAS 272 28 Updated May 20, 2024

Minimalistic large language model 3D-parallelism training

Python 1,406 140 Updated Jan 27, 2025
Jupyter Notebook 400 32 Updated Feb 13, 2024

[EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning

Python 13 Updated Sep 20, 2024

CRUXEval: Code Reasoning, Understanding, and Execution Evaluation

Python 122 14 Updated Oct 11, 2024

MLX: An array framework for Apple silicon

C++ 18,607 1,070 Updated Jan 28, 2025

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,393 161 Updated Jun 25, 2024

An Extensible Deep Learning Library

Python 1,927 281 Updated Jan 28, 2025

structured outputs for llms

Python 9,123 713 Updated Jan 27, 2025

The FunctionChain is a tool that simplifies and organizes the process of invoking OpenAI functions in your Node.js applications. With this toolkit, you can easily scaffold out and isolate all the O…

JavaScript 55 10 Updated Jul 10, 2023

🛠 openai function calling tools for JS/TS

TypeScript 290 26 Updated Jan 30, 2024

Python bindings for ggml

Python 136 11 Updated Sep 2, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 35,143 5,347 Updated Jan 28, 2025
Jupyter Notebook 1,025 100 Updated May 29, 2023

Fast inference engine for Transformer models

C++ 3,548 314 Updated Dec 18, 2024