💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023

Python 122 8 Updated Nov 28, 2024

Contrastive Language-Audio Pretraining

Python 1,568 157 Updated Nov 21, 2024

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | A Chinese-English bilingual multimodal large model series based on the CPM foundation models

Python 1,053 93 Updated Jun 13, 2024

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,025 1,374 Updated Mar 3, 2025

Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

Python 1,928 129 Updated Mar 16, 2025

The evaluation code for MultiIF multi-turn and multi-lingual instruction following

Python 29 1 Updated Oct 29, 2024

A course on aligning smol models.

Jupyter Notebook 5,625 1,959 Updated Jan 24, 2025

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,233 226 Updated Mar 22, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 1,317 203 Updated Mar 21, 2025

Faster Whisper transcription with CTranslate2

Python 14,942 1,258 Updated Mar 20, 2025

Accessible large language models via k-bit quantization for PyTorch.

Python 6,832 677 Updated Mar 19, 2025

PyTorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioners, low-rank approximation preconditioner, and more)

Python 170 10 Updated Dec 14, 2024

Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793

Python 398 14 Updated Dec 5, 2024

Efficient optimizers

Python 183 13 Updated Mar 8, 2025

The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models

Python 575 47 Updated Feb 11, 2025
Python 393 33 Updated Mar 6, 2025

When it comes to optimizers, it's always better to be safe than sorry

Python 214 8 Updated Feb 23, 2025
Python 68 4 Updated Nov 18, 2024

A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.

Python 256 11 Updated Mar 20, 2025

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 6,426 683 Updated Mar 21, 2025

Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"

Python 104 8 Updated Sep 10, 2024

A scikit-learn-compatible library for estimating prediction intervals and controlling risks, based on conformal predictions.

Jupyter Notebook 1,352 115 Updated Mar 21, 2025

(ICLR 2025) TabM: Advancing Tabular Deep Learning With Parameter-Efficient Ensembling

Python 228 16 Updated Mar 17, 2025

Official implementation of "GPT or BERT: why not both?"

Python 49 2 Updated Mar 14, 2025

A comprehensive repository of reasoning tasks for LLMs (and beyond)

JavaScript 424 49 Updated Sep 27, 2024

For optimization algorithm research and development.

Python 501 38 Updated Mar 22, 2025

The Open Cookbook for Top-Tier Code Large Language Model

Python 1,643 100 Updated Dec 8, 2024

🏃 Python3 Solutions of All 27 Problems in MHC 2024

Python 3 1 Updated Jan 14, 2025