Skip to content
View Fanb1ing's full-sized avatar
  • Tsinghua University
  • 北京
  • 01:06 (UTC -12:00)

Block or report Fanb1ing

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

allRank is a framework for training learning-to-rank neural models based on PyTorch.

Python 899 122 Updated Aug 6, 2024

The official implementation of the paper "AgentSquare: Automatic LLM Agent Search in Modular Design Space""

HTML 146 11 Updated Nov 21, 2024

vLLM for embedding tasks using Original LLMs (Qwen2, LLaMA)

Jupyter Notebook 26 2 Updated Sep 9, 2024

Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.

TypeScript 7,976 371 Updated Jan 13, 2025

SPatial INTeraction Models (spint)

Jupyter Notebook 54 23 Updated Dec 20, 2023

Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

Python 1,717 120 Updated Jan 15, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,938 5,212 Updated Jan 18, 2025

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,981 1,990 Updated Apr 16, 2024