Fanb1ing

Bingbing Fan Fanb1ing

Stars

allRank is a framework for training learning-to-rank neural models based on PyTorch.

Python 899 122 Updated Aug 6, 2024

The official implementation of the paper "AgentSquare: Automatic LLM Agent Search in Modular Design Space"

HTML 146 11 Updated Nov 21, 2024

Python 194 9 Updated Oct 19, 2024

vLLM for embedding tasks using original LLMs (Qwen2, LLaMA)

Jupyter Notebook 26 2 Updated Sep 9, 2024

Translates PDFs, EPUBs, webpages, metadata, annotations, and notes to the target language. Supports 20+ translation services.

TypeScript 7,976 371 Updated Jan 13, 2025

SPatial INTeraction Models (spint)

Jupyter Notebook 54 23 Updated Dec 20, 2023

Infinity is a high-throughput, low-latency serving engine for text embeddings, reranking models, CLIP, CLAP, and ColPali

Python 1,717 120 Updated Jan 15, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,938 5,212 Updated Jan 18, 2025

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,981 1,990 Updated Apr 16, 2024