Stars
Learn how to use the Cognitive Services Python SDK with these samples
Lab files for AI-102 - AI Engineer
MTEB: Massive Text Embedding Benchmark
Retrieval and Retrieval-augmented LLMs
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Official community-driven Azure Machine Learning examples, tested with GitHub Actions.
A curated list of awesome open-source libraries for production LLM
Atipico1 / Kor-IR
Forked from embeddings-benchmark/mteb
Kor-IR: Korean Information Retrieval Benchmark
Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc
A framework for few-shot evaluation of language models.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…
Evaluation software used in the Text Retrieval Conference
pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Make huge neural nets fit in memory
The Universe of Data. All about data, data science, and data engineering
A large-scale multilingual dataset for information retrieval, with thorough human annotations across 18 diverse languages.
Train transformer language models with reinforcement learning.
JVector: the most advanced embedded vector search engine
The official PyTorch implementation of Google's Gemma models
Generative Representational Instruction Tuning
Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint