Stars
Official repository of "HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning", Findings of EMNLP 2023
a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini
A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).
A curated list of resources for Cross-lingual Information Retrieval (CLIR).
Code for the paper "Adapt - $\infty$: Scalable Lifelong Multimodal Instruction Tuning"
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
A Framework of Small-scale Large Multimodal Models
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)
Using sparse coding to find distributed representations used by neural networks.
Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
A game theoretic approach to explain the output of any machine learning model.
This is the official implementation of the paper "MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision and Language Models & Tasks"
State-of-the-Art Text Embeddings