Stars
An RLHF Infrastructure for Vision-Language Models
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Convert images of LaTeX math equations into LaTeX code.
Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigation
Aligning LMMs with Factually Augmented RLHF
LLM notes covering model inference, transformer model structure, and LLM framework code analysis.
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
[ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"
RUCAIBox / POPE
Forked from AoiDragon/POPE
The official GitHub page for "Evaluating Object Hallucination in Large Vision-Language Models"
A package that achieves 95%+ transfer attack success rate against GPT-4
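✔ (Completed) The most comprehensive deep learning notes [Tudui, PyTorch] [Mu Li, Dive into Deep Learning] [Andrew Ng, Deep Learning]
"Hello Algo": a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; the English version is ongoing.
"Dive into Deep Learning": written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.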
A library of visualization tools for the interpretability and hallucination analysis of large vision-language models (LVLMs).
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Deep learning for image processing, including classification, object detection, etc.
A Microsoft New Bing demo site built with Vue3 and Go, with a consistent UI experience, support for ChatGPT prompts and API calls, and availability in mainland China.
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
Student project for using audio on the DE2-115 FPGA development board.