Starred repositories
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
The collection of awesome papers on alignment of diffusion models.
[WACV 2025] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
🧑🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.
Code for ACMMM2024 paper: FodFoM: Fake Outlier Data by Foundation Models Creates Stronger Visual Out-of-Distribution Detector
Writing AI Conference Papers: A Handbook for Beginners
Machine learning from scratch
[ECCV2024] Official Implementation of "NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image"
2021年最新总结,推荐工程师合适读本,计算机科学,软件技术,创业,思想类,数学类,人物传记书籍
Convert any PDF into a podcast episode!
3D Slicer Plugin for Segment anything in medical images
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
Simple academic theme for scientist personal page
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
A collection of awesome LaTeX Thesis/Dissertation templates and beyond! (LaTeX / Word / Typst / Markdown 格式的学位论文、演示文稿、报告、项目申请书、简历、书籍等模板收藏)
GPT4V-level open-source multi-modal model based on Llama3-8B
A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.
PyTorch Reimplementation of LoRA (featuring with supporting nn.MultiheadAttention)
Use naive MultiheadAttention implement to replace nn.MultiheadAttention in pytorch
Recent LLM-based CV and related works. Welcome to comment/contribute!