Skip to content
View lq13918508248's full-sized avatar
😮‍💨
9-5?
😮‍💨
9-5?

Block or report lq13918508248

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A foundation model for knowledge graph reasoning

Python 515 69 Updated Feb 3, 2025
Python 17 2 Updated Feb 16, 2025

An implementation of "M3DOCRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding" by Jaemin Cho, Debanjan Mahata, Ozan Irsoy, Yujie He, and Mohit Bansal (UNC Chape…

Python 29 2 Updated Nov 13, 2024

Graph Foundation Model for Retrieval Augmented Generation

Python 50 6 Updated Mar 5, 2025

A pure python based utility to extract text and images from docx files.

Python 535 97 Updated Oct 17, 2023

GraphRAG-survey: A curated list of resources on graph-based retrieval-augmented generation.

617 90 Updated Mar 2, 2025

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

1,106 73 Updated Feb 24, 2025

A curated list of retrieval-augmented generation (RAG) in large language models

175 16 Updated Feb 14, 2025

GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to be the ultimate GraphRAG/KG local LLM app.

Python 2,015 235 Updated Nov 9, 2024

Official style files for papers submitted to venues of the Association for Computational Linguistics

TeX 967 214 Updated Feb 11, 2025
Python 290 31 Updated Feb 23, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 14,432 1,603 Updated Feb 23, 2025

✨✨Latest Advances on Multimodal Large Language Models

14,115 906 Updated Mar 5, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,843 2,600 Updated Mar 4, 2025

Multimodal image + text captioning for 416k figures from arXiv. Uses CLIP + SciBERT + GPT-2 in an encoder-decoder architecture. CS224N final project.

Jupyter Notebook 1 Updated Mar 15, 2022

Generating figures from research papers, using textual captions from the paper.

Python 24 3 Updated Jul 17, 2023

Awesome-RAG: Collect typical RAG papers and systems.

320 23 Updated Jan 23, 2025

RAG that intelligently adapts to your use case, data, and queries

Python 2,976 161 Updated Feb 27, 2025

A simple, easy-to-hack GraphRAG implementation

Python 2,523 247 Updated Jan 15, 2025

收集和梳理垂直领域的开源模型、数据集及评测基准。

2,381 188 Updated Dec 26, 2023

Standalone evaluation scripts and starter code for the ICDAR 2023 DUDE competition

Python 4 Updated Mar 10, 2023

DUDE: Document UnderstanDing of Everything Benchmark

7 Updated Mar 27, 2023

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 1,562 132 Updated Mar 5, 2025

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models

Jupyter Notebook 128 5 Updated Jan 13, 2025

[NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of existing MLLMs to comprehend long multimodal documents.

Python 113 6 Updated Nov 25, 2024

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)

Python 87 8 Updated Oct 10, 2023

Official implement of "Free Lunch: Frame-level Contrastive Learning with Text Perceiver for Robust Scene Text Recognition in Lightweight Models" in PyTorch.

Python 2 Updated Nov 15, 2024
Next