-
Kyung Hee University
- South Korea
Lists (19)
Sort Name ascending (A-Z)
Stars
[arXiv'24 & NeurIPSW'24] MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
A modular graph-based Retrieval-Augmented Generation (RAG) system
REST: Retrieval-Based Speculative Decoding, NAACL 2024
PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
๐150+ Tensor/CUDA Cores Kernels, โก๏ธflash-attn-mma, โก๏ธhgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 ๐๐).
๐A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. ๐๐
A high-throughput and memory-efficient inference and serving engine for LLMs
[Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning
Milvus is a high-performance, cloud-native vector database designed to scale vector search.
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
[EMNLP'24] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Ollama ๊ธฐ๋ฐ์ int4 gguf ํ์ sLLM์ multi-turn ํํ๋ก ๋ํํ ์ ์๋ ํตํฉ ๋ชจ๋
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
KoLLaVA: Korean Large Language-and-Vision Assistant (feat.LLaVA)
LangChain ๊ณต์ Document, Cookbook, ๊ทธ ๋ฐ์ ์ค์ฉ ์์ ๋ฅผ ๋ฐํ์ผ๋ก ์์ฑํ ํ๊ตญ์ด ํํ ๋ฆฌ์ผ์ ๋๋ค. ๋ณธ ํํ ๋ฆฌ์ผ์ ํตํด LangChain์ ๋ ์ฝ๊ณ ํจ๊ณผ์ ์ผ๋ก ์ฌ์ฉํ๋ ๋ฐฉ๋ฒ์ ๋ฐฐ์ธ ์ ์์ต๋๋ค.
Ongoing research training gaussian splatting at scale by distributed system
[CVPR 2024 (Highlight)] Relightable and Animatable Neural Avatar from Sparse-View Video
A curated list of retrieval-augmented generation (RAG) in large language models
AI ๋ฒ๋ฅ ์ด๋๋ฐ์ด์ ๋ชจ๋ธ : KoAlpaca ๋ชจ๋ธ์ ์ํ๋ฒ๋ น ๋ฐ์ดํฐ๋ฅผ ํ์ต์์ผ LoRA finetuning & ์ํ ๋ฒ๋ น 100๋ฌธ 100๋ต ๋ฐ์ดํฐ 2,195๊ฐ๋ฅผ ์คํฌ๋ฉ ํ์ฌ LLM ํ์ต์ ์ํ ๋ํ ํ์์ json ํ์ผ๋ก ์ ์
InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds