Skip to content
View hongju-jeong's full-sized avatar
๐Ÿ˜ƒ
๐Ÿ˜ƒ
  • Kyung Hee University
  • South Korea

Organizations

@KHU-Dasom @icns-distributed-cloud

Block or report hongju-jeong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[arXiv'24 & NeurIPSW'24] MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Python 77 6 Updated Dec 17, 2024

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 6,723 373 Updated Jul 11, 2024
Python 60 7 Updated Dec 31, 2024
Python 55 4 Updated Nov 25, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 21,205 2,074 Updated Dec 31, 2024

REST: Retrieval-Based Speculative Decoding, NAACL 2024

C 185 11 Updated Dec 2, 2024

PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)

Python 14 2 Updated Jun 14, 2024
Jupyter Notebook 24 15 Updated Jun 20, 2024

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 12,414 1,711 Updated Dec 31, 2024

๐Ÿ“š150+ Tensor/CUDA Cores Kernels, โšก๏ธflash-attn-mma, โšก๏ธhgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 ๐ŸŽ‰๐ŸŽ‰).

Cuda 1,826 191 Updated Dec 31, 2024

๐Ÿ“–A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. ๐ŸŽ‰๐ŸŽ‰

3,083 208 Updated Dec 27, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 32,942 5,012 Updated Jan 1, 2025

[Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning

Python 77 3 Updated Apr 30, 2024

The official Meta Llama 3 GitHub site

Python 27,709 3,166 Updated Aug 12, 2024

Milvus is a high-performance, cloud-native vector database designed to scale vector search.

Go 31,586 2,988 Updated Dec 31, 2024

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Python 702 60 Updated Apr 7, 2023
Jsonnet 226 25 Updated May 2, 2024

[EMNLP'24] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models

Python 54 3 Updated Dec 13, 2024

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,626 1,023 Updated Dec 31, 2024

Ollama ๊ธฐ๋ฐ˜์˜ int4 gguf ํ˜•์‹ sLLM์„ multi-turn ํ˜•ํƒœ๋กœ ๋Œ€ํ™”ํ•  ์ˆ˜ ์žˆ๋Š” ํ†ตํ•ฉ ๋ชจ๋“ˆ

Jupyter Notebook 5 1 Updated Jul 25, 2024

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 1,913 176 Updated May 25, 2024

KoLLaVA: Korean Large Language-and-Vision Assistant (feat.LLaVA)

Jupyter Notebook 278 30 Updated Sep 20, 2024

LangChain ๊ณต์‹ Document, Cookbook, ๊ทธ ๋ฐ–์˜ ์‹ค์šฉ ์˜ˆ์ œ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ์ž‘์„ฑํ•œ ํ•œ๊ตญ์–ด ํŠœํ† ๋ฆฌ์–ผ์ž…๋‹ˆ๋‹ค. ๋ณธ ํŠœํ† ๋ฆฌ์–ผ์„ ํ†ตํ•ด LangChain์„ ๋” ์‰ฝ๊ณ  ํšจ๊ณผ์ ์œผ๋กœ ์‚ฌ์šฉํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ๋ฐฐ์šธ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

Jupyter Notebook 1,304 358 Updated Dec 28, 2024

Ongoing research training gaussian splatting at scale by distributed system

Python 411 22 Updated Aug 9, 2024

[CVPR 2024 (Highlight)] Relightable and Animatable Neural Avatar from Sparse-View Video

Python 104 3 Updated Apr 18, 2024
Python 59 2 Updated Jan 20, 2024

A curated list of retrieval-augmented generation (RAG) in large language models

145 15 Updated Sep 26, 2024

AI ๋ฒ•๋ฅ  ์–ด๋“œ๋ฐ”์ด์ € ๋ชจ๋ธ : KoAlpaca ๋ชจ๋ธ์— ์ƒํ™œ๋ฒ•๋ น ๋ฐ์ดํ„ฐ๋ฅผ ํ•™์Šต์‹œ์ผœ LoRA finetuning & ์ƒํ™œ ๋ฒ•๋ น 100๋ฌธ 100๋‹ต ๋ฐ์ดํ„ฐ 2,195๊ฐœ๋ฅผ ์Šคํฌ๋žฉ ํ•˜์—ฌ LLM ํ•™์Šต์„ ์œ„ํ•œ ๋Œ€ํ™” ํ˜•์‹์˜ json ํŒŒ์ผ๋กœ ์ œ์ž‘

Shell 13 4 Updated Feb 1, 2024

InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds

Python 933 60 Updated Dec 30, 2024
Next