hongju-jeong

Follow

😃

HongJu Jeong hongju-jeong

😃

Follow

30 followers · 37 following

Kyung Hee University
South Korea

Achievements

Achievements

Organizations

Lists (19)

Sort

3D Face

72 repositories

Cloud Resource management

CUDA

distributed parallel training

11 repositories

elasticsearch

Image Segmentation

inference

LoRa module

MLOps

MSA

microservices architecture

Network

NLU & NLP

21 repositories

NPU

RAG

20 repositories

SDN

SR & Deblur

Vision assistant

Web3

workload dataset

Stars

richard-peng-xia / MMed-RAG

[arXiv'24 & NeurIPSW'24] MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Python 77 6 Updated Dec 17, 2024

mit-han-lab / streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 6,723 373 Updated Jul 11, 2024

YaoJiayi / CacheBlend

Python 60 7 Updated Dec 31, 2024

MooreThreads / TurboRAG

Python 55 4 Updated Nov 25, 2024

microsoft / graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 21,205 2,074 Updated Dec 31, 2024

FasterDecoding / REST

REST: Retrieval-Based Speculative Decoding, NAACL 2024

C 185 11 Updated Dec 2, 2024

amazon-science / piperag

PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)

Python 14 2 Updated Jun 14, 2024

infoslack / qdrant-example

Jupyter Notebook 24 15 Updated Jun 20, 2024

HKUDS / LightRAG

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 12,414 1,711 Updated Dec 31, 2024

DefTruth / CUDA-Learn-Notes

📚150+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 1,826 191 Updated Dec 31, 2024

DefTruth / Awesome-LLM-Inference

📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉

3,083 208 Updated Dec 27, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 32,942 5,012 Updated Jan 1, 2025

YiyangZhou / POVID

[Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning

Python 77 3 Updated Apr 30, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 27,709 3,166 Updated Aug 12, 2024

milvus-io / milvus

Milvus is a high-performance, cloud-native vector database designed to scale vector search.

Go 31,586 2,988 Updated Dec 31, 2024

facebookresearch / contriever

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Python 702 60 Updated Apr 7, 2023

starsuzi / Adaptive-RAG

Jsonnet 226 25 Updated May 2, 2024

richard-peng-xia / RULE

[EMNLP'24] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models

Python 54 3 Updated Dec 13, 2024

ShishirPatil / gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,626 1,023 Updated Dec 31, 2024

Nyan-SouthKorea / sLLM_int4_multi-turn_Ollama_Module

Ollama 기반의 int4 gguf 형식 sLLM을 multi-turn 형태로 대화할 수 있는 통합 모듈

Jupyter Notebook 5 1 Updated Jul 25, 2024

AkariAsai / self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 1,913 176 Updated May 25, 2024

tabtoyou / KoLLaVA

KoLLaVA: Korean Large Language-and-Vision Assistant (feat.LLaVA)

Jupyter Notebook 278 30 Updated Sep 20, 2024

teddylee777 / langchain-kr

LangChain 공식 Document, Cookbook, 그 밖의 실용 예제를 바탕으로 작성한 한국어 튜토리얼입니다. 본 튜토리얼을 통해 LangChain을 더 쉽고 효과적으로 사용하는 방법을 배울 수 있습니다.

Jupyter Notebook 1,304 358 Updated Dec 28, 2024

nyu-systems / Grendel-GS

Ongoing research training gaussian splatting at scale by distributed system

Python 411 22 Updated Aug 9, 2024

zju3dv / RelightableAvatar

[CVPR 2024 (Highlight)] Relightable and Animatable Neural Avatar from Sparse-View Video

Python 104 3 Updated Apr 18, 2024

swj0419 / REPLUG

Python 59 2 Updated Jan 20, 2024

coree / awesome-rag

A curated list of retrieval-augmented generation (RAG) in large language models

145 15 Updated Sep 26, 2024

Tongji-KGLLM / RAG-Survey

1,903 125 Updated May 8, 2024

jiwoochris / LAW-Alpaca

AI 법률 어드바이저 모델 : KoAlpaca 모델에 생활법령 데이터를 학습시켜 LoRA finetuning & 생활 법령 100문 100답 데이터 2,195개를 스크랩 하여 LLM 학습을 위한 대화 형식의 json 파일로 제작

Shell 13 4 Updated Feb 1, 2024

NVlabs / InstantSplat

InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds

Python 933 60 Updated Dec 30, 2024