https://ddangeun.tistory.com/
Stars
Stable Diffusion web UI
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
A Gradio web UI for Large Language Models with support for multiple inference backends.
Making large AI models cheaper, faster and more accessible
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
Code for the paper "Language Models are Unsupervised Multitask Learners"
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
Large Language Model Text Generation Inference
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
A deep learning library for video understanding research.
Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHX…
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Code for "OnePose: One-Shot Object Pose Estimation without CAD Models", CVPR 2022
Metric learning and retrieval pipelines, models and zoo.
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Universal Monocular Metric Depth Estimation
These are the new FER+ label annotations for the Emotion FER dataset.
MLCD & UNICOM: Large-Scale Visual Representation Model
Code for "OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models", NeurIPS 2022