-
Janus Public
Forked from deepseek-ai/JanusJanus-Series: Unified Multimodal Understanding and Generation Models
Python MIT License UpdatedJan 28, 2025 -
-
KAG Public
Forked from OpenSPG/KAGKAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…
Python Apache License 2.0 UpdatedJan 7, 2025 -
VideoRefer Public
Forked from DAMO-NLP-SG/VideoReferThe code for "VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM"
Python UpdatedJan 5, 2025 -
YuLan-Mini Public
Forked from RUC-GSAI/YuLan-MiniA highly capable 2.4B lightweight LLM using only 1T pre-training data.
MIT License UpdatedDec 27, 2024 -
-
-
memo Public
Forked from memoavatar/memoMemory-Guided Diffusion for Expressive Talking Video Generation
Python Apache License 2.0 UpdatedDec 6, 2024 -
Thinking-Claude Public
Forked from richards199999/Thinking-ClaudeLet your Claude able to think
TypeScript MIT License UpdatedDec 3, 2024 -
documentation-helper Public
Forked from emarco177/documentation-helperPython Apache License 2.0 UpdatedNov 14, 2024 -
LivePortrait Public
Forked from KwaiVGI/LivePortraitBring portraits to life!
Python Other UpdatedNov 12, 2024 -
browser-use Public
Forked from browser-use/browser-useOpen-Source Web Automation library with any LLM
Python MIT License UpdatedNov 10, 2024 -
open-battery-information Public
Forked from mnh-jansson/open-battery-informationC++ MIT License UpdatedOct 18, 2024 -
WavTokenizer Public
Forked from jishengpeng/WavTokenizerSOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Python MIT License UpdatedAug 30, 2024 -
doctr Public
Forked from mindee/doctrdocTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
-
PPOCRLabel Public
Forked from PFCCLab/PPOCRLabelPPOCRLabelv2 is a semi-automatic graphic annotation tool suitable for OCR field, with built-in PP-OCR model to automatically detect and re-recognize data.
Python UpdatedAug 24, 2024 -
notebooks Public
Forked from roboflow/notebooksExamples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models l…
Jupyter Notebook UpdatedAug 19, 2024 -
cvat Public
Forked from cvat-ai/cvatAnnotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
TypeScript MIT License UpdatedAug 18, 2024 -
ultralytics Public
Forked from ultralytics/ultralyticsNEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Python GNU Affero General Public License v3.0 UpdatedAug 18, 2024 -
Deep-Live-Cam Public
Forked from hacksider/Deep-Live-Camreal time face swap and one-click video deepfake with only a single image
Python GNU Affero General Public License v3.0 UpdatedAug 16, 2024 -
facefusion Public
Forked from facefusion/facefusionNext generation face swapper and enhancer
Python Other UpdatedAug 15, 2024 -
PeriodWave Public
Forked from sh-lee-prml/PeriodWaveThe official Implementation of PeriodWave and PeriodWave-Turbo
MIT License UpdatedAug 15, 2024 -
GenerativePhotomontage Public
Forked from lseancs/GenerativePhotomontagePython UpdatedAug 15, 2024 -
insightface Public
Forked from deepinsight/insightfaceState-of-the-art 2D and 3D Face Analysis Project
Python UpdatedAug 14, 2024 -
segment-anything-2 Public
Forked from facebookresearch/sam2The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Jupyter Notebook Apache License 2.0 UpdatedAug 14, 2024 -
LongWriter Public
Forked from THUDM/LongWriterLongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Python Apache License 2.0 UpdatedAug 13, 2024 -
CogVideo Public
Forked from THUDM/CogVideoText-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
-
mPLUG-Owl Public
Forked from X-PLUG/mPLUG-OwlmPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Python MIT License UpdatedAug 13, 2024 -
AI-Scientist Public
Forked from SakanaAI/AI-ScientistThe AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
Jupyter Notebook Apache License 2.0 UpdatedAug 13, 2024 -
FruitNeRF Public
Forked from meyerls/FruitNeRF[IROS24] Offical Code for "FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework" - Inegrated into Nerfstudio
Python UpdatedAug 12, 2024