Skip to content
View gary109's full-sized avatar
🌴
On vacation
🌴
On vacation

Block or report gary109

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • Janus Public

    Forked from deepseek-ai/Janus

    Janus-Series: Unified Multimodal Understanding and Generation Models

    Python MIT License Updated Jan 28, 2025
  • Python MIT License Updated Jan 26, 2025
  • KAG Public

    Forked from OpenSPG/KAG

    KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…

    Python Apache License 2.0 Updated Jan 7, 2025
  • The code for "VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM"

    Python Updated Jan 5, 2025
  • YuLan-Mini Public

    Forked from RUC-GSAI/YuLan-Mini

    A highly capable 2.4B lightweight LLM using only 1T pre-training data.

    MIT License Updated Dec 27, 2024
  • Python Apache License 2.0 Updated Dec 13, 2024
  • memo-tost Public

    Forked from camenduru/memo-tost
    Python Updated Dec 6, 2024
  • memo Public

    Forked from memoavatar/memo

    Memory-Guided Diffusion for Expressive Talking Video Generation

    Python Apache License 2.0 Updated Dec 6, 2024
  • Let your Claude able to think

    TypeScript MIT License Updated Dec 3, 2024
  • Python Apache License 2.0 Updated Nov 14, 2024
  • Bring portraits to life!

    Python Other Updated Nov 12, 2024
  • Open-Source Web Automation library with any LLM

    Python MIT License Updated Nov 10, 2024
  • C++ MIT License Updated Oct 18, 2024
  • SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

    Python MIT License Updated Aug 30, 2024
  • doctr Public

    Forked from mindee/doctr

    docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

    Python 1 Apache License 2.0 Updated Aug 29, 2024
  • PPOCRLabel Public

    Forked from PFCCLab/PPOCRLabel

    PPOCRLabelv2 is a semi-automatic graphic annotation tool suitable for OCR field, with built-in PP-OCR model to automatically detect and re-recognize data.

    Python Updated Aug 24, 2024
  • notebooks Public

    Forked from roboflow/notebooks

    Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models l…

    Jupyter Notebook Updated Aug 19, 2024
  • cvat Public

    Forked from cvat-ai/cvat

    Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

    TypeScript MIT License Updated Aug 18, 2024
  • NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

    Python GNU Affero General Public License v3.0 Updated Aug 18, 2024
  • real time face swap and one-click video deepfake with only a single image

    Python GNU Affero General Public License v3.0 Updated Aug 16, 2024
  • Next generation face swapper and enhancer

    Python Other Updated Aug 15, 2024
  • The official Implementation of PeriodWave and PeriodWave-Turbo

    MIT License Updated Aug 15, 2024
  • Python Updated Aug 15, 2024
  • State-of-the-art 2D and 3D Face Analysis Project

    Python Updated Aug 14, 2024
  • The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

    Jupyter Notebook Apache License 2.0 Updated Aug 14, 2024
  • LongWriter Public

    Forked from THUDM/LongWriter

    LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

    Python Apache License 2.0 Updated Aug 13, 2024
  • CogVideo Public

    Forked from THUDM/CogVideo

    Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

    Python 1 Apache License 2.0 Updated Aug 13, 2024
  • mPLUG-Owl Public

    Forked from X-PLUG/mPLUG-Owl

    mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

    Python MIT License Updated Aug 13, 2024
  • The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

    Jupyter Notebook Apache License 2.0 Updated Aug 13, 2024
  • FruitNeRF Public

    Forked from meyerls/FruitNeRF

    [IROS24] Offical Code for "FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework" - Inegrated into Nerfstudio

    Python Updated Aug 12, 2024