lizhaoliu-Lec

🎯

Focusing

lizhaoliu lizhaoliu-Lec

🎯

Focusing

Persistence and Concentration.

25 followers · 11 following

South China University of Technology
Guangzhou/China

Achievements

Stars

DepthAnything / Depth-Anything-V2

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 3,492 288 Updated Aug 14, 2024

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,509 999 Updated Oct 8, 2024

xzhih / one-key-hidpi

Enable macOS HiDPI and have a native setting.

Shell 8,683 995 Updated Jul 3, 2024

XinyuSun / PSL-InstanceNav

official implementation for ECCV 2024 paper "Prioritized Semantic Learning for Zero-shot Instance Navigation"

Python 13 2 Updated Sep 25, 2024

Li-ChangHao / CoNav

7 Updated Jul 16, 2024

yuweihao / MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Python 1,983 34 Updated Jun 6, 2024

ZSHsh98 / MMD-MP

This is the source code for Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean Discrepancy (ICLR2024).

Python 40 2 Updated Aug 12, 2024

ActiveVisionLab / Awesome-LLM-3D

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

1,055 73 Updated Oct 8, 2024

xai-org / grok-1

Grok open release

Python 49,478 8,325 Updated Aug 30, 2024

alaamaalouf / FollowAnything

Jupyter Notebook 359 45 Updated Dec 5, 2023

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,746 455 Updated May 3, 2024

GAP-LAB-CUHK-SZ / SAMPro3D

SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Scene Segmentation

Python 94 8 Updated Jan 12, 2024

dvlab-research / LLaMA-VID

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

Python 708 45 Updated Jul 29, 2024

3d-vista / 3D-VisTA

Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"

Python 183 10 Updated Sep 7, 2023

mbzuai-oryx / groundingLMM

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 753 37 Updated Jun 2, 2024

jacobgil / pytorch-grad-cam

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 10,363 1,549 Updated Oct 7, 2024