-
University of Ljubljana
- in/lojze-zust
Highlights
- Pro
Stars
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
Infinite Photorealistic Worlds using Procedural Generation
[NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)
High accuracy RAG for answering questions from scientific documents with citations
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.
Code&Data for Grounded 3D-LLM with Referent Tokens
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
A Zotero plugin for syncing items and notes into Notion
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
[NeurIPS 2024] Code release for "Segment Anything without Supervision"
[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
Convert PDF to markdown + JSON quickly with high accuracy
A nanoGPT pipeline packed in a spreadsheet
Agno is a lightweight library for building multi-modal Agents
llama3 implementation one matrix multiplication at a time
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
A modern model graph visualizer and debugger
A massively parallel, high-level programming language
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
The Eurovision Song Contest Dataset is a freely-available dataset containing audio features, metadata, contest ranking and voting data of 1735 songs that have competed in the Eurovision Song Contes…
Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message se…
[WACV 2025] Python implementation of Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation
Official implementations for paper: Dynamic Typography: Bringing Text to Life via Video Diffusion Prior