Skip to content
View dxli94's full-sized avatar
🐶
🐶

Block or report dxli94

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Suna - Open Source Generalist AI Agent

TypeScript 3,119 414 Updated Apr 24, 2025

Lightweight coding agent that runs in your terminal

TypeScript 19,902 1,845 Updated Apr 24, 2025

ProBench: Automatic Evaluation on Open-ended Multi-domain Expert Tasks

Python 7 Updated Mar 11, 2025

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

Python 21,773 1,848 Updated Apr 23, 2025

A minimal and universal controller for FLUX.1.

Python 1,506 104 Updated Apr 24, 2025

[CVPR 2025] Official Dataloader and Evaluation Scripts for VideoAutoBench.

Python 10 Updated Nov 28, 2024

Minimalistic large language model 3D-parallelism training

Python 1,805 183 Updated Apr 24, 2025

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,075 63 Updated Feb 7, 2025

Codebase for Aria - an Open Multimodal Native MoE

Jupyter Notebook 1,029 87 Updated Jan 22, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 24,474 1,537 Updated Apr 24, 2025

The Library for LLM-based multi-agent applications

Python 79 17 Updated Feb 18, 2025

[Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.

Python 95 2 Updated Jul 27, 2024

LLM101n: Let's build a Storyteller

33,262 1,814 Updated Aug 1, 2024

③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.

Python 417 24 Updated Mar 12, 2025
Vim Script 1 Updated Apr 19, 2024

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…

Jupyter Notebook 440 24 Updated Mar 10, 2025

A collection of AWESOME things about mixture-of-experts

1,087 78 Updated Dec 8, 2024

A Python package to estimate class prevalence in unlabeled datasets by specifying stability assumptions

Jupyter Notebook 1 1 Updated Dec 5, 2024

leaked prompts of GPTs

29,712 4,018 Updated Sep 27, 2024

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 12,632 2,355 Updated Jun 26, 2024

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

Python 9,032 1,553 Updated Jun 26, 2024

A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.

Python 30 2 Updated Jul 14, 2023

Supercharge Your Model Training

Python 5,341 437 Updated Apr 24, 2025

PyRCA: A Python Machine Learning Library for Root Cause Analysis

Python 465 47 Updated Nov 15, 2023

[TPAMI 2023] This is an official implementation for "Vicinity Vision Transformer".

Python 20 2 Updated Jun 15, 2023

Pretrained Dalle2 from laion

Python 502 65 Updated Apr 15, 2023

EVA Series: Visual Representation Fantasies from BAAI

Python 2,472 184 Updated Aug 1, 2024
Python 695 42 Updated Mar 6, 2023

[CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning

Python 24 4 Updated Apr 10, 2023

🌇 A collection of links for free stock photography, video and Illustration websites

13,312 782 Updated Jan 14, 2025
Next