
Amazon Web Services
- Remote
Starred repositories
🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Dataset proposed in "A Benchmark and Frequency Compression Method for Infrared Few-Shot Object Detection"
Framework that enables fine-tuning of vision-language grounding models on custom datasets
Pioneering Multimodal Reasoning with CoT
[CVPR 2025 Oral] VGGT: Visual Geometry Grounded Transformer
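Digital Base (数字底座) is a unified and secure management support platform for the digital transformation of large government bodies and enterprises, built on identity authentication, organizational structure, positions and duties, application systems, resource roles, data catalogs, security controls, and related functions. It is based on the three-administrator management model, offers microservices, multi-tenancy, containerization, and support for domestic (Chinese) technology stacks, lets users quickly build their own business applications with a code generator, and can be linked with many mature, easy-to-use applications from its internal ecosystem.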
Official repo for the paper "HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation"
This repository records some simulation-implemented solutions, mainly covering areas such as post-quantum cryptography, zero-knowledge proofs, and privacy-preserving protocols.
A trustworthy face data secure protection research platform developed by the Chongqing University of Posts and Telecommunications (CQUPT).
Qihoo360 / 360-LLaMA-Factory
Forked from hiyouga/LLaMA-Factory. Adds Sequence Parallelism into LLaMA-Factory
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
Expand the MinCloud development ecosystem library
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
Bounty Board is a decentralized platform designed to streamline Web3 community activities.
Implementation of RingLink network core functions
Zotero chat PDF with AI, DeepSeek, GPT 4.1, ChatGPT, Claude, Gemini
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
The first open autoregressive foundational video AI model.
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation