Stars
Liquid: Language Models are Scalable and Unified Multi-modal Generators
A collection of multimodal reasoning papers, code, datasets, benchmarks and resources.
SVG Differentiable Rendering: Generating vector graphics using neural networks. Supports text-to-SVG, image-to-SVG, and SVG editing.
Babel - Open Multilingual Large Language Models Serving Over 90% of Global Speakers
Residual Kolmogorov-Arnold Network (RKAN) is designed to enhance the performance of classic CNNs by incorporating RKAN blocks into existing architectures.
GENERanno: A Unified Genomic Foundation Model with Specialization in Gene Annotation
A public good tool to help users verify Safe (Gnosis Safe) transactions before signing or execution.
MMDepth: Comprehensive MMEngine-based Framework for Monocular, Stereo & Multi-view Depth Estimation
✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy
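An extremely easy-to-use uniapp development framework: uni-plus is a cross-platform quick-start template powered by Uniapp + Vue3 + TS + Vite + Pinia + Unocss + WotUi. Developed with VS Code, it offers rich code hints, error checking, type hints, pre-installed plugins, and code snippets, plus a wide range of examples such as echarts charts, paginated forms, permission control, and API request optimization (setup tutorial included).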
The official Soundwave repository
GENERator: A Long-Context Generative Genomic Foundation Model
APOLLUMIA is an ERC-20 token implemented on the Ethereum blockchain. It incorporates transaction tax mechanisms, anti-bot protections, and integrates with Uniswap for decentralized trading.
An intelligent development and testing platform designed to empower small and medium-sized enterprises to build their own R&D systems, streamline workflows, and enhance operational efficiency.
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
PyTorch library for relational table learning with LLMs.
Dataset introduced in "A Benchmark and Frequency Compression Method for Infrared Few-Shot Object Detection"
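FIT: an enterprise-grade AI development framework providing a multi-language function engine (FIT), a streaming orchestration engine (WaterFlow), and a LangChain alternative for the Java ecosystem (FEL). It runs in native or Spring mode, supports hot-swappable plugins and intelligent aggregated/distributed deployment, and seamlessly unifies large models with business systems.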
Flame is an open-source multimodal AI system designed to translate UI design mockups into high-quality React code. It leverages vision-language modeling, automated data synthesis, and structured tr…
Performant image component for React Native
[ICLR 2025] Improving Data Efficiency via Curating LLM-Driven Rating Systems