Stars
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A mini, open-weights, version of our Proxy assistant.
This repository presents UR-FUNNY dataset: first dataset for multimodal humor detection
General technology for enabling AI capabilities w/ LLMs and MLLMs
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Convert PDF to markdown + JSON quickly with high accuracy
[ICCV 2023] Lighting Every Darkness in Two Pairs: A Calibration-Free Pipeline for RAW Denoising && [Arxiv 2023] Make Explicit Calibration Implicit: Calibrate Denoiser Instead of the Noise Model
算法面试必备,推荐刷题网站www.lintcode.com。北大学霸的《LeetCode刷题模板》+V领取: jiuzhangfeifei
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
Flask-based web application designed to compare text and image embeddings using the CLIP model.
[CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs.
[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct
Search google, bing, yahoo, and other search engines with python
A simple way to view search results from the search engines (Google, Bing, AOL, etc.)
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…