Stars
A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Beginners book on Python - start here if you don't know programming
TinyFusion: Diffusion Transformers Learned Shallow
MMSA is a unified framework for Multimodal Sentiment Analysis.
The code of multi-attention deepfake detection
Official Code for "Structured Kernel Estimation for Photon-Limited Deconvolution" (CVPR 2023)
This is a repository for learning how to conduct academic research.
MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis
📽 Benchmark datasets for Entity Resolution on Knowledge Graphs
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
12 Weeks, 24 Lessons, AI for All!
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
微信机器人底层框架,可接入Gemini、ChatGPT、ChatGLM、讯飞星火、Tigerbot等大模型。WeChat Robot Hook.
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
A review of papers proposing novel GNN methods with application to brain connectivity published in 2017-2020.
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Open-Sora: Democratizing Efficient Video Production for All
Python sample codes for robotics algorithms.
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
[CSUR] A Survey on Video Diffusion Models
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.