Stars
A professional list on Large (Language) Models and Foundation Models (LLM, LM, FM) for Time Series, Spatiotemporal, and Event Data.
Takagi and Nishimoto, CVPR 2023
Extracting spatial and temporal world models from LLMs
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Retrieval Augmented Generation (RAG) on audio data with LangChain
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek3, ...) and 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…
[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering
[KDD'2024] "UrbanGPT: Spatio-Temporal Large Language Models"
Datasets, Transforms and Models specific to Computer Vision
This is the repository for the Tool Learning survey.
Open-Sora: Democratizing Efficient Video Production for All
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
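Li Bai 👤 was an outstanding poet of the Tang dynasty, and his poetry holds an important place in the history of Chinese literature. In recent years, with the rapid development of digital technology and artificial intelligence, the ways traditional culture is popularized and promoted have also faced innovation and change. Although research on Li Bai's poetry in China and abroad is already quite deep, digital and intelligent popularization still falls short. This project therefore aims to build a Li Bai knowledge graph and combine it with large models to train a specialized AI agent, promoting Li Bai's culture in the form of a generative dialogue application.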
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
DeepSeek-VL: Towards Real-World Vision-Language Understanding
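Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other codebases, PDF/LaTeX paper translation and summarization, parallel querying of multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…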
Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models
Large Language Models as Human Mobility Predictors
Official code for "Expression is enough: Improving traffic signal control with advanced traffic state representation".