Skip to content
View OutBreak-hui's full-sized avatar
😀
😀

Block or report OutBreak-hui

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DataComp for Language Models

HTML 1,247 115 Updated Dec 11, 2024

A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.

Python 131 14 Updated Mar 2, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 14,175 1,559 Updated Feb 23, 2025

校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。

C++ 297 73 Updated Jan 13, 2025

使用cuda实现类似sahi库的切图

Cuda 6 Updated Feb 8, 2025

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

3,416 273 Updated Nov 1, 2024

使用Nanodet+YoloV8-Pose实现指针仪表的实时检测、高精度读数识别(借助ncnn框架)

C++ 70 7 Updated Oct 31, 2024

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero

Python 18,052 1,476 Updated Mar 4, 2025

[Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 248 10 Updated Dec 22, 2024

[Information Fusion 2025] A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective

319 26 Updated Feb 27, 2025

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 1,455 161 Updated Feb 23, 2025

收录NLP竞赛策略实现、各任务baseline、相关竞赛经验贴(当前赛事、往期赛事、训练赛)、NLP会议时间、常用自媒体、GPU推荐等,持续更新中

Python 2,203 254 Updated Aug 29, 2023

Implementation of "PaLM-E: An Embodied Multimodal Language Model"

Python 286 46 Updated Jan 29, 2024

Awesome papers & datasets specifically focused on long-term videos.

251 12 Updated Nov 15, 2024

全球最小的桌面级双轮腿机器人!

C++ 1,226 232 Updated Dec 12, 2024

Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool works locally and aims to create inference chains akin to thos…

Python 29 2 Updated Sep 25, 2024

一个适合学习、使用、自主扩展的RAG【检索增强生成】系统!可联网做AI搜索

Python 455 41 Updated Sep 4, 2024

音视频(H264/H265/AAC)封装、解封装、编解码pipeline,支持NVIDIA、昇腾DVPP硬编解码

C++ 35 8 Updated Dec 16, 2024

real time face swap and one-click video deepfake with only a single image

Python 44,392 6,536 Updated Feb 19, 2025

本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/

Jupyter Notebook 6,811 783 Updated Feb 25, 2025

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,127 163 Updated Feb 13, 2025

A simple, easy-to-hack GraphRAG implementation

Python 2,512 243 Updated Jan 15, 2025

A lightweight flexible Video-MLLM developed by TencentQQ Multimedia Research Team.

Python 68 3 Updated Oct 14, 2024
C++ 75 12 Updated Aug 1, 2024

Fast Multimodal LLM on Mobile Devices

C++ 721 84 Updated Mar 3, 2025

Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 5,967 512 Updated Mar 4, 2025

基于PaddleOCR重构,并且脱离PaddlePaddle深度学习训练框架的轻量级OCR,推理速度超快 —— A lightweight OCR system based on PaddleOCR, decoupled from the PaddlePaddle deep learning training framework, with ultra-fast inference speed.

Python 817 77 Updated Jan 21, 2025

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

Python 14,454 1,290 Updated Sep 5, 2024

TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)

Python 178 13 Updated Nov 17, 2023

大模型基础: 一文了解大模型基础知识

4,121 369 Updated Feb 24, 2025
Next