Skip to content
View GioGioBond's full-sized avatar

Block or report GioGioBond

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

基于python的12306定时抢票脚本

Python 13 3 Updated Dec 30, 2024

深度学习练习本

Jupyter Notebook 104 5 Updated Jan 10, 2025

Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

1,057 45 Updated Jul 31, 2024

Transformer是谷歌在17年发表的Attention Is All You Need 中使用的模型,经过这些年的大量的工业使用和论文验证,在深度学习领域已经占据重要地位。Bert就是从Transformer中衍生出来的语言模型。我会以中文翻译英文为例,来解释Transformer输入到输出整个流程。

Python 219 64 Updated Apr 24, 2024

LlaMA3-SFT, Meta-Llama-3-8B/Meta-Llama-3-8B-Instruct微调(transformers)/LORA(peft)/推理, 支持中文(chinese, zh)

Python 27 6 Updated May 17, 2024

基于知识图谱的政务问答系统

Python 2 1 Updated May 30, 2021

使用Qwen1.5-0.5B-Chat模型进行通用信息抽取任务的微调,旨在: 验证生成式方法相较于抽取式NER的效果; 为新手提供简易的模型微调流程,尽量减少代码量; 大模型训练的数据格式处理。

Python 10 Updated Sep 6, 2024

该项目主要是为了识别图片里面的表格数据,并将表格数据抽取处理,导出成csv的文件。整个项目会使用streamlit进行部署和展示。使用的技术:paddleocr,PPStructure,streamlit

Python 35 4 Updated Oct 17, 2022

Minutes GPT is a GPT tool that helps you quickly turn meeting recordings into minutes. Minutes GPT 是一个帮助你快速将会议录音转化为会议纪要的 GPT 工具

Python 15 2 Updated Nov 20, 2023

A small open source 3D agent simulator based on LLM.

Python 54 8 Updated Dec 1, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 14,072 1,150 Updated May 23, 2024

专为新手设计!Chatbot Trainer 是一个基于开源语言模型(GLM4)的聊天机器人训练项目。你可以轻松训练出一个拥有你自己语气性格的聊天机器人,或训练任何你感兴趣的人物,包括名人、历史人物、动漫角色或电影小说中的虚拟人物。 通过项目内置的数据集问答对制作指导,你可以快速训练出一个拥有高度仿真对话能力的聊天机器人。采用先进的机器学习算法,确保训练过程简单、高效,并生成自然流畅的对话。

Python 35 6 Updated Dec 13, 2024

🚀🚀 「大模型」3小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 3 hours!

Python 6,886 725 Updated Dec 13, 2024

Train a 1B LLM with 1T tokens from scratch by personal

Python 481 55 Updated Jan 12, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

17,876 1,716 Updated Sep 19, 2024

中文nlp解决方案(大模型、数据、模型、训练、推理)

Jupyter Notebook 3,162 383 Updated Jan 2, 2025

🌈Bilibili_video_download-B站视频下载

Python 3,176 750 Updated May 22, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 34,854 5,321 Updated Jan 26, 2025

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 10,869 847 Updated Aug 20, 2024

🚀 「大模型」3小时从0训练27M参数的视觉多模态VLM!🌏 Train a 27M-parameter VLM from scratch in just 3 hours!

Python 790 80 Updated Dec 13, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,407 236 Updated Jan 25, 2025

基于大语言模型的检索增强生成RAG示例

Python 116 17 Updated Dec 7, 2024

中文纠错-使用拼音树及编辑距离

Java 13 3 Updated Jul 19, 2019

Composable building blocks to build Llama Apps

Python 6,521 803 Updated Jan 26, 2025

A pipeline for LLM knowledge distillation

Python 84 10 Updated Jan 25, 2025

Tools for merging pretrained large language models.

Python 5,150 481 Updated Jan 25, 2025

An Open Source Toolkit For LLM Distillation

Python 439 44 Updated Jan 7, 2025

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

755 43 Updated Oct 22, 2024

多任务知识蒸馏,模型太大了没有上传,可自行下载放在相关目录下

Python 5 Updated May 21, 2023

Nano-BERT is a straightforward, lightweight and comprehensible custom implementation of BERT, inspired by the foundational "Attention is All You Need" paper. The primary objective of this project i…

Jupyter Notebook 14 2 Updated Oct 19, 2023
Next