Skip to content
View Virgil-L's full-sized avatar

Highlights

  • Pro

Block or report Virgil-L

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

深度学习与推荐系统学习,理论结合代码更香。

Jupyter Notebook 112 17 Updated Jul 30, 2022

整理一些书籍 ,包含 C&C++ 、git 、Java、Keras 、Linux 、NLP 、Python 、Scala 、TensorFlow 、大数据 、推荐系统、数据库、数据挖掘 、机器学习 、深度学习 、算法等。

901 287 Updated Sep 1, 2019

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 26,295 2,010 Updated Feb 22, 2025

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 6,798 456 Updated Jan 3, 2025

Using GPT to parse PDF

Python 3,249 233 Updated Aug 7, 2024

🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.

Python 527 32 Updated Jan 27, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,593 1,330 Updated Feb 21, 2025

LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案

6 11 Updated Nov 30, 2023

LLM Chatbot w/ Retrieval Augmented Generation using Llamaindex. It demonstrates how to impl. chunking, indexing, and source citation.

Python 41 6 Updated Oct 18, 2023

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。

Python 3,600 531 Updated Feb 20, 2025

personal chatgpt

Jupyter Notebook 337 62 Updated Dec 16, 2024

Example models using DeepSpeed

Python 6,304 1,067 Updated Feb 14, 2025

Recipes to train reward model for RLHF.

Python 1,186 84 Updated Feb 9, 2025

Apriori and fp-growth implement of python

Python 259 60 Updated Jan 2, 2020

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models

Jupyter Notebook 2,280 229 Updated Feb 6, 2024

[ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.

Python 126 9 Updated Jul 17, 2024

Multimodal Question Answering in the Medical Domain: A summary of Existing Datasets and Systems

273 33 Updated Oct 17, 2023

Towards Generalist Biomedical AI

Python 358 53 Updated Feb 17, 2024

Replication of the paper "Text Is All You Need: Learning Language Representations for Sequential Recommendation" on KDD'23.

Python 109 31 Updated Apr 23, 2024

使用BERT-BILSTM-CRF进行中文命名实体识别。

Python 388 43 Updated Jan 9, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 14,432 1,663 Updated Feb 12, 2025

📄 适合中文的简历模板收集(LaTeX,HTML/JS and so on)由 @hoochanlon 维护

5,042 412 Updated Oct 18, 2024
Python 22 1 Updated Aug 1, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 38,900 5,822 Updated Feb 22, 2025

Embed arbitrary modalities (images, audio, documents, etc) into large language models.

Python 178 13 Updated Mar 27, 2024

Emu Series: Generative Multimodal Models from BAAI

Python 1,683 86 Updated Sep 27, 2024

Python bindings for llama.cpp

Python 8,672 1,059 Updated Jan 29, 2025

✨✨Latest Advances on Multimodal Large Language Models

13,956 894 Updated Feb 22, 2025

A Collection of BM25 Algorithms in Python

Python 1,107 93 Updated Oct 8, 2024

Python PDF parser for scientific publications: content and figures

Python 393 62 Updated Mar 21, 2024
Next