- Stanford CS PhD
- Stanford, CA 94305
- https://cs.stanford.edu/~jiaxuan/
Stars
Fully open reproduction of DeepSeek-R1
veRL: Volcano Engine Reinforcement Learning for LLM
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
A platform for developers to simulate collaborative research activities
A curated list of resources for using LLMs to develop more competitive grant applications.
Training LLMs with QLoRA + FSDP
Video+code lecture on building nanoGPT from scratch
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
A watermarking tool to protect artworks from AIGC-driven style mimicry (e.g. LoRA)
Robust recipes to align language models with human and AI preferences
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
A list of all "all you need" papers. Updated daily using the arXiv API.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Transformer-related optimization, including BERT, GPT
LlamaIndex is the leading framework for building LLM-powered agents over your data.
🎓 Easily create a beautiful academic résumé or educational website using Hugo and GitHub. No code.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Ongoing research training transformer models at scale
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🦜🔗 Build context-aware reasoning applications
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
This project is deprecated. Check my new project ChatHub:
A browser extension that enhances search engines with ChatGPT
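Provides a practical interaction interface for GPT/GLM and other large language models (LLMs), specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…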