-
University of Michigan, Ann Arbor
- http://zhefanye.net
Lists (1)
Sort Name ascending (A-Z)
Stars
Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.
Recipes to train the self-rewarding reasoning LLMs.
World Model based Autonomous Driving Platform in CARLA 🚗
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
The official GitHub page for the survey paper "A Survey of Large Language Models".
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
Code and links for over 25,000 trained Atari agents
A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release that enables easy visualization and analysis of models, and co…
📈 Implementation of eight evaluation metrics to access the similarity between two images. The eight metrics are as follows: RMSE, PSNR, SSIM, ISSM, FSIM, SRE, SAM, and UIQ.
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
Massively Parallel Deep Reinforcement Learning. 🔥
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
FinRL®: Financial Reinforcement Learning. 🔥
PGDrive: an open-ended driving simulator with infinite scenes from procedural generation
Model summary in PyTorch similar to `model.summary()` in Keras
StudioGAN is a Pytorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
Python sample codes and textbook for robotics algorithms.
A Collection of Variational Autoencoders (VAE) in PyTorch.
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"
✅ Solutions to LeetCode by Go, 100% test coverage, runtime beats 100% / LeetCode 题解
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)