Cosmos-Reason1 models understand physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.

282 7 Updated Apr 2, 2025

Recipes to train self-rewarding reasoning LLMs.

Python 211 9 Updated Mar 2, 2025

World Model based Autonomous Driving Platform in CARLA 🚗

Python 210 32 Updated Mar 20, 2025

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Python 2,114 71 Updated Apr 10, 2025

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 11,375 881 Updated Mar 11, 2025

A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)

1,113 58 Updated Jan 4, 2024

Code and links for over 25,000 trained Atari agents

Python 94 10 Updated Aug 22, 2024

A binary release of deep reinforcement learning models trained on the Atari machine learning benchmark, and a software release that enables easy visualization and analysis of models, and co…

Jupyter Notebook 203 33 Updated May 25, 2020

📈 Implementation of eight evaluation metrics to assess the similarity between two images. The eight metrics are as follows: RMSE, PSNR, SSIM, ISSM, FSIM, SRE, SAM, and UIQ.

Python 612 70 Updated Aug 31, 2024

A programmer's guide to cooking at home (Simplified Chinese only).

Dockerfile 71,321 8,967 Updated Apr 15, 2025

The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

Python 6,120 662 Updated Apr 11, 2025

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Jupyter Notebook 825 49 Updated Aug 12, 2024

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 6,019 1,064 Updated Feb 24, 2025

Massively Parallel Deep Reinforcement Learning. 🔥

Python 3,992 895 Updated Mar 14, 2025

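代码随想录 LeetCode practice guide: a recommended order for working through 200 classic problems, 600,000 words of detailed illustrated explanations, video walkthroughs of the difficult points, more than 50 mind maps, and versions in C++, Java, Python, Go, JavaScript, and other languages, so learning algorithms is no longer bewildering! 🔥🔥 Take a look and you will wish you had found it sooner! 🚀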

Shell 55,515 11,948 Updated Apr 9, 2025

FinRL®: Financial Reinforcement Learning. 🔥

Jupyter Notebook 11,454 2,679 Updated Apr 11, 2025

PGDrive: an open-ended driving simulator with infinite scenes from procedural generation

Python 128 16 Updated Jun 20, 2022

Model summary in PyTorch similar to `model.summary()` in Keras

Python 4,037 415 Updated Mar 2, 2024

StudioGAN is a PyTorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.

Python 3,459 346 Updated Aug 9, 2024

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,787 162 Updated Mar 8, 2025

Python sample code and textbook for robotics algorithms.

Python 24,753 6,743 Updated Apr 15, 2025

starter from "How to Train a GAN?" at NIPS2016

11,548 1,666 Updated Jan 9, 2022

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 7,072 1,116 Updated Mar 21, 2025

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 29,302 3,479 Updated Apr 14, 2025

Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"

Python 83 4 Updated Dec 13, 2019

✅ Solutions to LeetCode in Go, 100% test coverage, runtime beats 100%

Go 33,462 5,774 Updated Dec 11, 2024

Practicing algorithms is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.

Markdown 127,581 23,351 Updated Jan 31, 2025

Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)

1,108 122 Updated Oct 13, 2017