
-
Jilin University
- Changchun Jilin
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.
Integrate the DeepSeek API into popular softwares
🦜🔗 Build context-aware reasoning applications
LangChain 的中文入门教程
[ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
We perform functional grounding of LLMs' knowledge in BabyAI-Text
动手学Ollama,CPU玩转大模型部署,在线阅读地址:https://datawhalechina.github.io/handy-ollama/
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Book PDF and simulation code for the monograph "Massive MIMO Networks: Spectral, Energy, and Hardware Efficiency" by Emil Björnson, Jakob Hoydis and Luca Sanguinetti, published in Foundations and T…
This is the code package related to the follow scientific article: Luca Sanguinetti, Alessio Zappone, Merouane Debbah 'Deep-Learning-Power-Allocation-in-Massive-MIMO' presented at the Asilomar Con…
PySDR.org textbook source material, feel free to post issues/PRs
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Simulation code for the paper: F. Luo and Y. Mao, "A Practical Max-Min Fair Resource Allocation Algorithm for Rate-Splitting Multiple Access," in IEEE Communications Letters, vol. 27, no. 12, pp. 3…
Simulation code for the best paper award paper of EURASIP JWCN 2022: Y. Mao, B. Clerckx, and V.O.K. Li, "Rate-splitting multiple access for downlink communication systems: bridging, generalizing, a…
RSMA relies on linearly precoded rate-splitting with SIC to decode part of the interference and treat the remaining part of the interference as noise.
This is the code for the paper "Deep Learning-Based Rate-Splitting Multiple Access for Reconfigurable Intelligent Surface-Aided Tera-Hertz Massive MIMO"
This is the official implementation of Multi-Agent PPO (MAPPO).
A Python implementation of (a derivative of) the National Residency Matching Program (NRMP) algorithm. Created for a Data Structures 2 assignment.
A python implementation of the nobel prize winning matching algorithm.
Alienware systems lights, fans, and power control tools and apps
A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience
📚 Introduction to Modern Statistics - A college-level open-source textbook with a modern approach highlighting multivariable relationships and simulation-based inference. For v1, see https://openin…