- The University of Texas at Austin
- Austin, TX, USA (UTC -06:00)
- zhenyu.gallery
- @KyriectionZhang
Lists (8)
- 🦾 Benchmarking: LLM Hospital
- 💎 Efficient ML: Prune & Sparse & Quantization & KD & NAS
- 🤖 General Topics: Architectures & Optimization & BlockChain & SSL & Speech & Recsys
- 💍 Large Language Models: Next Step of LLMs
- 🚀 My Stack: Open-source of Our Works
- 💁 Quantum ML: ML for Quantum & Quantum for ML
- 🗼 Toolbox: Visualization & Coding Tool
- 🚩 Trustworthy ML: OoD & Adversarial & Backdoor

Stars
- Training Large Language Model to Reason in a Continuous Latent Space
- Minimalistic 4D-parallelism distributed training framework for education purpose
- A library for mechanistic interpretability of GPT-style language models
- Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)
- A Telegram bot to recommend arXiv papers
- DSPy: The framework for programming—not prompting—language models
- [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
- Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓
- Code used in Novy-Marx and Velikov (2024), AI-Powered (Finance) Scholarship
- 🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch
- Recipes to scale inference-time compute of open models
- My learning notes/codes for ML SYS.
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch
- A library for advanced large language model reasoning
- Optimisers.jl defines many standard optimisers and utilities for learning loops.
- Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead
- APOLLO: SGD-like Memory, AdamW-level Performance
- Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure (NeurIPS 2024) + Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count (a…
- Stochastic Automatic Differentiation library for PyTorch.