Skip to content
View mathematicallfs's full-sized avatar

Highlights

  • Pro

Block or report mathematicallfs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25).

Python 8,333 647 Updated Dec 27, 2024

A MAD laboratory to improve AI architecture designs 🧪

Python 107 12 Updated Dec 17, 2024

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

Python 73 4 Updated Oct 16, 2024

Some preliminary explorations of Mamba's context scaling.

Python 213 11 Updated Feb 8, 2024

official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233 [NeurIPS 2024]

Python 16 3 Updated Aug 29, 2024

GPT-2 (124M) quality in 5B tokens

Python 1 Updated Jun 6, 2024

Offical implementation of IJCAI 2024 paper "Cross-Domain Feature Augmentation for Domain Generalization"

Python 12 1 Updated Aug 20, 2024

LLM training in simple, raw C/CUDA

Cuda 26,004 2,982 Updated Oct 2, 2024

[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

Python 89 5 Updated Aug 5, 2024

Collection of papers on state-space models

583 20 Updated Mar 2, 2025

Understand and test language model architectures on synthetic tasks.

Python 183 29 Updated Mar 6, 2025

A simple and efficient Mamba implementation in pure PyTorch and MLX.

Python 1,151 102 Updated Dec 4, 2024

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

C++ 304 28 Updated Dec 28, 2024

Codes for "MixupE: Understanding and Improving Mixup from Directional Derivative Perspective" UAI 2023 Oral

Python 28 Updated Aug 30, 2023

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Python 4,303 387 Updated Oct 25, 2023

Official code for ''From Optimization Dynamics to Generalization Bounds via Łojasiewicz Gradient Inequality'' (TMLR)

Python 5 Updated Oct 5, 2022

Official code for "In Search of Robust Measures of Generalization" (NeurIPS 2020)

Python 28 4 Updated Dec 22, 2020

A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

JavaScript 37,975 4,673 Updated Mar 5, 2025

可视化Bilibili本地视频XML弹幕转换ASS字幕转换器

Python 177 5 Updated Jan 2, 2024

A library for users to write (experiment in research) configurations in Python Dict or JSON format, read and write parameter value via dot . in code, while can read parameters from the command line…

Python 2,045 277 Updated Aug 22, 2024

Training-free data valuation on deep neural network applications. (ICML-2022)

Python 24 Updated Jul 13, 2022

RobustBench: a standardized adversarial robustness benchmark [NeurIPS 2021 Benchmarks and Datasets Track]

Python 698 99 Updated Feb 5, 2025