
Anthropic's educational courses

Jupyter Notebook 8,522 679 Updated Nov 26, 2024

Scaling Diffusion Transformers with Mixture of Experts

Python 229 10 Updated Sep 9, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 14,065 1,430 Updated Dec 23, 2024

Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in PyTorch

Python 632 53 Updated Dec 27, 2024

Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)

Jupyter Notebook 712 62 Updated Jan 26, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 38,247 6,152 Updated Dec 9, 2024

LLM inference in C/C++

C++ 70,096 10,119 Updated Jan 2, 2025

Inference Llama 2 in one file of pure C

C 17,751 2,133 Updated Aug 6, 2024

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 41,423 5,392 Updated Jan 2, 2025

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 46,338 5,513 Updated Dec 18, 2024

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,136 827 Updated Jun 10, 2024

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,789 377 Updated Mar 14, 2024

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,016 517 Updated Sep 6, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,739 2,223 Updated Jul 29, 2024

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Python 9,308 725 Updated Aug 5, 2024
Python 61 5 Updated Jun 16, 2023
Python 319 45 Updated Oct 9, 2023

📺 Discover the latest machine learning / AI courses on YouTube.

16,108 1,926 Updated Jan 22, 2024

A lightweight C++ library for recursive bilateral filtering [Yang, Qingxiong. "Recursive bilateral filtering". European Conference on Computer Vision, 2012].

C++ 352 57 Updated Dec 4, 2021

A latent text-to-image diffusion model

Jupyter Notebook 68,986 10,234 Updated Jun 18, 2024

Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2

Python 2,853 554 Updated Dec 4, 2024

Collection of papers and resources for data augmentation for NLP.

830 78 Updated Aug 12, 2022

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 26,940 5,530 Updated Jan 2, 2025

Denoising Diffusion Probabilistic Models

Python 4,015 383 Updated Aug 29, 2023

Implementation of Denoising Diffusion Probabilistic Model in PyTorch

Python 8,638 1,063 Updated Oct 9, 2024

Diffusion-LM

Python 1,071 140 Updated Aug 8, 2024

A PyTorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

Python 663 50 Updated Sep 13, 2023

This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning"

Python 193 22 Updated Jan 11, 2023

PyTorch code for MUST

Python 106 12 Updated Mar 8, 2023

Reading list for research topics in Masked Image Modeling

332 31 Updated Dec 3, 2024