- Vienna, Austria
- https://www.linkedin.com/in/cahyawirawan/
- @CahyaWr
Lists (3)
Sort Name ascending (A-Z)
Stars
A latent text-to-image diffusion model
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
🔊 Text-Prompted Generative Audio Model
The fastai book, published as Jupyter Notebooks
A guidance language for controlling large language models.
Instruct-tune LLaMA on consumer hardware
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
This repository contains the source code for the paper First Order Motion Model for Image Animation
Natural Language Processing Tutorial for Deep Learning Researchers
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
This repository contains the Hugging Face Agents Course.
A multi-voice TTS system trained with an emphasis on quality
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
High-Resolution Image Synthesis with Latent Diffusion Models
QLoRA: Efficient Finetuning of Quantized LLMs
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
An annotated implementation of the Transformer paper.
A collection of infrastructure and tools for research in neural network interpretability.
PyTorch implementation of AnimeGANv2
Jupyter notebooks for the Natural Language Processing with Transformers book
An Open Source text-to-speech system built by inverting Whisper.
Materials for the Hugging Face Diffusion Models Course