- Melbourne
- www.xinliang.co
Stars
🦜🔗 Build context-aware reasoning applications
A latent text-to-image diffusion model
Google Research
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A game theoretic approach to explain the output of any machine learning model.
llama3 implementation one matrix multiplication at a time
FinRL: Financial Reinforcement Learning. 🔥
LAVIS - A One-stop Library for Language-Vision Intelligence
Code samples used on cloud.google.com
From the basics to slightly more interesting applications of Tensorflow
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Repo for the Deep Reinforcement Learning Nanodegree program
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
For trading. Please star.
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Tensorflow implementation of Fully Convolutional Networks for Semantic Segmentation (http://fcn.berkeleyvision.org)
Inference Code for Polygon-RNN++ (CVPR 2018)
Enabling journalists, citizen scientists, humanitarian workers and others to detect “patterns of interest” in satellite imagery.
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".