LLM Models
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Fine-tune a large language model on your own iMessages
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
Dockerized LLM inference server with constrained output (JSON mode), built on top of vLLM and outlines. Faster, cheaper and without rate limits. Compare the quality and latency to your current LLM …
LLM training code for Databricks foundation models
Official inference library for Mistral models
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU su…
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Convert Machine Learning Code Between Frameworks
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
A simple chat application that uses managed identity for Azure OpenAI access. Designed for deployment on Azure Container Apps with the Azure Developer CLI.
OpenUI let's you describe UI using your imagination, then see it rendered live.
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
Code examples and resources for DBRX, a large language model developed by Databricks
GGUF Quantization of any LLM.
Phi-3 Small Language Models Edge Samples Explore samples of Phi-3 Small Language Models on edge devices like NVIDIA Jetson NX, Xavier, Orin Nano, Orin NX, and Intel devices with ONNX+DirectML and O…
Survey: A collection of AWESOME papers and resources on the large language model (LLM) related recommender system topics.
An LLM GUI application; enables you to interact with your files, offering dynamic parameters that can modify response behavior during runtime.
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.
This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small langua…