
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Official implementation of FaceXFormer: A Unified Transformer for Facial Analysis
10 Lessons to Get Started Building AI Agents
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.
Code release for ConvNeXt V2 model
Deepfake Video Detection Using Generative Convolutional Vision Transformer
[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI
Unified automatic quality assessment for speech, music, and sound.
👋 Xplique is a Neural Networks Explainability Toolbox
Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.
Unofficial implementation of ECCV20 paper "Thinking in frequency: Face forgery detection by mining frequency-aware clues"
Pytorch implementation of F3Net (ECCV 2020 F3Net: Frequency in Face Forgery Network)
Frontier Multimodal Foundation Models for Image and Video Understanding
Composable building blocks to build Llama Apps
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
A virtual lab of LLM agents for science research
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel AI SDK! Search with models like Grok 2.0.
Real-Time Deepfake Detection in the Real-World
Awesome LLM Books: Curated list of books on Large Language Models
Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]
Large-Scale Multimodal Dataset of Astronomical Data
Kickstart your LLMOps initiative with a flexible, robust, and productive Python package.
A generative world for general-purpose robotics & embodied AI learning.
An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.5 Sonnet API and Firecrawl. While currently configured for e…
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding