Stars
ML Assistant for Competitive Machine Learning
[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Knowledge Graph Embeddings (KGE) for RAG-LLMs. Our goal was to compare the mathematical differences between Traditional Static Multimodal Vector Embeddings (TVE) from Word2Vec and CLIP encoders fo…
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳 Docker-friendly. ⚡ Always in sync with SharePoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs,…
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Code to generate NeuralExecs (prompt injection for LLMs)
Whistleblower is an offensive security tool for testing against system prompt leakage and capability discovery of an AI application exposed through an API. Built for AI engineers, security researchers …
A collection of deep learning based RGB-T-Fusion methods, codes, and datasets. The main directions involved are Multispectral Pedestrian Detection, RGB-T Aerial Object Detection, RGB-T Semantic Seg…
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS@2023 Spotlight, TPAMI@2024)