Skip to content
View moonbings's full-sized avatar
🐱
🐱

Block or report moonbings

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official PyTorch implementation of Extract Free Dense Misalignment from CLIP (AAAI'25)

Python 13 Updated Jan 24, 2025

[NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

Python 93 11 Updated Dec 25, 2024

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 6,009 425 Updated Feb 1, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 3,511 198 Updated Jan 29, 2025

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,170 164 Updated Jan 30, 2025

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,178 591 Updated Apr 16, 2024
Jupyter Notebook 36 2 Updated May 21, 2024

[ECCV2024] Official implementation of paper, "DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs".

Python 139 5 Updated Aug 8, 2024

The Universe of Data. All about data, data science, and data engineering

Python 530 52 Updated Jul 18, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,251 2,340 Updated Aug 12, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,369 413 Updated Aug 7, 2024

Official Implementation of SCOB [ICCV 2023]

Python 22 Updated Nov 16, 2023

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,126 5,972 Updated Feb 1, 2025

A natural language interface for computers

Python 58,106 4,985 Updated Jan 24, 2025
Python 73 10 Updated Aug 7, 2023

Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023

Python 44 Updated Jun 11, 2024

extract text from any document. no muss. no fuss.

HTML 3,963 614 Updated Dec 2, 2024

Open source Python library for converting PDF to DOCX.

Python 2,727 393 Updated Sep 23, 2024

Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

Python 14,073 1,390 Updated Jan 31, 2025

Official repo for MM-REACT

Python 941 70 Updated Jan 31, 2024

My collection of machine learning papers

275 22 Updated Aug 10, 2023

Chromium running inside your terminal

Rust 14,906 294 Updated Jul 1, 2024

🕸️ Web apps in pure Python 🐍

Python 21,521 1,242 Updated Feb 1, 2025

A user-friendly plug-in that makes it easy to generate stable diffusion images inside Photoshop using either Automatic or ComfyUI as a backend.

TypeScript 6,921 515 Updated Apr 22, 2024

Paint by Example: Exemplar-based Image Editing with Diffusion Models

Python 1,148 100 Updated Nov 28, 2023

Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023

Python 104 6 Updated Oct 24, 2023

Learn how to design systems at scale and prepare for system design interviews

34,233 3,870 Updated Apr 10, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 75,404 9,017 Updated Jan 4, 2025

The official code of CornerTransformer (ECCV 2022, Oral) on top of MMOCR.

Python 138 15 Updated Mar 6, 2023

Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes

Python 375 46 Updated Sep 17, 2024
Next