Skip to content
View Syazvinski's full-sized avatar
💻
Coding
💻
Coding

Highlights

  • Pro

Block or report Syazvinski

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 1,164 60 Updated Mar 6, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 378 24 Updated Mar 4, 2025

Witness the aha moment of VLM with less than $3.

Python 3,069 241 Updated Mar 1, 2025

Minimal hackable GRPO implementation

Python 165 20 Updated Jan 31, 2025

Solve Visual Understanding with Reinforced VLMs

Python 3,884 240 Updated Mar 7, 2025

A jounery to real multimodel R1 ! We are doing on large-scale experiment

Python 252 3 Updated Feb 12, 2025

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,427 188 Updated Mar 6, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,997 1,403 Updated Feb 1, 2025

Code for the paper - TransURL

Python 2 1 Updated Apr 15, 2024

Code for the Molmo Vision-Language Model

Python 315 20 Updated Dec 12, 2024

An open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.

Python 397 47 Updated Mar 5, 2025

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 25,807 3,262 Updated Sep 24, 2024

visionOS 30 days challenge.

Swift 2,176 177 Updated Nov 15, 2024

Unofficial API for YouTube Music

Python 1,918 219 Updated Mar 5, 2025

Cast Mac windows to visionOS

Swift 860 43 Updated Feb 26, 2025

Extends Selenium's Python bindings to give you the ability to inspect requests made by the browser.

Python 1,944 267 Updated Jan 3, 2024

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,404 427 Updated May 29, 2024

A framework to enable multimodal models to operate a computer.

Python 9,391 1,270 Updated Feb 28, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,713 2,384 Updated Aug 12, 2024

Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"

Python 4,739 659 Updated Aug 23, 2024

Ultralytics YOLO11 🚀

Python 37,530 7,296 Updated Mar 6, 2025

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Jupyter Notebook 13,649 4,280 Updated Aug 19, 2024

Utilize a Raspberry Pi and a Nuand BladeRF to generate your own portable local cell network

Python 406 99 Updated Jun 14, 2019

Powerful, fast and robust engine for converting 3D models into g-code instructions for 3D printers. It is part of the larger open source project Cura.

C++ 1,721 892 Updated Mar 6, 2025

A pluggable Django app that enables login/signup via an Ethereum wallet (a la CryptoKitties)

Python 90 40 Updated Aug 11, 2023

The project where literally anything* goes.

Ruby 1,964 595 Updated Jan 2, 2025