Skip to content
View Syazvinski's full-sized avatar
💻
Coding
💻
Coding

Highlights

  • Pro

Block or report Syazvinski

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Successor of Undetected-Chromedriver. Providing a blazing fast framework for web automation, webscraping, bots and any other creative ideas which are normally hindered by annoying anti bot systems …

Python 2,449 247 Updated May 16, 2025

A multi-thread crawler framework with many builtin image crawlers provided.

Python 884 179 Updated Mar 13, 2025

Jobs scraper library for LinkedIn, Indeed, Glassdoor, Google, Bayt, & Naukri

Python 1,612 325 Updated Apr 10, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,413 172 Updated May 21, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 755 46 Updated May 14, 2025

Witness the aha moment of VLM with less than $3.

Python 3,684 284 Updated May 19, 2025

Minimal hackable GRPO implementation

Python 226 30 Updated Jan 31, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,977 309 Updated May 11, 2025

A jounery to real multimodel R1 ! We are doing on large-scale experiment

Python 304 8 Updated May 15, 2025

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,563 205 Updated May 19, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,792 1,487 Updated Apr 24, 2025

Code for the paper - TransURL

Python 5 1 Updated Apr 15, 2024

Code for the Molmo Vision-Language Model

Python 426 37 Updated Dec 12, 2024

An open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.

Python 743 92 Updated May 22, 2025

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 26,706 3,335 Updated Sep 24, 2024

visionOS 30 days challenge.

Swift 2,200 178 Updated Nov 15, 2024

Unofficial API for YouTube Music

Python 2,028 231 Updated May 22, 2025

Cast Mac windows to visionOS

Swift 865 43 Updated Feb 26, 2025

Extends Selenium's Python bindings to give you the ability to inspect requests made by the browser.

Python 1,950 268 Updated Jan 3, 2024

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,556 428 Updated May 29, 2024

A framework to enable multimodal models to operate a computer.

Python 9,665 1,305 Updated May 13, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 22,586 2,491 Updated Aug 12, 2024

Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"

Python 4,920 673 Updated Aug 23, 2024

Ultralytics YOLO11 🚀

Python 41,140 7,945 Updated May 22, 2025

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Jupyter Notebook 13,811 4,350 Updated Aug 19, 2024

Utilize a Raspberry Pi and a Nuand BladeRF to generate your own portable local cell network

Python 411 99 Updated Jun 14, 2019

Powerful, fast and robust engine for converting 3D models into g-code instructions for 3D printers. It is part of the larger open source project Cura.

C++ 1,745 893 Updated May 22, 2025

A pluggable Django app that enables login/signup via an Ethereum wallet (a la CryptoKitties)

Python 91 39 Updated Aug 11, 2023

The project where literally anything* goes.

Ruby 1,964 595 Updated Jan 2, 2025