Skip to content
View vic4code's full-sized avatar
:octocat:
:octocat:

Block or report vic4code

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

AWS-native chatbot using Bedrock + Claude (+Nova and Mistral)

TypeScript 1,065 391 Updated Mar 21, 2025

Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’

Python 1,356 62 Updated Mar 19, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 25,631 2,462 Updated Mar 20, 2025

YOLOE: Real-Time Seeing Anything

Python 818 66 Updated Mar 21, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 12,240 1,317 Updated Mar 21, 2025

Databricks SDK for Python (Beta)

Python 412 135 Updated Mar 21, 2025

实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果

Python 346 45 Updated Dec 31, 2024

Converts text to speech in realtime

Python 2,720 260 Updated Mar 19, 2025

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 6,413 519 Updated Mar 10, 2025

A Conversational Speech Generation Model

Python 10,751 794 Updated Mar 20, 2025

The official Python library for the OpenAI API

Python 25,982 3,740 Updated Mar 21, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 38,477 6,321 Updated Mar 21, 2025

Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 35,414 2,715 Updated Mar 19, 2025

Tools for merging pretrained large language models.

Python 5,455 519 Updated Mar 20, 2025

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 84,466 12,475 Updated Mar 21, 2025

Build resilient language agents as graphs.

Python 10,458 1,735 Updated Mar 21, 2025

Systematic evaluation framework that automatically rates overthinking behavior in large language models.

Shell 79 10 Updated Feb 22, 2025

Official Repo for Open-Reasoner-Zero

Python 1,659 78 Updated Mar 5, 2025

Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。

Python 1,687 193 Updated Jan 16, 2025

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,225 278 Updated Nov 5, 2024

Solve Visual Understanding with Reinforced VLMs

Python 4,256 264 Updated Mar 20, 2025

[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents

Python 1,466 100 Updated Mar 21, 2025

Run AI Agent in your browser.

Python 9,829 1,612 Updated Mar 19, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,911 2,403 Updated Aug 12, 2024
Python 125 20 Updated Feb 25, 2025

DeepSeek Coder: Let the Code Write Itself

Python 21,086 2,369 Updated May 21, 2024

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 13,021 881 Updated Mar 20, 2025

React app for inspecting, building and debugging with the Realtime API

JavaScript 3,036 1,136 Updated Mar 11, 2025

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Python 26,510 3,362 Updated Dec 30, 2024

s1: Simple test-time scaling

Python 6,026 704 Updated Mar 6, 2025
Next