randomradio

🎯

Focusing

i.an randomradio

🎯

Focusing

Practical/Minimal/Human

23 followers · 74 following

chenyangzhao.com

Achievements

Lists (1)

Sort

🚀 My stack

Starred repositories

peteanderson80 / SPICE

Semantic Propositional Image Caption Evaluation

Java 140 31 Updated Feb 2, 2023

richardaecn / cvpr18-caption-eval

Learning to Evaluate Image Captioning. CVPR 2018

Python 84 11 Updated Jun 15, 2018

pyushkevich / itksnap

ITK-SNAP medical image segmentation tool

C++ 321 92 Updated Dec 18, 2024

meirwah / awesome-workflow-engines

A curated list of awesome open source workflow engines

6,551 641 Updated Nov 16, 2024

Evil0ctal / Fast-Powerful-Whisper-AI-Services-API

⚡ 一款用于自动语音识别 (ASR)、翻译的高性能异步 API。不需要购买Whisper API，使用本地运行的Whisper模型进行推理，并支持多GPU并发，针对分布式部署进行设计。还内置了包括TikTok、抖音等社交媒体平台的爬虫，可实现来自多个社交平台的无缝媒体处理，为媒体内容数据自动化处理提供了强大且可扩展的解决方案。

Python 271 28 Updated Dec 18, 2024

WLiK / LLM4Rec-Awesome-Papers

A list of awesome papers and resources of recommender system on large language model (LLM).

1,496 126 Updated Aug 15, 2024

AnyISalIn / 1password-to-apple-passwords

44 4 Updated Nov 16, 2024

illuin-tech / vidore-benchmark

Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.

Python 157 16 Updated Dec 17, 2024

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 36,197 4,561 Updated Nov 18, 2024

optuna / optuna

A hyperparameter optimization framework

Python 11,144 1,050 Updated Dec 26, 2024

pengyu965 / ChartDete

Context-Aware Chart Element Detection

Python 31 5 Updated Sep 2, 2023

pengyu965 / Awesome-Chart-Understanding

Forked from khuangaf/Awesome-Chart-Understanding

A curated list of recent and past chart understanding work based on our survey paper: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models.

1 Updated Aug 8, 2024

natowi / 3D-Reconstruction-with-Deep-Learning-Methods

List of projects for 3d reconstruction

952 124 Updated May 7, 2023

jlowin / fastmcp

The fast, Pythonic way to build Model Context Protocol servers 🚀

Python 711 35 Updated Dec 9, 2024

abus-aikorea / voice-pro

Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube dow…

Python 2,410 181 Updated Dec 22, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 27,645 3,157 Updated Aug 12, 2024

KellerJordan / modded-nanogpt

NanoGPT (124M) in 5 minutes

Python 1,969 183 Updated Dec 17, 2024

Babelscape / rebel

REBEL is a seq2seq model that simplifies Relation Extraction (EMNLP 2021).

Python 505 73 Updated Nov 9, 2023

hinthornw / promptimizer

Prompt optimization scratch

Python 542 38 Updated Dec 13, 2024

fabius8 / binanceAlert

Python 81 54 Updated Nov 11, 2024

soimort / you-get

⏬ Dumb downloader that scrapes the web

Python 54,228 9,668 Updated Dec 10, 2024

haad / proxychains

proxychains - a tool that forces any TCP connection made by any given application to follow through proxy like TOR or any other SOCKS4, SOCKS5 or HTTP(S) proxy. Supported auth-types: "user/pass" fo…

C 6,885 631 Updated Jun 8, 2024

janhq / ichigo

Local realtime voice AI

Python 2,119 110 Updated Dec 26, 2024

Thinklab-SJTU / Awesome-LLM4AD

A curated list of awesome LLM for Autonomous Driving resources (continually updated)

1,093 53 Updated Sep 25, 2024

IrohXu / Awesome-Multimodal-LLM-Autonomous-Driving

[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving

247 12 Updated Mar 14, 2024

Skyvern-AI / skyvern

Automate browser-based workflows with LLMs and Computer Vision

Python 11,237 781 Updated Dec 25, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 7,958 601 Updated Dec 23, 2024

anliyuan / Ultralight-Digital-Human

一个超轻量级、可以在移动端实时运行的数字人模型

Python 1,343 199 Updated Nov 13, 2024

AudioLLMs / Awesome-Audio-Large-Language-Models

Audio Large Language Models

200 11 Updated Dec 26, 2024

facefusion / facefusion

Industry leading face manipulation platform

Python 20,467 3,165 Updated Dec 24, 2024

i.an randomradio

Lists (1)

🚀 My stack

Starred repositories

midjourney

Electron

Go