Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,575 799 Updated Dec 13, 2024

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 2,136 163 Updated Dec 6, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 13,039 1,395 Updated Dec 18, 2024

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 3,919 350 Updated Dec 18, 2024

Easily train a good VC model with voice data <= 10 mins!

Python 25,490 3,714 Updated Nov 24, 2024

Writing AI Conference Papers: A Handbook for Beginners

1,644 58 Updated Dec 23, 2024

Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation

Python 195 26 Updated Feb 10, 2024

A blog for understanding graph neural networks

412 42 Updated Mar 25, 2020

Multi-task learning using uncertainty to weigh losses for scene geometry and semantics, Auxiliary Tasks in Multi-task Learning

Python 593 83 Updated Jun 20, 2020
Jupyter Notebook 131 23 Updated Nov 5, 2023

[ICCV2023] "Vision HGNN: An Image is More than a Graph of Nodes" by Yan Han, Peihao Wang, Souvik Kundu, Ying Ding, and Zhangyang Wang

Python 44 8 Updated Apr 9, 2024

Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.

Python 4,101 709 Updated Nov 30, 2024

Healthcare data standards in China

465 286 Updated Jan 24, 2024

https://survivesjtu.github.io/SJTU-Application/#/

JavaScript 1,440 117 Updated Feb 7, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,485 91 Updated Dec 11, 2024

👀 Eye-tracking library that is easy to integrate into your projects

Python 2,078 540 Updated Jul 22, 2024

State-of-the-Art Text Embeddings

Python 15,659 2,509 Updated Dec 23, 2024

PromptCLUE: a zero-shot learning model that supports all Chinese-language tasks

Jupyter Notebook 657 66 Updated Jun 16, 2023
Python 90 5 Updated Mar 27, 2023

An open source implementation of CLIP.

Python 10,666 1,005 Updated Dec 23, 2024

⏰ Collaboratively track deadlines of conferences recommended by CCF (website, Python CLI, WeChat applet). If you find it useful, please star this project, thanks~

Vue 6,626 454 Updated Dec 26, 2024

Collection of CVPR 2024 papers and open-source projects

18,618 2,613 Updated Jul 4, 2024

On explainable attention-based deep neural networks trained on radiographic data augmented with diffusion models

Python 3 Updated Jun 7, 2023

Is synthetic data from generative models ready for image recognition?

Python 178 6 Updated Feb 16, 2023

LeetCode Solutions: A Record of My Problem Solving Journey.

JavaScript 54,879 9,472 Updated Dec 10, 2024

Demonstrate all the questions on LeetCode in the form of animation.

Java 75,608 13,988 Updated Aug 14, 2023

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also …

Python 7,015 1,214 Updated Jul 21, 2024

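Provides a practical interactive interface for large language models such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other projects, PDF/LaTeX paper translation and summarization, parallel querying of multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…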

Python 66,582 8,156 Updated Dec 24, 2024
Python 4,339 453 Updated Jul 25, 2024