87 stars written in Python

Deep learning for image processing, including classification and object detection.

Python 23,644 8,052 Updated Jul 25, 2024

🍀 PyTorch implementations of various attention mechanisms, MLPs, re-parameterization, and convolution modules, helpful for understanding the papers in more depth.⭐⭐⭐

Python 11,625 1,943 Updated Dec 6, 2024

End-to-End Speech Processing Toolkit

Python 8,626 2,200 Updated Dec 23, 2024

Implementation of different kinds of Unet models for image segmentation: Unet, RCNN-Unet, Attention Unet, RCNN-Attention Unet, Nested Unet

Python 1,958 349 Updated Nov 28, 2022

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,488 434 Updated Dec 8, 2024

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Python 1,140 415 Updated Jul 25, 2024

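End-to-end Chinese speech recognition implemented with PaddlePaddle, from getting started to real-world use, with very simple introductory examples and highly practical enterprise projects. Supports the currently most popular DeepSpeech2, Conformer, and Squeezeformer models.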

Python 832 130 Updated Nov 25, 2024

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Python 558 158 Updated Aug 19, 2023

This repo hosts the code and models of "Masked Autoencoders that Listen".

Python 557 47 Updated Apr 5, 2024

(IJCV2024 & ICCV2023) LSKNet: A Foundation Lightweight Backbone for Remote Sensing

Python 505 41 Updated Oct 7, 2024

ROS + Arduino = Robot

Python 357 353 Updated Apr 10, 2019

Conformer-based Metric GAN for speech enhancement

Python 328 60 Updated May 3, 2024
Python 290 70 Updated Feb 28, 2020

Implementation of a U-net complete with efficient attention as well as the latest research findings

Python 271 20 Updated May 3, 2024

Transform-average-concatenate (TAC) method for end-to-end ad-hoc beamforming that is invariant to microphone permutation and number.

Python 263 54 Updated Jun 15, 2021

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

Python 247 55 Updated Apr 23, 2024

[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

Python 245 19 Updated Feb 12, 2023

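Preparation for mathematical modeling, including hand-written implementations of some algorithms and examples of calling them.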

Python 226 53 Updated Feb 25, 2018
Python 179 27 Updated Dec 4, 2023

A simple library for theoretical research on direction-of-arrival (DOA) estimation in array signal processing.

Python 166 48 Updated Jan 28, 2021

[ICASSP 2023] Official TensorFlow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition".

Python 165 25 Updated May 15, 2024

A meta-package for tianbot autonomous AI racecar based on nvidia development kits.

Python 126 94 Updated Dec 9, 2024

[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Python 118 7 Updated Dec 23, 2024

The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"

Python 115 20 Updated Jun 29, 2022

A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.

Python 112 31 Updated Jan 27, 2019

A two-stage polyphonic sound event detection and localization method for both SED and DOA.

Python 110 26 Updated Jan 8, 2023

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]

Python 105 11 Updated Dec 11, 2024

The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]

Python 97 10 Updated Dec 9, 2024

The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.

Python 95 12 Updated May 24, 2023