-
Master's student at JiangNan University
- china
-
-
-
voice_datasets Public
Forked from jim-schwoebel/voice_datasets🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
UpdatedJun 6, 2024 -
keyword-spot Public
Forked from chenyangMl/keyword-spot端到端语音唤醒工具箱,从模型训练到模型推理。
Python MIT License UpdatedApr 19, 2024 -
3D-Speaker Public
Forked from modelscope/3D-SpeakerA repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.
Python Apache License 2.0 UpdatedSep 19, 2023 -
awesome-multimodal-ml Public
Forked from pliang279/awesome-multimodal-mlReading list for research topics in multimodal machine learning
MIT License UpdatedAug 4, 2023 -
pyannote-audio Public
Forked from pyannote/pyannote-audioNeural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Jupyter Notebook MIT License UpdatedJul 27, 2023 -
ChatWaifu_Mobile Public
Forked from Voine/ChatWaifu_Mobile移动版二次元 AI 老婆聊天器
C++ MIT License UpdatedMay 12, 2023 -
awesome-asr-contextualization Public
Forked from stevenhillis/awesome-asr-contextualizationA curated list of awesome papers on contextualizing E2E ASR outputs
Apache License 2.0 UpdatedMay 10, 2023 -
code-switching-papers Public
Forked from gentaiscool/code-switching-papersA curated list of research papers and resources on code-switching
Apache License 2.0 UpdatedMay 10, 2023 -
Grounded-Segment-Anything Public
Forked from IDEA-Research/Grounded-Segment-Anything分割一切
Jupyter Notebook Apache License 2.0 UpdatedApr 11, 2023 -
expert_readed_books Public
Forked from 0voice/expert_readed_books2021年最新总结,推荐工程师合适读本,计算机科学,软件技术,创业,思想类,数学类,人物传记书籍
UpdatedMar 9, 2023 -
FastDeploy Public
Forked from PaddlePaddle/FastDeploy⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end…
C++ Apache License 2.0 UpdatedFeb 23, 2023 -
Awesome-LLM Public
Forked from Hannibal046/Awesome-LLMAwesome-LLM: a curated list of Large Language Model
Creative Commons Zero v1.0 Universal UpdatedFeb 22, 2023 -
FastASR Public
Forked from chenkui164/FastASR这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小时), 所以识别效果也很好,可以媲美许多商用的ASR软件。
C Apache License 2.0 UpdatedFeb 15, 2023 -
awesome-ncnn Public
Forked from zchrissirhcz/awesome-ncnn😎 A Collection of Awesome NCNN-based Projects
UpdatedJan 5, 2023 -
-
-
awesome-cpp Public
Forked from fffaraz/awesome-cppA curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
MIT License UpdatedSep 27, 2022 -
OpenAI_Whisper_ASR Public
Forked from prateekralhan/OpenAI_Whisper_ASRA minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models
Python MIT License UpdatedSep 26, 2022 -
-
sherpa-ncnn Public
Forked from k2-fsa/sherpa-ncnnReal-time speech recognition using next-gen Kaldi with ncnn
CMake Other UpdatedSep 22, 2022 -
ncnn Public
Forked from Tencent/ncnnncnn is a high-performance neural network inference framework optimized for the mobile platform
C++ Other UpdatedSep 21, 2022 -
WeTextProcessing Public
Forked from wenet-e2e/WeTextProcessingPython Apache License 2.0 UpdatedSep 16, 2022 -
pocolm Public
Forked from danpovey/pocolmSmall language toolkit for creation, interpolation and pruning of ARPA language models
C++ Other UpdatedAug 6, 2022 -
torchaudio Public
Forked from pytorch/audioData manipulation and transformation for audio signal processing, powered by PyTorch
Python BSD 2-Clause "Simplified" License UpdatedAug 4, 2022 -
espresso Public
Forked from freewym/espressoEspresso: A Fast End-to-End Neural Speech Recognition Toolkit
Python Other UpdatedJul 24, 2022 -
data2vec-pytorch Public
Forked from arxyzan/data2vec-pytorchPyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
Python MIT License UpdatedJul 11, 2022 -
findpapers Public
Forked from jonatasgrosman/findpapersFindpapers: A tool for helping researchers who are looking for related works
Python MIT License UpdatedJul 1, 2022 -