kevin_up xiexukang

🎯

Focusing

5 followers · 20 following

Master's student at JiangNan University
china

Achievements

Step-Audio Public
Forked from stepfun-ai/Step-Audio

Python Apache License 2.0 Updated Feb 18, 2025
DeepSeek-V3 Public
Forked from deepseek-ai/DeepSeek-V3

Python MIT License Updated Feb 5, 2025
voice_datasets Public
Forked from jim-schwoebel/voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

Updated Jun 6, 2024
keyword-spot Public
Forked from chenyangMl/keyword-spot

端到端语音唤醒工具箱，从模型训练到模型推理。

Python MIT License Updated Apr 19, 2024
3D-Speaker Public
Forked from modelscope/3D-Speaker

A repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.

Python Apache License 2.0 Updated Sep 19, 2023
awesome-multimodal-ml Public
Forked from pliang279/awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

MIT License Updated Aug 4, 2023
pyannote-audio Public
Forked from pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook MIT License Updated Jul 27, 2023
ChatWaifu_Mobile Public
Forked from Voine/ChatWaifu_Mobile

移动版二次元 AI 老婆聊天器

C++ MIT License Updated May 12, 2023
awesome-asr-contextualization Public
Forked from stevenhillis/awesome-asr-contextualization

A curated list of awesome papers on contextualizing E2E ASR outputs

Apache License 2.0 Updated May 10, 2023
code-switching-papers Public
Forked from gentaiscool/code-switching-papers

A curated list of research papers and resources on code-switching

Apache License 2.0 Updated May 10, 2023
Grounded-Segment-Anything Public
Forked from IDEA-Research/Grounded-Segment-Anything

分割一切

Jupyter Notebook Apache License 2.0 Updated Apr 11, 2023
expert_readed_books Public
Forked from 0voice/expert_readed_books

2021年最新总结，推荐工程师合适读本，计算机科学，软件技术，创业，思想类，数学类，人物传记书籍

Updated Mar 9, 2023
FastDeploy Public
Forked from PaddlePaddle/FastDeploy

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end…

C++ Apache License 2.0 Updated Feb 23, 2023
Awesome-LLM Public
Forked from Hannibal046/Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

Creative Commons Zero v1.0 Universal Updated Feb 22, 2023
FastASR Public
Forked from chenkui164/FastASR

这是一个用C++实现ASR推理的项目，它依赖很少，安装也很简单，推理速度很快，在树莓派4B等ARM平台也可以流畅的运行。支持的模型是由Google的Transformer模型中优化而来，数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小时)，所以识别效果也很好，可以媲美许多商用的ASR软件。

C Apache License 2.0 Updated Feb 15, 2023
awesome-ncnn Public
Forked from zchrissirhcz/awesome-ncnn

😎 A Collection of Awesome NCNN-based Projects

Updated Jan 5, 2023
myblog Public

myblog powered by django,xadmin

Python Updated Dec 8, 2022
json Public
Forked from nlohmann/json

JSON for Modern C++

C++ MIT License Updated Oct 7, 2022
awesome-cpp Public
Forked from fffaraz/awesome-cpp

A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.

MIT License Updated Sep 27, 2022
OpenAI_Whisper_ASR Public
Forked from prateekralhan/OpenAI_Whisper_ASR

A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models

Python MIT License Updated Sep 26, 2022
whisper Public
Forked from openai/whisper

Jupyter Notebook MIT License Updated Sep 25, 2022
sherpa-ncnn Public
Forked from k2-fsa/sherpa-ncnn

Real-time speech recognition using next-gen Kaldi with ncnn

CMake Other Updated Sep 22, 2022
ncnn Public
Forked from Tencent/ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ Other Updated Sep 21, 2022
WeTextProcessing Public
Forked from wenet-e2e/WeTextProcessing

Python Apache License 2.0 Updated Sep 16, 2022
pocolm Public
Forked from danpovey/pocolm

Small language toolkit for creation, interpolation and pruning of ARPA language models

C++ Other Updated Aug 6, 2022
torchaudio Public
Forked from pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python BSD 2-Clause "Simplified" License Updated Aug 4, 2022
espresso Public
Forked from freewym/espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Python Other Updated Jul 24, 2022
data2vec-pytorch Public
Forked from arxyzan/data2vec-pytorch

PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI

Python MIT License Updated Jul 11, 2022
findpapers Public
Forked from jonatasgrosman/findpapers

Findpapers: A tool for helping researchers who are looking for related works

Python MIT License Updated Jul 1, 2022
wenet_trt8 Public
Forked from huismiling/wenet_trt8

Python Apache License 2.0 Updated Jun 27, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kevin_up xiexukang

Achievements

Achievements

Block or report xiexukang

Step-Audio Public

DeepSeek-V3 Public

voice_datasets Public

keyword-spot Public

3D-Speaker Public

awesome-multimodal-ml Public

pyannote-audio Public

ChatWaifu_Mobile Public

awesome-asr-contextualization Public

code-switching-papers Public

Grounded-Segment-Anything Public

expert_readed_books Public

FastDeploy Public

Awesome-LLM Public

FastASR Public

awesome-ncnn Public

myblog Public

json Public

awesome-cpp Public

OpenAI_Whisper_ASR Public

whisper Public

sherpa-ncnn Public

ncnn Public

WeTextProcessing Public

pocolm Public

torchaudio Public

espresso Public

data2vec-pytorch Public

findpapers Public

wenet_trt8 Public