Skip to content
View xuyuan21-tal's full-sized avatar
🎯
Focusing
🎯
Focusing
  • TAL
  • BeiJing
  • 10:21 (UTC +08:00)

Block or report xuyuan21-tal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
54 stars written in Python
Clear filter

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 68,224 14,451 Updated May 10, 2024

Inference code for Llama models

Python 55,961 9,520 Updated Aug 18, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,658 5,782 Updated Aug 19, 2024

中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理

Python 33,664 10,056 Updated Oct 8, 2024

A generative speech model for daily dialogue.

Python 31,404 3,407 Updated Oct 10, 2024

We write your reusable computer vision tools. 💜

Python 23,525 1,755 Updated Oct 11, 2024

The Memory layer for your AI apps

Python 22,257 2,050 Updated Oct 11, 2024

A community-maintained Python framework for creating mathematical animations.

Python 21,933 1,598 Updated Oct 7, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 19,099 1,932 Updated Oct 11, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,262 1,863 Updated Apr 30, 2024

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫

Python 16,902 5,356 Updated Oct 10, 2024

Machine learning, in numpy

Python 15,325 3,713 Updated Oct 29, 2023

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Python 12,662 2,868 Updated Oct 11, 2024

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search…

Python 12,022 2,930 Updated Oct 11, 2024

Question and Answer based on Anything.

Python 11,605 1,119 Updated Sep 27, 2024

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Python 11,230 1,111 Updated Sep 28, 2024

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

Python 9,607 1,386 Updated Jul 31, 2023

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,238 824 Updated Oct 5, 2024

Retrieval and Retrieval-augmented LLMs

Python 7,104 518 Updated Oct 10, 2024

The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

Python 5,973 658 Updated Sep 11, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 5,438 561 Updated Sep 29, 2024

Most popular metrics used to evaluate object detection algorithms.

Python 4,943 1,028 Updated Oct 6, 2024

Agent Zero AI framework

Python 4,476 982 Updated Oct 11, 2024

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Python 4,423 393 Updated Sep 8, 2024

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,403 455 Updated Aug 6, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,390 472 Updated Sep 28, 2024

Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESM…

Python 4,263 721 Updated Sep 23, 2024

Facilitating the design, comparison and sharing of deep text matching models.

Python 3,836 897 Updated Aug 2, 2024

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Python 3,187 247 Updated Oct 11, 2024

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 2,888 428 Updated Aug 6, 2024
Next