Skip to content
View skyhawk1990's full-sized avatar

Block or report skyhawk1990

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Curated tutorials and resources for Large Language Models, AI Painting, and more.

3,813 258 Updated Mar 31, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,420 233 Updated Oct 3, 2024

This repo includes ChatGPT prompt curation to use ChatGPT better.

HTML 111,390 15,187 Updated Sep 26, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

15,254 1,418 Updated Sep 19, 2024

此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。

Jupyter Notebook 15,852 4,520 Updated Jun 21, 2022

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 68,083 14,440 Updated May 10, 2024

A guideline for building practical production-level deep learning systems to be deployed in real world applications.

4,332 645 Updated Nov 17, 2023

GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型

Python 1,716 334 Updated May 22, 2023

100+ Chinese Word Vectors 上百种预训练中文词向量

Python 11,784 2,316 Updated Oct 30, 2023

Official Pytorch implementations of PSENet.

Python 1,170 345 Updated Apr 7, 2023

AdvancedEAST is an algorithm used for Scene image text detect, which is primarily based on EAST, and the significant improvement was also made, which make long text predictions more accurate.https…

Python 1,224 381 Updated Sep 9, 2022

A tensorflow implementation of EAST text detector

C++ 3,014 1,049 Updated Nov 22, 2022

text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network

Python 3,430 1,329 Updated Oct 3, 2023

classical model code implementation of few-shot/one-shot lenaring, including siamese network, prototypical network, relation network, induction network

Python 135 50 Updated Nov 21, 2019

yolo3+ocr

Python 5,924 1,731 Updated Aug 29, 2022

Chinese version of GPT2 training code, using BERT tokenizer.

Python 7,452 1,705 Updated Apr 25, 2024

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Python 6,181 1,178 Updated May 28, 2023

Pre-Trained Chinese XLNet(中文XLNet预训练模型)

Python 1,653 280 Updated Mar 29, 2023

A curated list of resources for Chinese NLP 中文自然语言处理相关资料

7,784 1,710 Updated Jul 27, 2023

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,569 3,613 Updated Jul 28, 2024

Natural Language Processing Tasks and References

3,018 546 Updated Sep 20, 2018

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0

Jupyter Notebook 1,784 731 Updated Jul 20, 2020

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Python 4,257 1,293 Updated May 21, 2023

Collection of generative models, e.g. GAN, VAE in Pytorch and Tensorflow.

Python 7,315 2,036 Updated Mar 24, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 133,029 26,545 Updated Oct 7, 2024

TensorFlow code and pre-trained models for BERT

Python 37,965 9,577 Updated Jul 23, 2024

A system for quickly generating training data with weak supervision

Python 5,791 859 Updated May 2, 2024

all kinds of text classification models and more with deep learning

Python 7,851 2,573 Updated Sep 28, 2023

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,434 1,542 Updated May 23, 2024

Collection of NSFW images URLs for the purposes of training an NSFW Image Classifier

3,351 735 Updated Dec 14, 2020
Next