-
text-generation-webui Public
Forked from oobabooga/text-generation-webuiA Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
Python GNU Affero General Public License v3.0 UpdatedJan 31, 2024 -
LLMTest_NeedleInAHaystack Public
Forked from gkamradt/LLMTest_NeedleInAHaystackDoing simple retrieval from LLM models at various context lengths to measure accuracy
Jupyter Notebook Other UpdatedDec 15, 2023 -
Megatron-DeepSpeed Public
Forked from deepspeedai/Megatron-DeepSpeedOngoing research training transformer language models at scale, including: BERT & GPT-2
Python Other UpdatedDec 12, 2023 -
chatgpt_system_prompt Public
Forked from LouisShark/chatgpt_system_promptstore all agent's system prompt
C MIT License UpdatedNov 29, 2023 -
LLaMA-Factory Public
Forked from hiyouga/LLaMA-FactoryEasy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
Python Apache License 2.0 UpdatedNov 23, 2023 -
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedSep 25, 2023 -
Awesome-LLM Public
Forked from Hannibal046/Awesome-LLMAwesome-LLM: a curated list of Large Language Model
-
TencentPretrain Public
Forked from Tencent/TencentPretrainTencent Pre-training framework in PyTorch & Pre-trained Model Zoo
Python Other UpdatedMar 28, 2023 -
ranking Public
Forked from tensorflow/rankingLearning to Rank in TensorFlow
Python Apache License 2.0 UpdatedOct 6, 2022 -
general_ner Public
通用的序列标注模型,其中包含了BERT模型的裁剪,实现了BERT_BILISTM_CRF等主流模型,魔改了CRF
-
-
BERT-of-Theseus Public
Forked from JetRunner/BERT-of-Theseus⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).
Python Apache License 2.0 UpdatedJan 10, 2021 -
Pretrained-Language-Model Public
Forked from huawei-noah/Pretrained-Language-ModelPretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
Python UpdatedJan 6, 2021 -
ChineseNLPCorpus Public
Forked from InsaneLife/ChineseNLPCorpus中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。
-
CPM-Generate Public
Forked from TsinghuaAI/CPM-1-GenerateChinese Pre-Trained Language Models (CPM-LM) Version-I
Python MIT License UpdatedNov 17, 2020 -
TextBrewer Public
Forked from airaria/TextBrewerA PyTorch-based knowledge distillation toolkit for natural language processing
Python Apache License 2.0 UpdatedNov 11, 2020 -
TRTorch Public
Forked from pytorch/TensorRTPyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedSep 25, 2020 -
-
PLMpapers Public
Forked from thunlp/PLMpapersMust-read Papers on pre-trained language models.
-
pytorch-crf Public
Forked from kmkurn/pytorch-crf(Linear-chain) Conditional random field in PyTorch.
Python MIT License UpdatedSep 1, 2020 -
Surprise Public
Forked from NicolasHug/SurpriseA Python scikit for building and analyzing recommender systems
Python BSD 3-Clause "New" or "Revised" License UpdatedAug 14, 2020 -
fucking-algorithm Public
Forked from labuladong/fucking-algorithm刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
-
simpletransformers Public
Forked from ThilinaRajapakse/simpletransformersTransformers for Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
Python Apache License 2.0 UpdatedAug 5, 2020 -
-
CLUEPretrainedModels Public
Forked from xu-song/CLUEPretrainedModels高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型
Python UpdatedJul 8, 2020 -
corpus Public
Forked from SimmerChan/corpus自然语言处理,知识图谱相关语料。按照Task细分,欢迎PR。
Python UpdatedJun 21, 2020 -
pkuseg-python Public
Forked from lancopku/pkuseg-pythonpkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
Python MIT License UpdatedJun 21, 2020 -
Recommendation-system Public
Forked from fire717/Recommendation-system推荐系统资料笔记收录/ Everything about Recommendation System. 专题/书籍/论文/产品
UpdatedJun 5, 2020 -
gpt2-ml Public
Forked from imcaspar/gpt2-mlGPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
-
funNLP Public
Forked from fighting41love/funNLP中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
Python UpdatedMay 23, 2020