Stars
A quick guide (especially) for trending instruction finetuning datasets
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
Awesome-LLM: a curated list of Large Language Models
✨✨Latest Advances on Multimodal Large Language Models
🦜🔗 Build context-aware reasoning applications
Instruction Tuning with GPT-4
Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
This repo includes a curated collection of ChatGPT prompts for using ChatGPT more effectively.
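One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda
Chinese and English sensitive words, language detection, Chinese and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, job title lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English text segmentation, various Chinese word vectors, company name collection, classical Chinese poetry database, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …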
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
A library implementing different string similarity and distance measures using Python.
Unsupervised Stance Detection for Arguments from Consequences - Data and Code
Baseline Models for Argumentative Text Understanding for AI Debater (NLPCC2021)
Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing
Large-scale Pre-training Corpus for Chinese (100G)
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Software to Manipulate Different Flavors of Semantic Graphs
Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" 🧠 (ICLR 2019)
Code for the paper "Generating Hierarchical Explanations on Text Classification via Feature Interaction Detection"
Code to reproduce experiments in the paper "Task-Oriented Dialogue as Dataflow Synthesis" (TACL 2020).
PyTorch code for "Label-aware Document Representation via Hybrid Attention for Extreme Multi-Label Text Classification"