Stars
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI
doc2dial data includes a set of documents from multiple domains; and conversations between an assisting agent and an end user that are grounded in the associated documents.
Weakly Supervised Topic Segmentation and Labeling
[GRL+ @ ICML 2020] PyTorch implementation for "Deep Graph Contrastive Representation Learning" (https://arxiv.org/abs/2006.04131v2)
Implementation of the paper: Text Segmentation as a Supervised Learning Task
A Neural Model for Joint Topic Segmentation and Classification
SegEval Segmentation Evaluation Package
Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN).
PyGCL: A PyTorch Library for Graph Contrastive Learning
像使用sklearn那样来使用bert进行中文文本分类、命名实体识别、句子相似度判别
基于pytorch_bert的中文多标签分类
[AAAI 2019] Code for paper "A Deep Sequential Model for Discourse Parsing on Multi-Party Dialogues"
Code and pre-trained model for: Deep Semantic Role Labeling: What Works and What's Next
TensorFlow implementation of On the Sentence Embeddings from Pre-trained Language Models (EMNLP 2020)
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
A list of recent papers about Graph Neural Network methods applied in NLP areas.
Bioformer: an efficient BERT model for biomedical text mining
Source code for EMNLP 2021 paper "Exophoric Pronoun Resolution in Dialogues with Topic Regularization"
This is the repository of our ACL 2022 paper MISC: A MIxed Strategy-Aware Model Integrating COMET for Emotional Support ConversatioN