Skip to content

NLP Information-grounded dialogue over heterogeneous information

Notifications You must be signed in to change notification settings

entitize/HybridDialogue

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

HybridDialogue

Kai Nakamura, Sharon Levy, Yi-Lin Tuan, Wenhu Chen, and William Yang Wang

Our lab: http://nlp.cs.ucsb.edu/index.html

About

A pressing challenge in current dialogue systems is to successfully converse with users on topics with information distributed across different modalities. Previous work in multiturn dialogue systems has primarily focused on either text or table information. In more realistic scenarios, having a joint understanding of both is critical as knowledge is typically distributed over both unstructured and structured forms. We present a new dialogue dataset, HybriDialogue, which consists of crowdsourced natural conversations grounded on both Wikipedia text and tables. The conversations are created through the decomposition of complex multihop questions into simple, realistic multiturn dialogue interactions. We conduct several baseline experiments, including retrieval, system state tracking, and dialogue response generation. Our results show that there is still ample opportunity for improvement, demonstrating the importance of building stronger dialogue systems that can reason over the complex setting of information-seeking dialogue grounded on tables and text.

Download

Dataset Download: https://drive.google.com/file/d/12GvlGEFvLumJ10TKacXxWK_5XxgYEaH6/view?usp=sharing

Usage

See data_api.py

About

NLP Information-grounded dialogue over heterogeneous information

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages