Language of Trauma: Modeling Traumatic Event Descriptions Across Domains with Explainable AI

This repository contains the accompanying data and code for the paper (Findings of EMNLP 2024).

Dataset

We present our trauma event dataset TRACE (Trauma Event Recognition Across Contextual Environments).

This repository contains CSV files with the final pre-processed and labeled versions of the datasets listed below. Each file contains a text column and a trauma column, with trauma = 1 indicating the presence of a potentially traumatic event. Samples were randomly selected from the following sources:

Genocide Transcript Corpus (Schirmer et al., 2023): https://github.com/MiriamSchirmer/genocide-transcript-corpus
Reddit Mental Health Dataset (Low et al., 2020): https://zenodo.org/records/3941387
Mental Health Counseling Conversations (Amod, 2024): https://huggingface.co/datasets/Amod/mental_health_counseling_conversations
Incel Forum Dataset (Matter et al., 2024): https://www.frontiersin.org/journals/social-psychology/articles/10.3389/frsps.2024.1383152/full

All source datasets were pre-processed to ensure comparability for our trauma detection task. Because the datasets come from varied sources, their samples differ in length, ranging from single-word sentences to more elaborate descriptions of events and personal thoughts. For compatibility with the BERT architecture, we split instances exceeding the 512-token limit into smaller segments.
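The splitting step can be illustrated with a minimal sketch. The README does not say how the segments were produced (for example, whether they overlap or respect sentence boundaries), so the fixed-size, non-overlapping windows below are an assumption. The function name `split_into_segments` and the `reserved` parameter are hypothetical. The sketch reserves two positions per segment for BERT's [CLS] and [SEP] tokens and works on any pre-tokenized list; in practice the tokens would come from the model's own subword tokenizer, not from whitespace splitting.

```python
from typing import List, Sequence


def split_into_segments(
    tokens: Sequence[str], max_len: int = 512, reserved: int = 2
) -> List[List[str]]:
    """Split a token sequence into consecutive, non-overlapping segments.

    Assumption: `reserved` positions per segment are kept free for the
    special tokens ([CLS], [SEP]) that BERT adds, so each segment holds at
    most `max_len - reserved` tokens.
    """
    budget = max_len - reserved
    if budget <= 0:
        raise ValueError("max_len must be larger than reserved")
    # Step through the sequence in windows of `budget` tokens.
    return [list(tokens[i:i + budget]) for i in range(0, len(tokens), budget)]


# Illustration with whitespace tokens as a stand-in for subword tokens.
example_tokens = ("word " * 1100).split()
segments = split_into_segments(example_tokens)
segment_lengths = [len(s) for s in segments]
```

Each resulting segment would normally inherit the label of the instance it came from; whether the authors did this is not stated here.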

Files in this repository:

Dataset_GTC-V2.csv
README.md
final_df_counseling_1200_train_test.csv
final_df_incels_300_test.csv
final_df_ptsd_1200_train_test.csv
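As a starting point for working with the CSV files, the following sketch reads (text, trauma) pairs and counts the labels. It assumes the column headers are exactly `text` and `trauma`, as described in the Dataset section; the helper names `read_samples` and `label_counts` are hypothetical and not part of the repository code. Only the Python standard library is used.

```python
import csv
from collections import Counter
from typing import Iterable, List, Tuple


def read_samples(lines: Iterable[str]) -> List[Tuple[str, int]]:
    """Parse CSV lines into (text, trauma) pairs.

    Assumption: the header row names the columns "text" and "trauma".
    `lines` can be an open file handle or any iterable of CSV lines.
    """
    reader = csv.DictReader(lines)
    samples = []
    for row in reader:
        # int(float(...)) also accepts labels stored as "1.0" or "0.0".
        samples.append((row["text"], int(float(row["trauma"]))))
    return samples


def label_counts(samples: List[Tuple[str, int]]) -> Counter:
    """Count how many samples carry each trauma label."""
    return Counter(label for _, label in samples)


# Usage with one of the repository files (path relative to the repo root):
#
#     with open("final_df_ptsd_1200_train_test.csv", newline="", encoding="utf-8") as f:
#         samples = read_samples(f)
#     print(label_counts(samples))
```

Passing `newline=""` to `open` lets the `csv` module handle line breaks inside quoted text fields correctly.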
