Mental Health Summarization (MentSum) dataset
The MentSum (Mental Health Summarization) dataset contains over 24k mental health Reddit posts with human-written TLDRs (i.e., short summaries). This dataset is introduced to expedite the research in mental health summarization domain.
Further dataset construction details are available in Section 3 of the LREC 2022 paper MentSum: A Resource for Exploring Summarization of Mental Health Online Posts.
Information on obtaining this dataset can be found here.
Citation
@InProceedings{sotudeh2022:LREC2022,
author = {Sotudeh, Sajad and Goharian, Nazli and Young, Zachary},
title = {MentSum: A Resource for Exploring Summarization of Mental Health Online Posts},
booktitle = {Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC)},
year = {2022},
publisher = {European Language Resources Association (ELRA)},
}
Contact Information
For any comments or questions, please email Sajad.