Mental Health Summarization (MentSum) dataset

The MentSum (Mental Health Summarization) dataset contains over 24k mental health Reddit posts with human-written TLDRs (i.e., short summaries). This dataset is introduced to expedite the research in mental health summarization domain.

Further dataset construction details are available in Section 3 of the LREC 2022 paper MentSum: A Resource for Exploring Summarization of Mental Health Online Posts.

Information on obtaining this dataset can be found here.


Citation


  @InProceedings{sotudeh2022:LREC2022,
  author    = {Sotudeh, Sajad  and  Goharian, Nazli  and  Young, Zachary},
  title     = {MentSum: A Resource for Exploring Summarization of Mental Health Online Posts},
  booktitle = {Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC)},
  year      = {2022},
  publisher = {European Language Resources Association (ELRA)},
  }

Contact Information

For any comments or questions, please email Sajad.