This document provides all details needed to have access to the research collection eRisk 2026.
Any scientific publication derived from the use of this collection should explicitly refer to the following publications:
1. Parapar, J., Perez, A., Wang, X., & Crestani, F. (2025). eRisk 2025: Contextual and Conversational Approaches for Depression Challenges. In European Conference on Information Retrieval (pp. 416–424).
2. Parapar, J., Perez, A., Wang, X., & Crestani, F. (2025, September). Overview of erisk 2025: Early risk prediction on the internet. In International conference of the cross-language evaluation forum for European languages (pp. 242-265). Cham: Springer Nature Switzerland.
The eRisk 2026 collections are available for research purposes under proper user agreements.
In the dataset, there are two types of instances: submissions and comments. Submissions represent the primary posts created by users. They are the main content entries, often containing a title, a body, and additional metadata such as the author and date. Comments are the responses or replies made by users to a submission or to other comments, forming a hierarchical structure. Each comment includes information about the author, content, and its parent (which could be another comment or a submission).
[
{
"submissionId": "mdB60ef",
"author": "subject_lEQN6dA",
"date": "2023-03-08T17:26:33.000+00:00",
"body": "...",
"title": "...",
"number": 3,
"targetSubject": "subject_6wEJkcb",
"comments": [
{
"commentId": "UspY8Bg",
"author": "subject_6wEJkcb",
"date": "2023-03-08T17:51:42.000+00:00",
"body": "...",
"parent": "mdB60ef"
},
...
{
"commentId": "nsnT1GB",
"author": "subject_ifthvcc",
"date": "2023-03-22T19:15:33.000+00:00",
"body": "...",
"parent": "bmC4ctO"
}
]
},
{
"submissionId": "0F6QmWR",
"author": "subject_Wotqigb",
"date": "2024-11-02T20:53:53.000+00:00",
"body": "...",
"title": "...",
"number": 3,
"targetSubject": "subject_pypfjky",
"comments": [
{
"commentId": "Oeas2Wu",
"author": "subject_pypfjky",
"date": "2024-11-02T21:55:41.000+00:00",
"body": "...",
"parent": "K3Z1yt8"
},
{
"commentId": "5CTC18p",
"author": "subject_2DDad7j",
"date": "2024-11-02T21:03:09.000+00:00",
"body": "...",
"parent": "0F6QmWR"
},
...
{
"commentId": "ZqEqil6",
"author": "subject_pypfjky",
"date": "2024-11-02T21:09:50.000+00:00",
"body": "...",
"parent": "0F6QmWR"
}
]
}
]
<DOC>
<DOCNO> SENTENCE_ID </DOCNO>
<PRE> previous sentence text </PRE>
<TEXT> sentence text </TEXT>
<POST> next sentence text </POST>
</DOC>
This collection can only be used for research purposes. If you are interested in having access to this data, please fill the following user agreement and send it to anxo.pvila@udc.es.