CLEF 2018 Workshop
Avignon, 10-14 September 2018
Find Out MoreeRisk explores the evaluation methodology, effectiveness metrics and practical applications (particularly those related to health and safety) of early risk detection on the Internet. Early detection technologies can be employed in different areas, particularly those related to health and safety. For instance, early alerts could be sent when a predator starts interacting with a child for sexual purposes, or when a potential offender starts publishing antisocial threats on a blog, forum or social network. Our main goal is to pioneer a new interdisciplinary research area that would be potentially applicable to a wide variety of situations and to many different personal profiles. Examples include potential paedophiles, stalkers, individuals that could fall into the hands of criminal organisations, people with suicidal inclinations, or people susceptible to depression.
This is the second year that this lab runs and the lab has two main tasks:
This is a continuation of the eRisk 2017 pilot task.
The challenge consists in performing a task on early risk detection of depression. The challenge consists of sequentially processing pieces of evidence and detect early traces of depression as soon as possible. The task is mainly concerned about evaluating Text Mining solutions and, thus, it concentrates on texts written in Social Media. Texts should be processed in the order they were created. In this way, systems that effectively perform this task could be applied to sequentially monitor user interactions in blogs, social networks, or other types of online media.
The test collection for this task has the same format as the collection described in [Losada & Crestani 2016]. The source of data is also the same used for eRisk 2017. It is a collection of writings (posts or comments) from a set of Social Media users. There are two categories of users, depressed and non- depressed, and, for each user, the collection contains a sequence of writings (in chronological order). For each user, his collection of writings has been divided into 10 chunks. The first chunk contains the oldest 10% of the messages, the second chunk contains the second oldest 10%, and so forth.
The task is organized into two different stages:
Evaluation: The evaluation will take into account not only the correctness of the system's output (i.e. whether or not the user is depressed) but also the delay taken to emit its decision. To meet this aim, we will consider the ERDE metric proposed in [Losada & Crestani 2016].
The proceedings of the lab will be published in the online CEUR-WS Proceedings and on the conference website.
To have access to the collection all participants have to fill, sign and send a user agreement form (follow the instructions provided here). Once you have submitted the signed copyright form, you can proceed to register for the lab at CLEF 2018 Labs Registration site
Important DatesThis is a new task in 2018. The format, source of data and overall organization of the task are equivalent to those used for Task 1.
The challenge consists in performing a task on early risk detection of anorexia. The challenge consists of sequentially processing pieces of evidence and detect early traces of anorexia as soon as possible. The task is mainly concerned about evaluating Text Mining solutions and, thus, it concentrates on texts written in Social Media. Texts should be processed in the order they were created. In this way, systems that effectively perform this task could be applied to sequentially monitor user interactions in blogs, social networks, or other types of online media.
The test collection for this task has the same format as the collection described in [Losada & Crestani 2016]. The source of data is also the same. It is a collection of writings (posts or comments) from a set of Social Media users. There are two categories of users: those that have been diagnosed with anorexia, and a control group (non-anorexia). For each user, the collection contains a sequence of writings (in chronological order). For each user, his collection of writings has been divided into 10 chunks. The first chunk contains the oldest 10% of the messages, the second chunk contains the second oldest 10%, and so forth.
The task is organized into two different stages:
Evaluation: The evaluation will take into account not only the correctness of the system's output (i.e. whether or not the user has anorexia) but also the delay taken to emit its decision. To meet this aim, we will consider the ERDE metric proposed in [Losada & Crestani 2016].
The proceedings of the lab will be published in the online CEUR-WS Proceedings and on the conference website.
To have access to the collection all participants have to fill, sign and send a user agreement form (follow the instructions provided here). Once you have submitted the signed copyright form, you can proceed to register for the lab at CLEF 2018 Labs Registration site
Important Dates8/11/2017
30/11/2017
06/02/2018
13/02/2018
20/02/2018
27/02/2018
06/03/2018
13/03/2018
20/03/2018
27/03/2018
03/04/2018
10/04/2018
24/04/2018
31/05/2018
15/06/2018
29/06/2018