SemEval-2018 Task 12: The Argument Reasoning Comprehension Task

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

External Research Organisations

  • Technische Universität Darmstadt
  • Bauhaus-Universität Weimar
View graph of relations

Details

Original languageEnglish
Title of host publicationProceedings of the 12th International Workshop on Semantic Evaluation
EditorsMarianna Apidianaki, Marianna Apidianaki, Saif M. Mohammad, Jonathan May, Ekaterina Shutova, Steven Bethard, Marine Carpuat
Pages763-772
Number of pages10
Publication statusPublished - Jun 2018
Externally publishedYes
Event12th International Workshop on Semantic Evaluation, SemEval 2018, co-located with the 16th Annual Conference of the North American Chapter of the - New Orleans, United States
Duration: 5 Jun 20186 Jun 2018

Abstract

A natural language argument is composed of a claim as well as reasons given as premises for the claim. The warrant explaining the reasoning is usually left implicit, as it is clear from the context and common sense. This makes a comprehension of arguments easy for humans but hard for machines. This paper summarizes the first shared task on argument reasoning comprehension. Given a premise and a claim along with some topic information, the goal is to automatically identify the correct warrant among two candidates that are plausible and lexically close, but in fact imply opposite claims. We describe the dataset with 1970 instances that we built for the task, and we outline the 21 computational approaches that participated, most of which used neural networks. The results reveal the complexity of the task, with many approaches hardly improving over the random accuracy of ≈ 0.5. Still, the best observed accuracy (0.712) underlines the principle feasibility of identifying warrants. Our analysis indicates that an inclusion of external knowledge is key to reasoning comprehension.

ASJC Scopus subject areas

Cite this

SemEval-2018 Task 12: The Argument Reasoning Comprehension Task. / Habernal, Ivan; Wachsmuth, Henning; Gurevych, Iryna et al.
Proceedings of the 12th International Workshop on Semantic Evaluation. ed. / Marianna Apidianaki; Marianna Apidianaki; Saif M. Mohammad; Jonathan May; Ekaterina Shutova; Steven Bethard; Marine Carpuat. 2018. p. 763-772.

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Habernal, I, Wachsmuth, H, Gurevych, I & Stein, B 2018, SemEval-2018 Task 12: The Argument Reasoning Comprehension Task. in M Apidianaki, M Apidianaki, SM Mohammad, J May, E Shutova, S Bethard & M Carpuat (eds), Proceedings of the 12th International Workshop on Semantic Evaluation. pp. 763-772, 12th International Workshop on Semantic Evaluation, SemEval 2018, co-located with the 16th Annual Conference of the North American Chapter of the, New Orleans, United States, 5 Jun 2018. https://doi.org/10.18653/v1/S18-1121
Habernal, I., Wachsmuth, H., Gurevych, I., & Stein, B. (2018). SemEval-2018 Task 12: The Argument Reasoning Comprehension Task. In M. Apidianaki, M. Apidianaki, S. M. Mohammad, J. May, E. Shutova, S. Bethard, & M. Carpuat (Eds.), Proceedings of the 12th International Workshop on Semantic Evaluation (pp. 763-772) https://doi.org/10.18653/v1/S18-1121
Habernal I, Wachsmuth H, Gurevych I, Stein B. SemEval-2018 Task 12: The Argument Reasoning Comprehension Task. In Apidianaki M, Apidianaki M, Mohammad SM, May J, Shutova E, Bethard S, Carpuat M, editors, Proceedings of the 12th International Workshop on Semantic Evaluation. 2018. p. 763-772 doi: 10.18653/v1/S18-1121
Habernal, Ivan ; Wachsmuth, Henning ; Gurevych, Iryna et al. / SemEval-2018 Task 12 : The Argument Reasoning Comprehension Task. Proceedings of the 12th International Workshop on Semantic Evaluation. editor / Marianna Apidianaki ; Marianna Apidianaki ; Saif M. Mohammad ; Jonathan May ; Ekaterina Shutova ; Steven Bethard ; Marine Carpuat. 2018. pp. 763-772
Download
@inproceedings{da9779a14af34f5d9ada0c5f00de1d10,
title = "SemEval-2018 Task 12: The Argument Reasoning Comprehension Task",
abstract = "A natural language argument is composed of a claim as well as reasons given as premises for the claim. The warrant explaining the reasoning is usually left implicit, as it is clear from the context and common sense. This makes a comprehension of arguments easy for humans but hard for machines. This paper summarizes the first shared task on argument reasoning comprehension. Given a premise and a claim along with some topic information, the goal is to automatically identify the correct warrant among two candidates that are plausible and lexically close, but in fact imply opposite claims. We describe the dataset with 1970 instances that we built for the task, and we outline the 21 computational approaches that participated, most of which used neural networks. The results reveal the complexity of the task, with many approaches hardly improving over the random accuracy of ≈ 0.5. Still, the best observed accuracy (0.712) underlines the principle feasibility of identifying warrants. Our analysis indicates that an inclusion of external knowledge is key to reasoning comprehension.",
author = "Ivan Habernal and Henning Wachsmuth and Iryna Gurevych and Benno Stein",
note = "Funding Information: This work was supported by the German Research Foundation (DFG) within the ArguAna Project GU 798/20-1, and by the DFG-funded research training group “Adaptive Preparation of Information form Heterogeneous Sources” (AIPHES, GRK 1994/1).; 12th International Workshop on Semantic Evaluation, SemEval 2018, co-located with the 16th Annual Conference of the North American Chapter of the ; Conference date: 05-06-2018 Through 06-06-2018",
year = "2018",
month = jun,
doi = "10.18653/v1/S18-1121",
language = "English",
isbn = "9781948087209",
pages = "763--772",
editor = "Marianna Apidianaki and Marianna Apidianaki and Mohammad, {Saif M.} and Jonathan May and Ekaterina Shutova and Steven Bethard and Marine Carpuat",
booktitle = "Proceedings of the 12th International Workshop on Semantic Evaluation",

}

Download

TY - GEN

T1 - SemEval-2018 Task 12

T2 - 12th International Workshop on Semantic Evaluation, SemEval 2018, co-located with the 16th Annual Conference of the North American Chapter of the

AU - Habernal, Ivan

AU - Wachsmuth, Henning

AU - Gurevych, Iryna

AU - Stein, Benno

N1 - Funding Information: This work was supported by the German Research Foundation (DFG) within the ArguAna Project GU 798/20-1, and by the DFG-funded research training group “Adaptive Preparation of Information form Heterogeneous Sources” (AIPHES, GRK 1994/1).

PY - 2018/6

Y1 - 2018/6

N2 - A natural language argument is composed of a claim as well as reasons given as premises for the claim. The warrant explaining the reasoning is usually left implicit, as it is clear from the context and common sense. This makes a comprehension of arguments easy for humans but hard for machines. This paper summarizes the first shared task on argument reasoning comprehension. Given a premise and a claim along with some topic information, the goal is to automatically identify the correct warrant among two candidates that are plausible and lexically close, but in fact imply opposite claims. We describe the dataset with 1970 instances that we built for the task, and we outline the 21 computational approaches that participated, most of which used neural networks. The results reveal the complexity of the task, with many approaches hardly improving over the random accuracy of ≈ 0.5. Still, the best observed accuracy (0.712) underlines the principle feasibility of identifying warrants. Our analysis indicates that an inclusion of external knowledge is key to reasoning comprehension.

AB - A natural language argument is composed of a claim as well as reasons given as premises for the claim. The warrant explaining the reasoning is usually left implicit, as it is clear from the context and common sense. This makes a comprehension of arguments easy for humans but hard for machines. This paper summarizes the first shared task on argument reasoning comprehension. Given a premise and a claim along with some topic information, the goal is to automatically identify the correct warrant among two candidates that are plausible and lexically close, but in fact imply opposite claims. We describe the dataset with 1970 instances that we built for the task, and we outline the 21 computational approaches that participated, most of which used neural networks. The results reveal the complexity of the task, with many approaches hardly improving over the random accuracy of ≈ 0.5. Still, the best observed accuracy (0.712) underlines the principle feasibility of identifying warrants. Our analysis indicates that an inclusion of external knowledge is key to reasoning comprehension.

UR - http://www.scopus.com/inward/record.url?scp=85061624457&partnerID=8YFLogxK

U2 - 10.18653/v1/S18-1121

DO - 10.18653/v1/S18-1121

M3 - Conference contribution

AN - SCOPUS:85061624457

SN - 9781948087209

SP - 763

EP - 772

BT - Proceedings of the 12th International Workshop on Semantic Evaluation

A2 - Apidianaki, Marianna

A2 - Apidianaki, Marianna

A2 - Mohammad, Saif M.

A2 - May, Jonathan

A2 - Shutova, Ekaterina

A2 - Bethard, Steven

A2 - Carpuat, Marine

Y2 - 5 June 2018 through 6 June 2018

ER -

By the same author(s)