Incorporating domain knowledge for extractive summarization of legal case documents

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Autoren

  • Paheli Bhattacharya
  • Soham Poddar
  • Koustav Rudra
  • Kripabandhu Ghosh
  • Saptarshi Ghosh

Organisationseinheiten

Externe Organisationen

  • Indian Institute of Technology Kharagpur (IITKGP)
  • Indian Institute of Science Education and Research Kolkata
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Titel des SammelwerksProceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021
Seiten22-31
Seitenumfang10
ISBN (elektronisch)9781450385268
PublikationsstatusVeröffentlicht - 27 Juli 2021
Veranstaltung18th International Conference on Artificial Intelligence and Law, ICAIL 2021 - Virtual, Online, Brasilien
Dauer: 21 Juni 202125 Juni 2021

Publikationsreihe

NameProceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021

Abstract

Automatic summarization of legal case documents is an important and practical challenge. Apart from many domain-independent text summarization algorithms that can be used for this purpose, several algorithms have been developed specifically for summarizing legal case documents. However, most of the existing algorithms do not systematically incorporate domain knowledge that specifies what information should ideally be present in a legal case document summary. To address this gap, we propose an unsupervised summarization algorithm DELSumm which is designed to systematically incorporate guidelines from legal experts into an optimization setup. We conduct detailed experiments over case documents from the Indian Supreme Court. The experiments show that our proposed unsupervised method outperforms several strong baselines in terms of ROUGE scores, including both general summarization algorithms and legal-specific ones. In fact, though our proposed algorithm is unsupervised, it outperforms several supervised summarization models that are trained over thousands of document-summary pairs.

ASJC Scopus Sachgebiete

Zitieren

Incorporating domain knowledge for extractive summarization of legal case documents. / Bhattacharya, Paheli; Poddar, Soham; Rudra, Koustav et al.
Proceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021. 2021. S. 22-31 (Proceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021).

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Bhattacharya, P, Poddar, S, Rudra, K, Ghosh, K & Ghosh, S 2021, Incorporating domain knowledge for extractive summarization of legal case documents. in Proceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021. Proceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021, S. 22-31, 18th International Conference on Artificial Intelligence and Law, ICAIL 2021, Virtual, Online, Brasilien, 21 Juni 2021. https://doi.org/10.1145/3462757.3466092
Bhattacharya, P., Poddar, S., Rudra, K., Ghosh, K., & Ghosh, S. (2021). Incorporating domain knowledge for extractive summarization of legal case documents. In Proceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021 (S. 22-31). (Proceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021). https://doi.org/10.1145/3462757.3466092
Bhattacharya P, Poddar S, Rudra K, Ghosh K, Ghosh S. Incorporating domain knowledge for extractive summarization of legal case documents. in Proceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021. 2021. S. 22-31. (Proceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021). doi: 10.1145/3462757.3466092
Bhattacharya, Paheli ; Poddar, Soham ; Rudra, Koustav et al. / Incorporating domain knowledge for extractive summarization of legal case documents. Proceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021. 2021. S. 22-31 (Proceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021).
Download
@inproceedings{54376cfa233046ffa040ce461108b1c7,
title = "Incorporating domain knowledge for extractive summarization of legal case documents",
abstract = "Automatic summarization of legal case documents is an important and practical challenge. Apart from many domain-independent text summarization algorithms that can be used for this purpose, several algorithms have been developed specifically for summarizing legal case documents. However, most of the existing algorithms do not systematically incorporate domain knowledge that specifies what information should ideally be present in a legal case document summary. To address this gap, we propose an unsupervised summarization algorithm DELSumm which is designed to systematically incorporate guidelines from legal experts into an optimization setup. We conduct detailed experiments over case documents from the Indian Supreme Court. The experiments show that our proposed unsupervised method outperforms several strong baselines in terms of ROUGE scores, including both general summarization algorithms and legal-specific ones. In fact, though our proposed algorithm is unsupervised, it outperforms several supervised summarization models that are trained over thousands of document-summary pairs.",
keywords = "integer linear programming, legal document summarization",
author = "Paheli Bhattacharya and Soham Poddar and Koustav Rudra and Kripabandhu Ghosh and Saptarshi Ghosh",
note = "Funding Information: Acknowledgements: The authors thank the Law experts from the Rajiv Gandhi School of Intellectual Property Law, India who helped in developing the gold standard data and provided the guidelines for summarization. The research is partially supported by the TCG Centres for Research and Education in Science and Technology (CREST) through the project titled {\textquoteleft}Smart Legal Consultant: AI-based Legal Analytics{\textquoteright}. Thiswork isalsosupported inpartby theEuropean Union{\textquoteright}s Horizon 2020 research and innovation programme under grant agreement No 832921. P. Bhattacharya is supported by a Fellowship from Tata Consultancy Services. ; 18th International Conference on Artificial Intelligence and Law, ICAIL 2021 ; Conference date: 21-06-2021 Through 25-06-2021",
year = "2021",
month = jul,
day = "27",
doi = "10.1145/3462757.3466092",
language = "English",
series = "Proceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021",
pages = "22--31",
booktitle = "Proceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021",

}

Download

TY - GEN

T1 - Incorporating domain knowledge for extractive summarization of legal case documents

AU - Bhattacharya, Paheli

AU - Poddar, Soham

AU - Rudra, Koustav

AU - Ghosh, Kripabandhu

AU - Ghosh, Saptarshi

N1 - Funding Information: Acknowledgements: The authors thank the Law experts from the Rajiv Gandhi School of Intellectual Property Law, India who helped in developing the gold standard data and provided the guidelines for summarization. The research is partially supported by the TCG Centres for Research and Education in Science and Technology (CREST) through the project titled ‘Smart Legal Consultant: AI-based Legal Analytics’. Thiswork isalsosupported inpartby theEuropean Union’s Horizon 2020 research and innovation programme under grant agreement No 832921. P. Bhattacharya is supported by a Fellowship from Tata Consultancy Services.

PY - 2021/7/27

Y1 - 2021/7/27

N2 - Automatic summarization of legal case documents is an important and practical challenge. Apart from many domain-independent text summarization algorithms that can be used for this purpose, several algorithms have been developed specifically for summarizing legal case documents. However, most of the existing algorithms do not systematically incorporate domain knowledge that specifies what information should ideally be present in a legal case document summary. To address this gap, we propose an unsupervised summarization algorithm DELSumm which is designed to systematically incorporate guidelines from legal experts into an optimization setup. We conduct detailed experiments over case documents from the Indian Supreme Court. The experiments show that our proposed unsupervised method outperforms several strong baselines in terms of ROUGE scores, including both general summarization algorithms and legal-specific ones. In fact, though our proposed algorithm is unsupervised, it outperforms several supervised summarization models that are trained over thousands of document-summary pairs.

AB - Automatic summarization of legal case documents is an important and practical challenge. Apart from many domain-independent text summarization algorithms that can be used for this purpose, several algorithms have been developed specifically for summarizing legal case documents. However, most of the existing algorithms do not systematically incorporate domain knowledge that specifies what information should ideally be present in a legal case document summary. To address this gap, we propose an unsupervised summarization algorithm DELSumm which is designed to systematically incorporate guidelines from legal experts into an optimization setup. We conduct detailed experiments over case documents from the Indian Supreme Court. The experiments show that our proposed unsupervised method outperforms several strong baselines in terms of ROUGE scores, including both general summarization algorithms and legal-specific ones. In fact, though our proposed algorithm is unsupervised, it outperforms several supervised summarization models that are trained over thousands of document-summary pairs.

KW - integer linear programming

KW - legal document summarization

UR - http://www.scopus.com/inward/record.url?scp=85112387162&partnerID=8YFLogxK

U2 - 10.1145/3462757.3466092

DO - 10.1145/3462757.3466092

M3 - Conference contribution

AN - SCOPUS:85112387162

T3 - Proceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021

SP - 22

EP - 31

BT - Proceedings of the 18th International Conference on Artificial Intelligence and Law, ICAIL 2021

T2 - 18th International Conference on Artificial Intelligence and Law, ICAIL 2021

Y2 - 21 June 2021 through 25 June 2021

ER -