Using Machine Learning to Automate Mammogram Images Analysis

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Xuejiao Tang
  • Liuhua Zhang
  • Wenbin Zhang
  • Xin Huang
  • Vasileios Iosifidis
  • Zhen Liu
  • Mingli Zhang
  • Enza Messina
  • Ji Zhang

Research Organisations

External Research Organisations

  • Memorial University of Newfoundland
  • University of Maryland Baltimore County
  • Guangdong College of Pharmacy
  • McGill University
  • University of Milan - Bicocca
  • University of Southern Queensland
View graph of relations

Details

Original languageEnglish
Title of host publicationProceedings
Subtitle of host publication2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020
EditorsTaesung Park, Young-Rae Cho, Xiaohua Tony Hu, Illhoi Yoo, Hyun Goo Woo, Jianxin Wang, Julio Facelli, Seungyoon Nam, Mingon Kang
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages757-764
Number of pages8
ISBN (electronic)9781728162157
ISBN (print)978-1-7281-6216-4
Publication statusPublished - 2020
Event2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020 - Virtual, Seoul, Korea, Republic of
Duration: 16 Dec 202019 Dec 2020

Abstract

Breast cancer is the second leading cause of cancer-related death after lung cancer in women. Early detection of breast cancer in X-ray mammography is believed to have effectively reduced the mortality rate since 1989. However, a relatively high false positive rate and a low specificity in mammography technology still exist. In this work, a computer-aided automatic mammogram analysis system is proposed to process the mammogram images and automatically discriminate them as either normal or cancerous, consisting of three consecutive image processing, feature selection, and image classification stages. In designing the system, the discrete wavelet transforms (Daubechies 2, Daubechies 4, and Biorthogonal 6.8) and the Fourier cosine transform were first used to parse the mammogram images and extract statistical features. Then, an entropy-based feature selection method was implemented to reduce the number of features. Finally, different pattern recognition methods (including the Back-propagation Network, the Linear Discriminant Analysis, and the Naive Bayes Classifier) and a voting classification scheme were employed. The performance of each classification strategy was evaluated for sensitivity, specificity, and accuracy and for general performance using the Receiver Operating Curve. Our method is validated on the dataset from the Eastern Health in Newfoundland and Labrador of Canada. The experimental results demonstrated that the proposed automatic mammogram analysis system could effectively improve the classification performances.

Keywords

    automated diagnostic system, Breast cancer

ASJC Scopus subject areas

Sustainable Development Goals

Cite this

Using Machine Learning to Automate Mammogram Images Analysis. / Tang, Xuejiao; Zhang, Liuhua; Zhang, Wenbin et al.
Proceedings: 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020. ed. / Taesung Park; Young-Rae Cho; Xiaohua Tony Hu; Illhoi Yoo; Hyun Goo Woo; Jianxin Wang; Julio Facelli; Seungyoon Nam; Mingon Kang. Institute of Electrical and Electronics Engineers Inc., 2020. p. 757-764 9313247.

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Tang, X, Zhang, L, Zhang, W, Huang, X, Iosifidis, V, Liu, Z, Zhang, M, Messina, E & Zhang, J 2020, Using Machine Learning to Automate Mammogram Images Analysis. in T Park, Y-R Cho, XT Hu, I Yoo, HG Woo, J Wang, J Facelli, S Nam & M Kang (eds), Proceedings: 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020., 9313247, Institute of Electrical and Electronics Engineers Inc., pp. 757-764, 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020, Virtual, Seoul, Korea, Republic of, 16 Dec 2020. https://doi.org/10.48550/arXiv.2012.03151, https://doi.org/10.1109/BIBM49941.2020.9313247
Tang, X., Zhang, L., Zhang, W., Huang, X., Iosifidis, V., Liu, Z., Zhang, M., Messina, E., & Zhang, J. (2020). Using Machine Learning to Automate Mammogram Images Analysis. In T. Park, Y.-R. Cho, X. T. Hu, I. Yoo, H. G. Woo, J. Wang, J. Facelli, S. Nam, & M. Kang (Eds.), Proceedings: 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020 (pp. 757-764). Article 9313247 Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.48550/arXiv.2012.03151, https://doi.org/10.1109/BIBM49941.2020.9313247
Tang X, Zhang L, Zhang W, Huang X, Iosifidis V, Liu Z et al. Using Machine Learning to Automate Mammogram Images Analysis. In Park T, Cho YR, Hu XT, Yoo I, Woo HG, Wang J, Facelli J, Nam S, Kang M, editors, Proceedings: 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020. Institute of Electrical and Electronics Engineers Inc. 2020. p. 757-764. 9313247 doi: 10.48550/arXiv.2012.03151, 10.1109/BIBM49941.2020.9313247
Tang, Xuejiao ; Zhang, Liuhua ; Zhang, Wenbin et al. / Using Machine Learning to Automate Mammogram Images Analysis. Proceedings: 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020. editor / Taesung Park ; Young-Rae Cho ; Xiaohua Tony Hu ; Illhoi Yoo ; Hyun Goo Woo ; Jianxin Wang ; Julio Facelli ; Seungyoon Nam ; Mingon Kang. Institute of Electrical and Electronics Engineers Inc., 2020. pp. 757-764
Download
@inproceedings{b08384827ede424dbabb2e9f83ee5a02,
title = "Using Machine Learning to Automate Mammogram Images Analysis",
abstract = "Breast cancer is the second leading cause of cancer-related death after lung cancer in women. Early detection of breast cancer in X-ray mammography is believed to have effectively reduced the mortality rate since 1989. However, a relatively high false positive rate and a low specificity in mammography technology still exist. In this work, a computer-aided automatic mammogram analysis system is proposed to process the mammogram images and automatically discriminate them as either normal or cancerous, consisting of three consecutive image processing, feature selection, and image classification stages. In designing the system, the discrete wavelet transforms (Daubechies 2, Daubechies 4, and Biorthogonal 6.8) and the Fourier cosine transform were first used to parse the mammogram images and extract statistical features. Then, an entropy-based feature selection method was implemented to reduce the number of features. Finally, different pattern recognition methods (including the Back-propagation Network, the Linear Discriminant Analysis, and the Naive Bayes Classifier) and a voting classification scheme were employed. The performance of each classification strategy was evaluated for sensitivity, specificity, and accuracy and for general performance using the Receiver Operating Curve. Our method is validated on the dataset from the Eastern Health in Newfoundland and Labrador of Canada. The experimental results demonstrated that the proposed automatic mammogram analysis system could effectively improve the classification performances.",
keywords = "automated diagnostic system, Breast cancer",
author = "Xuejiao Tang and Liuhua Zhang and Wenbin Zhang and Xin Huang and Vasileios Iosifidis and Zhen Liu and Mingli Zhang and Enza Messina and Ji Zhang",
year = "2020",
doi = "10.48550/arXiv.2012.03151",
language = "English",
isbn = "978-1-7281-6216-4",
pages = "757--764",
editor = "Taesung Park and Young-Rae Cho and Hu, {Xiaohua Tony} and Illhoi Yoo and Woo, {Hyun Goo} and Jianxin Wang and Julio Facelli and Seungyoon Nam and Mingon Kang",
booktitle = "Proceedings",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",
note = "2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020 ; Conference date: 16-12-2020 Through 19-12-2020",

}

Download

TY - GEN

T1 - Using Machine Learning to Automate Mammogram Images Analysis

AU - Tang, Xuejiao

AU - Zhang, Liuhua

AU - Zhang, Wenbin

AU - Huang, Xin

AU - Iosifidis, Vasileios

AU - Liu, Zhen

AU - Zhang, Mingli

AU - Messina, Enza

AU - Zhang, Ji

PY - 2020

Y1 - 2020

N2 - Breast cancer is the second leading cause of cancer-related death after lung cancer in women. Early detection of breast cancer in X-ray mammography is believed to have effectively reduced the mortality rate since 1989. However, a relatively high false positive rate and a low specificity in mammography technology still exist. In this work, a computer-aided automatic mammogram analysis system is proposed to process the mammogram images and automatically discriminate them as either normal or cancerous, consisting of three consecutive image processing, feature selection, and image classification stages. In designing the system, the discrete wavelet transforms (Daubechies 2, Daubechies 4, and Biorthogonal 6.8) and the Fourier cosine transform were first used to parse the mammogram images and extract statistical features. Then, an entropy-based feature selection method was implemented to reduce the number of features. Finally, different pattern recognition methods (including the Back-propagation Network, the Linear Discriminant Analysis, and the Naive Bayes Classifier) and a voting classification scheme were employed. The performance of each classification strategy was evaluated for sensitivity, specificity, and accuracy and for general performance using the Receiver Operating Curve. Our method is validated on the dataset from the Eastern Health in Newfoundland and Labrador of Canada. The experimental results demonstrated that the proposed automatic mammogram analysis system could effectively improve the classification performances.

AB - Breast cancer is the second leading cause of cancer-related death after lung cancer in women. Early detection of breast cancer in X-ray mammography is believed to have effectively reduced the mortality rate since 1989. However, a relatively high false positive rate and a low specificity in mammography technology still exist. In this work, a computer-aided automatic mammogram analysis system is proposed to process the mammogram images and automatically discriminate them as either normal or cancerous, consisting of three consecutive image processing, feature selection, and image classification stages. In designing the system, the discrete wavelet transforms (Daubechies 2, Daubechies 4, and Biorthogonal 6.8) and the Fourier cosine transform were first used to parse the mammogram images and extract statistical features. Then, an entropy-based feature selection method was implemented to reduce the number of features. Finally, different pattern recognition methods (including the Back-propagation Network, the Linear Discriminant Analysis, and the Naive Bayes Classifier) and a voting classification scheme were employed. The performance of each classification strategy was evaluated for sensitivity, specificity, and accuracy and for general performance using the Receiver Operating Curve. Our method is validated on the dataset from the Eastern Health in Newfoundland and Labrador of Canada. The experimental results demonstrated that the proposed automatic mammogram analysis system could effectively improve the classification performances.

KW - automated diagnostic system

KW - Breast cancer

UR - http://www.scopus.com/inward/record.url?scp=85100348811&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2012.03151

DO - 10.48550/arXiv.2012.03151

M3 - Conference contribution

AN - SCOPUS:85100348811

SN - 978-1-7281-6216-4

SP - 757

EP - 764

BT - Proceedings

A2 - Park, Taesung

A2 - Cho, Young-Rae

A2 - Hu, Xiaohua Tony

A2 - Yoo, Illhoi

A2 - Woo, Hyun Goo

A2 - Wang, Jianxin

A2 - Facelli, Julio

A2 - Nam, Seungyoon

A2 - Kang, Mingon

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020

Y2 - 16 December 2020 through 19 December 2020

ER -