Details
Original language | English |
---|---|
Title of host publication | Medical Imaging 2024: Imaging Informatics for Healthcare, Research, and Applications |
Editors | Hiroyuki Yoshida, Shandong Wu |
Place of Publication | San Diego |
Publisher | SPIE |
Number of pages | 9 |
Volume | 12931 |
ISBN (electronic) | 9781510671676 |
ISBN (print) | 9781510671669 |
Publication status | Published - 2 Apr 2024 |
Abstract
Keywords
- Yolov8, surgical instrument detection, synthetic data, object detection, Multi-label classification, deep learning
Cite this
- Standard
- Harvard
- Apa
- Vancouver
- BibTeX
- RIS
Medical Imaging 2024: Imaging Informatics for Healthcare, Research, and Applications. ed. / Hiroyuki Yoshida; Shandong Wu. Vol. 12931 San Diego: SPIE, 2024. 12931 .
Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review
}
TY - GEN
T1 - Medical instrument detection with synthetically generated data
AU - Wiese, Leon Vincent
AU - Hinz, Lennart
AU - Reithmeier, Eduard
PY - 2024/4/2
Y1 - 2024/4/2
N2 - The persistent need for more qualified personnel in operating theatres exacerbates the remaining staff’s workload. This increased burden can result in substantial complications during surgical procedures. To address this issue, this research project works on a comprehensive operating theatre system. The system offers real-time monitoring of all surgical instruments in the operating theatre, aiming to alleviate the problem. The foundation of this endeavor involves a neural network trained to classify and identify eight distinct instruments belonging to four distinct surgical instrument groups. A novel aspect of this study lies in the approach taken to select and generate the training and validation data sets. The data sets used in this study consist of synthetically generated image data rather than real image data. Additionally, three virtual scenes were designed to serve as the background for a generation algorithm. This algorithm randomly positions the instruments within these scenes, producing annotated rendered RGB images of the generated scenes. To assess the efficacy of this approach, a separate real data set was also created for testing the neural network. Surprisingly, it was discovered that neural networks trained solely on synthetic data performed well when applied to real data. This research paper shows that it is possible to train neural networks with purely synthetically generated data and use them to recognize surgical instruments in real images.
AB - The persistent need for more qualified personnel in operating theatres exacerbates the remaining staff’s workload. This increased burden can result in substantial complications during surgical procedures. To address this issue, this research project works on a comprehensive operating theatre system. The system offers real-time monitoring of all surgical instruments in the operating theatre, aiming to alleviate the problem. The foundation of this endeavor involves a neural network trained to classify and identify eight distinct instruments belonging to four distinct surgical instrument groups. A novel aspect of this study lies in the approach taken to select and generate the training and validation data sets. The data sets used in this study consist of synthetically generated image data rather than real image data. Additionally, three virtual scenes were designed to serve as the background for a generation algorithm. This algorithm randomly positions the instruments within these scenes, producing annotated rendered RGB images of the generated scenes. To assess the efficacy of this approach, a separate real data set was also created for testing the neural network. Surprisingly, it was discovered that neural networks trained solely on synthetic data performed well when applied to real data. This research paper shows that it is possible to train neural networks with purely synthetically generated data and use them to recognize surgical instruments in real images.
KW - Yolov8
KW - surgical instrument detection
KW - synthetic data
KW - object detection
KW - Multi-label classification
KW - deep learning
U2 - 10.1117/12.3005798
DO - 10.1117/12.3005798
M3 - Conference contribution
SN - 9781510671669
VL - 12931
BT - Medical Imaging 2024: Imaging Informatics for Healthcare, Research, and Applications
A2 - Yoshida, Hiroyuki
A2 - Wu, Shandong
PB - SPIE
CY - San Diego
ER -