Comparison of RetinaNet, SSD, and YOLO v3 for real-time pill identification

BMC Med Inform Decis Mak. 2021 Nov 22;21(1):324. doi: 10.1186/s12911-021-01691-8.

Abstract

Background: The correct identification of pills is very important to ensure the safe administration of drugs to patients. Here, we use three current mainstream object detection models, namely RetinaNet, Single Shot Multi-Box Detector (SSD), and You Only Look Once v3(YOLO v3), to identify pills and compare the associated performance.

Methods: In this paper, we introduce the basic principles of three object detection models. We trained each algorithm on a pill image dataset and analyzed the performance of the three models to determine the best pill recognition model. The models were then used to detect difficult samples and we compared the results.

Results: The mean average precision (MAP) of RetinaNet reached 82.89%, but the frames per second (FPS) is only one third of YOLO v3, which makes it difficult to achieve real-time performance. SSD does not perform as well on the indicators of MAP and FPS. Although the MAP of YOLO v3 is slightly lower than the others (80.69%), it has a significant advantage in terms of detection speed. YOLO v3 also performed better when tasked with hard sample detection, and therefore the model is more suitable for deployment in hospital equipment.

Conclusion: Our study reveals that object detection can be applied for real-time pill identification in a hospital pharmacy, and YOLO v3 exhibits an advantage in detection speed while maintaining a satisfactory MAP.

Keywords: Convolutional neural network; Pill identification; RetinaNet; SSD; YOLO v3.

MeSH terms

  • Algorithms
  • Humans
  • Neural Networks, Computer*
  • Silver Sulfadiazine*

Substances

  • Silver Sulfadiazine