Automatic Detection of COVID-19 Based on Short-Duration Acoustic Smartphone Speech Analysis

Brian Stasak; Zhaocheng Huang; Sabah Razavi; Dale Joachim; Julien Epps

doi:10.1007/s41666-020-00090-4

Automatic Detection of COVID-19 Based on Short-Duration Acoustic Smartphone Speech Analysis

J Healthc Inform Res. 2021;5(2):201-217. doi: 10.1007/s41666-020-00090-4. Epub 2021 Mar 11.

Authors

Brian Stasak¹, Zhaocheng Huang¹, Sabah Razavi², Dale Joachim², Julien Epps¹

Affiliations

¹ School of Electrical Engineering & Telecommunications, University of New South Wales, Sydney, NSW Australia.
² Sonde Health, Boston, MA USA.

Abstract

Currently, there is an increasing global need for COVID-19 screening to help reduce the rate of infection and at-risk patient workload at hospitals. Smartphone-based screening for COVID-19 along with other respiratory illnesses offers excellent potential due to its rapid-rollout remote platform, user convenience, symptom tracking, comparatively low cost, and prompt result processing timeframe. In particular, speech-based analysis embedded in smartphone app technology can measure physiological effects relevant to COVID-19 screening that are not yet digitally available at scale in the healthcare field. Using a selection of the Sonde Health COVID-19 2020 dataset, this study examines the speech of COVID-19-negative participants exhibiting mild and moderate COVID-19-like symptoms as well as that of COVID-19-positive participants with mild to moderate symptoms. Our study investigates the classification potential of acoustic features (e.g., glottal, prosodic, spectral) from short-duration speech segments (e.g., held vowel, pataka phrase, nasal phrase) for automatic COVID-19 classification using machine learning. Experimental results indicate that certain feature-task combinations can produce COVID-19 classification accuracy of up to 80% as compared with using the all-acoustic feature baseline (68%). Further, with brute-forced n-best feature selection and speech task fusion, automatic COVID-19 classification accuracy of upwards of 82-86% was achieved, depending on whether the COVID-19-negative participant had mild or moderate COVID-19-like symptom severity.

Keywords: Digital medicine; Machine learning; Remote sensing; Respiratory illness.