NeuroVAD: Real-Time Voice Activity Detection from Non-Invasive Neuromagnetic Signals. 2020

Debadatta Dash, and Paul Ferrari, and Satwik Dutta, and Jun Wang
Electrical and Computer Engineering, University of Texas at Austin, Austin, TX 78712, USA.

Neural speech decoding-driven brain-computer interface (BCI) or speech-BCI is a novel paradigm for exploring communication restoration for locked-in (fully paralyzed but aware) patients. Speech-BCIs aim to map a direct transformation from neural signals to text or speech, which has the potential for a higher communication rate than the current BCIs. Although recent progress has demonstrated the potential of speech-BCIs from either invasive or non-invasive neural signals, the majority of the systems developed so far still assume knowing the onset and offset of the speech utterances within the continuous neural recordings. This lack of real-time voice/speech activity detection (VAD) is a current obstacle for future applications of neural speech decoding wherein BCI users can have a continuous conversation with other speakers. To address this issue, in this study, we attempted to automatically detect the voice/speech activity directly from the neural signals recorded using magnetoencephalography (MEG). First, we classified the whole segments of pre-speech, speech, and post-speech in the neural signals using a support vector machine (SVM). Second, for continuous prediction, we used a long short-term memory-recurrent neural network (LSTM-RNN) to efficiently decode the voice activity at each time point via its sequential pattern-learning mechanism. Experimental results demonstrated the possibility of real-time VAD directly from the non-invasive neural signals with about 88% accuracy.

UI MeSH Term Description Entries
D008297 Male Males
D008875 Middle Aged An adult aged 45 - 64 years. Middle Age
D004562 Electrocardiography Recording of the moment-to-moment electromotive forces of the HEART as projected onto various sites on the body's surface, delineated as a scalar function of time. The recording is monitored by a tracing on slow moving chart paper or by observing it on a cardioscope, which is a CATHODE RAY TUBE DISPLAY. 12-Lead ECG,12-Lead EKG,12-Lead Electrocardiography,Cardiography,ECG,EKG,Electrocardiogram,Electrocardiograph,12 Lead ECG,12 Lead EKG,12 Lead Electrocardiography,12-Lead ECGs,12-Lead EKGs,12-Lead Electrocardiographies,Cardiographies,ECG, 12-Lead,EKG, 12-Lead,Electrocardiograms,Electrocardiographies, 12-Lead,Electrocardiographs,Electrocardiography, 12-Lead
D004585 Electrooculography Recording of the average amplitude of the resting potential arising between the cornea and the retina in light and dark adaptation as the eyes turn a standard distance to the right and the left. The increase in potential with light adaptation is used to evaluate the condition of the retinal pigment epithelium. EOG,Electrooculograms,Electrooculogram
D005260 Female Females
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000328 Adult A person having attained full growth or maturity. Adults are of 19 through 44 years of age. For a person between 19 and 24 years of age, YOUNG ADULT is available. Adults
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D012815 Signal Processing, Computer-Assisted Computer-assisted processing of electric, ultrasonic, or electronic signals to interpret function and activity. Digital Signal Processing,Signal Interpretation, Computer-Assisted,Signal Processing, Digital,Computer-Assisted Signal Interpretation,Computer-Assisted Signal Interpretations,Computer-Assisted Signal Processing,Interpretation, Computer-Assisted Signal,Interpretations, Computer-Assisted Signal,Signal Interpretation, Computer Assisted,Signal Interpretations, Computer-Assisted,Signal Processing, Computer Assisted
D013060 Speech Communication through a system of conventional vocal symbols. Public Speaking,Speaking, Public

Related Publications

Debadatta Dash, and Paul Ferrari, and Satwik Dutta, and Jun Wang
January 2018, IEEE access : practical innovations, open solutions,
Debadatta Dash, and Paul Ferrari, and Satwik Dutta, and Jun Wang
April 1976, The Tohoku journal of experimental medicine,
Debadatta Dash, and Paul Ferrari, and Satwik Dutta, and Jun Wang
March 2007, Orvosi hetilap,
Debadatta Dash, and Paul Ferrari, and Satwik Dutta, and Jun Wang
September 2009, Physiological measurement,
Debadatta Dash, and Paul Ferrari, and Satwik Dutta, and Jun Wang
January 2008, Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference,
Debadatta Dash, and Paul Ferrari, and Satwik Dutta, and Jun Wang
October 2021, Arabian journal for science and engineering,
Debadatta Dash, and Paul Ferrari, and Satwik Dutta, and Jun Wang
January 2012, Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference,
Debadatta Dash, and Paul Ferrari, and Satwik Dutta, and Jun Wang
July 2021, ACS omega,
Debadatta Dash, and Paul Ferrari, and Satwik Dutta, and Jun Wang
October 2013, Computers in biology and medicine,
Debadatta Dash, and Paul Ferrari, and Satwik Dutta, and Jun Wang
November 2001, Physiological measurement,
Copied contents to your clipboard!