Publications:Automated speech analysis applied to laryngeal disease categorization
Title | Automated speech analysis applied to laryngeal disease categorization |
---|---|
Author | A. Gelzinis and Antanas Verikas and M. Bacauskiene |
Year | 2008 |
PublicationType | Journal Paper |
Journal | Computer Methods and Programs in Biomedicine |
HostPublication | |
Conference | |
DOI | http://dx.doi.org/10.1016/j.cmpb.2008.01.008 |
Diva url | http://hh.diva-portal.org/smash/record.jsf?searchId=1&pid=diva2:239242 |
Abstract | The long-term goal of the work is a decision support system for diagnostics of laryngeal diseases. Colour images of vocal folds, a voice signal, and questionnaire data are the information sources to be used in the analysis. This paper is concerned with automated analysis of a voice signal applied to screening of laryngeal diseases. The effectiveness of 11 different feature sets in classifying voice recordings of the sustained phonation of the vowel sound /a/ into a healthy class and two pathological classes, diffuse and nodular, is investigated. A k-NN classifier, an SVM, and a committee built using various aggregation options are used for the classification. The study used a mixed-gender database containing 312 voice recordings. A correct classification rate of 84.6% was achieved when using an SVM committee consisting of four members. The pitch and amplitude perturbation measures, cepstral energy features, autocorrelation features, and linear prediction cosine transform coefficients were amongst the feature sets providing the best performance. In the case of two-class classification, using recordings from 79 subjects representing the pathological class and 69 representing the healthy class, a correct classification rate of 95.5% was obtained from a five-member committee. Again, the pitch and amplitude perturbation measures provided the best performance. |
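
The abstract mentions committees of classifiers combined through "various aggregation options" without naming them here. As a rough illustration only, the sketch below shows one common option, plain majority voting over the three classes, together with the correct classification rate. The function names, the tie-breaking rule, and the example labels are assumptions made for this sketch; they are not taken from the paper.

```python
from collections import Counter

# Class names follow the abstract; their order is an assumption
# used here only to break ties deterministically.
LABELS = ("healthy", "diffuse", "nodular")


def majority_vote(member_predictions):
    """Return the label predicted by the most committee members.

    Hypothetical aggregation rule: ties go to the label that comes
    first in LABELS. The paper may use a different scheme.
    """
    if not member_predictions:
        raise ValueError("committee produced no predictions")
    counts = Counter(member_predictions)
    best = max(counts.values())
    for label in LABELS:
        if counts.get(label, 0) == best:
            return label
    raise ValueError("top-voted label is not one of LABELS")


def correct_classification_rate(predicted, actual):
    """Fraction of recordings whose predicted label matches the true one."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("label lists must be non-empty and of equal length")
    hits = sum(p == a for p, a in zip(predicted, actual))
    return hits / len(actual)


if __name__ == "__main__":
    # Made-up votes from a four-member committee for one recording.
    votes = ["nodular", "healthy", "nodular", "diffuse"]
    print(majority_vote(votes))
```

Each committee member could be any classifier that outputs one of the three labels; the vote operates only on the predicted labels, so it does not depend on how each member was trained.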