Assessment of the statistical significance of classifications in infrared spectroscopy based diagnostic models

Fecha de publicación: 07/04/2015

Autores de IIS La Fe

Julia Kuligowski

Autor
Guillermo Quintas Soriano

Autor

Participantes ajenos a IIS La Fe

Pérez-Guaita D
Garrigues S
Wood BR

Grupos

Perinatología

Leader

Máximo Vento Torres

Abstract

Fourier transform infrared (IR) spectroscopy in combination with multivariate data analysis is a versatile tool that can be applied to disease diagnosis. However, a rigorous validation of the obtained models is necessary in order to obtain robust results. This work evaluates the advantages of the use of permutation testing for determining the statistical significance of the misclassification errors obtained from IR based diagnostic models through cross validation (CV). The model performance, estimated by CV, is compared to a distribution of CV-performance values obtained using randomly permuted class labels. The distribution of 'random CV-values' is considered as a null distribution and used to establish the significance of the model estimators obtained using real class labels. ATR-FTIR spectra of serum samples were classified using random forest (RF) classifiers according to two criteria, the tag number (a randomly assigned pseudo class membership) and the level of urea (real class). CV errors obtained were compared to the null distribution of CV errors from a permutation test and an independent validation set. The procedure was evaluated testing typical conditions leading to overoptimistic estimations provided by the CV like e.g. the size of subsamples used during CV, variable selection and the use of replicates. Results show that for the tag number (pseudo class), CV indicated classification errors between 23 and 33% depending on the subsample size employed. Those values were even lower when variable selection or replicates were used. However, permutation testing indicated that those CV errors were non-significant. In contrast, for sample classification according to their levels of urea, all cross validation errors were found to be significant. Although the proposed method is computationally intensive, it provides a simple way of calculating an empirical p-value of the CV-estimator, thus establishing the statistical significance and providing a feasibility indicator especially useful for studies where the number of samples is limited.

Datos de la publicación

ISSN/ISSNe:: 0003-2654, 1364-5528
Tipo:: Article
Páginas:: 2422-2427
DOI:: 10.1039/c4an01783h
PubMed:: 25382314
Factor de Impacto:: 1,229 SCImago ℠
Cuartil:: Q1 SCImago ℠

Documentos

No hay documentos

Métricas

Filiaciones

Keywords

CHANCE CORRELATION; SERUM; VALIDATION; TOOL

Proyectos y Estudios Clínicos

UTILIDAD DEL TRATAMIENTO TOCOLITICO DE MANTENIMIENTO EN EL MANEJO DE LA AMENAZA DE PARTO PREMATURO (APP)

Investigador Principal: MÁXIMO VENTO TORRES

EC11-246 . 2012

MULTICENTER, RANDOMIZED, BLINDED CLINICAL STUDY COMPARING EARLY USE OF TOTAL BODY MODERATE HYPOTHERMIA PLUS TOPIRAMATE OR PLACEBO IN ASPHYXIATED NEWBORN INFANTS EVOLVING TO MODERATE-TO-SEVERE HYPOXIC ISCHEMIC ENCEPHALOPATHY

Investigador Principal: MÁXIMO VENTO TORRES

EC11-244 . INSTITUTO DE SALUD CARLOS III; FUNDACIÓN PARA LA INVESTIGACIÓN DEL HOSPITAL UNIVERSITARIO LA FE DE LA COMUNIDAD VALENCIANA . 2012

DAÑO OXIDATIVO Y METILACIÓN DEL ADN Y ACTIVIDAD DE LAS ENZIMAS REPARADORAS Y SU FRECUENCIA MUTAGÉNICA EN PREMATUROS SEGÚN LA CARGA DE OXÍGENO RECIBIDA EN LA REANIMACIÓN

Investigador Principal: MÁXIMO VENTO TORRES

PI14/00443 . INSTITUTO DE SALUD CARLOS III . 2015

IMPACTO DEL ECMO SOBRE LA FARMACOCINÉTICA DE LA ANIDULAFUNGINA.

Investigador Principal: FRANCISCA PÉREZ ESTEBAN

PFI-ANI-2013-01

Cita

Cita Bibliográfica

Assessment of the statistical significance of classifications in infrared spectroscopy based diagnostic models

Autores de IIS La Fe

Julia Kuligowski

Guillermo Quintas Soriano

Participantes ajenos a IIS La Fe

Grupos

Perinatología

Máximo Vento Torres

Abstract

Datos de la publicación

Documentos

Métricas

Filiaciones

Keywords

Proyectos y Estudios Clínicos

UTILIDAD DEL TRATAMIENTO TOCOLITICO DE MANTENIMIENTO EN EL MANEJO DE LA AMENAZA DE PARTO PREMATURO (APP)

MULTICENTER, RANDOMIZED, BLINDED CLINICAL STUDY COMPARING EARLY USE OF TOTAL BODY MODERATE HYPOTHERMIA PLUS TOPIRAMATE OR PLACEBO IN ASPHYXIATED NEWBORN INFANTS EVOLVING TO MODERATE-TO-SEVERE HYPOXIC ISCHEMIC ENCEPHALOPATHY

EFECTO DE LA MODULACION DEL FACTOR INDUCIBLE POR HIPOXIA (HIF) SOBRE LA DEGENERACION RETINIANA EN RETINOSIS PIGMENTARIA

RED DE SALUD MATERNO INFANTIL Y DEL DESARROLLO

CONTRATO POSTDOCTORAL DE INVESTIGACION SARA BORELL

DAÑO OXIDATIVO Y METILACIÓN DEL ADN Y ACTIVIDAD DE LAS ENZIMAS REPARADORAS Y SU FRECUENCIA MUTAGÉNICA EN PREMATUROS SEGÚN LA CARGA DE OXÍGENO RECIBIDA EN LA REANIMACIÓN

IMPACTO DEL ECMO SOBRE LA FARMACOCINÉTICA DE LA ANIDULAFUNGINA.

Cita

Compartir

Assessment of the statistical significance of classifications in infrared spectroscopy based diagnostic models

Autores de IIS La Fe

Guillermo Quintas Soriano

Participantes ajenos a IIS La Fe

Grupos

Abstract

Datos de la publicación

Documentos

Métricas

Filiaciones mostrar / ocultar

Keywords

Proyectos y Estudios Clínicos

Cita

Compartir

Filiaciones