Confidence path regularization for handling label uncertainty in semi-supervised learning: use case in bipolar disorder monitoring

IRIS

Semi-supervised learning has gained great interest because of its ability to combine unlabeled data with-potentially few-labeled observations in a training process. However, in some application contexts, one can question whether all available labels are equally valid. For example, in the context of bipolar disorder (BD) remote monitoring, a common practice is to extrapolate the psychiatrist's assessment onto some fixed time window surrounding the visit, the so-called ground truth period. In consequence, all data from this period are labeled with the same category. Such an approach may potentially result in misguided supervision affecting the model's performance. In this paper, we consider the problem of label uncertainty, assuming that the labels are crisp, but they may be assigned to particular observations with varying confidence. We propose a novel method called Confidence Path Regularization (CPR) that incorporates this uncertainty into the fuzzy c-means semi-supervised learning. The proposed CPR approach is a novel method for automatic, data-driven handling of label uncertainty. We achieve it by estimating the confidence factor for each labeled observation. In addition, CPR allows for the exploration of potential class-specific patterns in the adjusted confidence. The proposed method is illustrated with experiments on partially labeled data about speech characteristics collected from smartphone application for BD monitoring. In this particular applied scenario, we also use additional contextual data to improve the construction of confidence paths. It is shown that the proposed CPR approach enables to reflect the varying confidence in labels as compared with the nominal approach which assigns the majority of observations to the same class associated with relevant ground truth period

Confidence path regularization for handling label uncertainty in semi-supervised learning: use case in bipolar disorder monitoring

Kamil Kmita;Casalino Gabriella;Giovanna Castellano;Olgierd Hryniewicz;Katarzyna Kaczmarek-Majer

2022-01-01

Abstract

Semi-supervised learning has gained great interest because of its ability to combine unlabeled data with-potentially few-labeled observations in a training process. However, in some application contexts, one can question whether all available labels are equally valid. For example, in the context of bipolar disorder (BD) remote monitoring, a common practice is to extrapolate the psychiatrist's assessment onto some fixed time window surrounding the visit, the so-called ground truth period. In consequence, all data from this period are labeled with the same category. Such an approach may potentially result in misguided supervision affecting the model's performance. In this paper, we consider the problem of label uncertainty, assuming that the labels are crisp, but they may be assigned to particular observations with varying confidence. We propose a novel method called Confidence Path Regularization (CPR) that incorporates this uncertainty into the fuzzy c-means semi-supervised learning. The proposed CPR approach is a novel method for automatic, data-driven handling of label uncertainty. We achieve it by estimating the confidence factor for each labeled observation. In addition, CPR allows for the exploration of potential class-specific patterns in the adjusted confidence. The proposed method is illustrated with experiments on partially labeled data about speech characteristics collected from smartphone application for BD monitoring. In this particular applied scenario, we also use additional contextual data to improve the construction of confidence paths. It is shown that the proposed CPR approach enables to reflect the varying confidence in labels as compared with the nominal approach which assigns the majority of observations to the same class associated with relevant ground truth period

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2022
			
	Codice ISBN
	
				978-1-6654-6710-0
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
BEST_PAPER_FUZZIEEE2022_Confidence_path_regularization_for_handling_label_uncertainty_in_semi-supervised_learning_use_case_in_bipolar_disorder_monitoring.pdf non disponibili Descrizione: Versione Editoriale Tipologia: Documento in Versione Editoriale Licenza: NON PUBBLICO - Accesso privato/ristretto Dimensione 1.64 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.64 MB	Adobe PDF	Visualizza/Apri Richiedi una copia
Confidence_path_for_handling_label_uncertainty_in_semi_supervised_learning.pdf accesso aperto Descrizione: Pre-print Tipologia: Documento in Pre-print Licenza: Creative commons Dimensione 431.81 kB Formato Adobe PDF Visualizza/Apri	431.81 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11586/413722

Citazioni

ND

5

0

social impact