A definition of distance measure between structural descriptions, which is based on a probabilistic interpretation of the matching predicate, is proposed. It aims at coping with the problem of classification when noise causes both local and structural deformations. The distance measure is defined according to a top-down evaluation scheme: distance between disjunctions of conjuncts, conjunctions, and literals. At the lowest level, the similarity between a feature value in the pattern model (G) and the corresponding value in the observation (Ex) is defined as the probability of observing a greater distortion. The classification problem is approached by means of a multilayered framework in which the cases of single perfect match, no perfect match, and multiple perfect match are treated differently. Another possible application of the distance measure is in the field of concept acquisition. A plausible solution for the problem of completing the attribute and structure spaces, based on the probabilistic approach, is also given. Finally, both a comparison with other related works and an application in the domain of layout-based document recognition are illustrated.

Classification in noisy environments using a distance measure between structural symbolic descriptions

ESPOSITO, Floriana;MALERBA, Donato;SEMERARO, Giovanni
1992-01-01

Abstract

A definition of distance measure between structural descriptions, which is based on a probabilistic interpretation of the matching predicate, is proposed. It aims at coping with the problem of classification when noise causes both local and structural deformations. The distance measure is defined according to a top-down evaluation scheme: distance between disjunctions of conjuncts, conjunctions, and literals. At the lowest level, the similarity between a feature value in the pattern model (G) and the corresponding value in the observation (Ex) is defined as the probability of observing a greater distortion. The classification problem is approached by means of a multilayered framework in which the cases of single perfect match, no perfect match, and multiple perfect match are treated differently. Another possible application of the distance measure is in the field of concept acquisition. A plausible solution for the problem of completing the attribute and structure spaces, based on the probabilistic approach, is also given. Finally, both a comparison with other related works and an application in the domain of layout-based document recognition are illustrated.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11586/128403
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 27
  • ???jsp.display-item.citation.isi??? 18
social impact