Mining Geo-Spatial Data in a Transductive Setting
Appice, Annalisa; Ceci, Michelangelo; Malerba, Donato
2007-01-01
Abstract
Many organizations collect large amounts of spatially referenced data. Spatial Data Mining targets the discovery of interesting, implicit knowledge from such data. The classification task in particular has been extensively investigated in the classical inductive setting, where only labeled examples are used to generate a classifier, discarding the large amount of information potentially conveyed by the unlabeled instances to be classified. In this work, spatial classification is based on transduction, an inference mechanism “from particular to particular” that uses both labeled and unlabeled data to build a classifier whose main goal is to classify only the unlabeled data as accurately as possible. The proposed method, named TRANSC, employs principled probabilistic classification in multi-relational data mining to address the challenges posed by handling spatial data. The predictive accuracy of TRANSC has been evaluated on two real-world spatial datasets.