KDB2000: An integrated knowledge discovery tool
Appice, Annalisa; Ceci, M.
2002-01-01
Abstract
Knowledge discovery in databases (KDD) is not a straightforward application of a single method, but rather a long-lasting, iterative and interactive process during which the user has to model a multitude of data derivation processes, execute them and interpret the results in order to formulate new derivation processes. Having a single tool that supports the user in the selection of data, their preprocessing and transformation, as well as their mining, is an important requirement of KDD practitioners. Moreover, it is important to assist the user of a KDD system in the selection of the right parameters or methods. This paper describes KDB2000, a tool that integrates database access, data preprocessing and transformation techniques, a full range of data mining algorithms, as well as pattern validation and visualization. This integration aims to support the user throughout the entire KDD process, enabling him/her to see the same problem from many different angles for a thorough investigation. In addition, KDB2000 makes use of agent technology to assist the user in some data mining tasks, such as choosing the best pruning strategy for induced decision trees in accordance with both data features and user needs.