A Data Mining Toolset For Distributed High-performance Platforms
Price
Free (open access)
Volume
28
Pages
Published
2002
Size
716 kb
Paper DOI
10.2495/DATA020051
Copyright
WIT Press
Author(s)
M Cannataro, A Congiusta, D Talia & P Trunfio
Abstract
Today a large number of scientific and commercial applications often require to analyse large data sets maintained over geographically distributed sites by using the computational power of distributed high-performance environments. Advances in networking technology and computational infrastructure made it possible to construct large-scale distributed computing platforms, called computational grids, that provide dependable, consistent, and pervasive access to high-end computational resources. Grids can play a significant role in providing an effective computational support for distributed data mining applications. Currently we are developing a software system for geographically distributed knowledge discovery applications called KNOWLEDGE GRID, which is designed on top of computational grid mechanisms, provided by grid environments such as Glob us. In this paper we present an integrated toolset named VEGA (Visual Environment for Grid Applications), which allows a Knowledge Grid user to develop and execute distributed data mining computations in a simple and effective way. 1 Introduction In many industrial, scientific and commercial applications, it is often necessary to mine large distributed data sets by using the computational power of distributed high-performance computers. Advances in networking technology and computational infrastructure made it possible to design computational grids as large-scale distributed computing platforms that provide dependable, consistent, and pervasive access to high-end computational resources. The term
Keywords