supported by the EC IST Programme
CoreGRID European Research Network on Foundations, Software Infrastructures and Applications
for large scale distributed, GRID and Peer-to-Peer Technologies
image
Home  
Friday, 05 December 2008

spacer spacer
 

The CoreGRID Network of Excellence currently offers

ident Fellowships:
   for postgraduate students in the field of GRID Research

ident Job announcements:
   related to GRID research free of charge 

Main Menu
Home
News
Events
CoreGRID WG
CoreGRID NoE
Institutes
Integration Activities
Dissemination
Training & Education
CoreGRID & Industry
Mobility Portal
Trust&Security Portal
Collaboration Gateway
Other Collaborations
Links
Contact Us
Login Form





Lost Password?
Who's Online
We have 1 guest online
Visitors: 2734408
Syndicate
Get the latest news direct to your desktop
 
spacer spacer
spacer spacer
 
CoreGRID Technical Report TR-0141 Print

Distributed Data Mining in Desktop Grids

CoreGRID Technical Report TR-0141

Several kinds of scientific and commercial applications require the execution of a large number of independent tasks. One highly successful and low cost mechanism for acquiring the necessary compute power for these applications is the “public-resource computing”, or “desktop Grid” paradigm, which exploits the computational power of private computers. So far, this paradigm has not been applied to data mining applications for two main reasons. First, it is not trivial to decompose a data mining algorithm into truly independent sub-tasks. Second, the large volume of data involved makes it difficult to handle the communication costs of a parallel paradigm. In this paper, we focus on one of the main data mining problem: the extraction of closed frequent itemsets from transactional databases. We show that is possible to decompose this problem into independent tasks, which however need to share a large volume of data. We thus introduce a data-intensive computing network, which adopts a P2P topology based on super peers with caching capabilities, aiming to support the dissemination of large amounts of information. Finally, we evaluate the execution of our data mining job on such network.
 
 
spacer spacer
spacer spacer
 
© 2008 CoreGRID Network of Excellence - European Grid Research
 
spacer spacer