AITopics | Grid

Collaborating Authors

Grid

News Overviews Instructional Materials AI-Alerts Classics

Performance study of distributed Apriori-like frequent itemsets mining

Aouad, Lamine M., Le-Khac, Nhien-An, Kechadi, Tahar M.

arXiv.org Machine LearningFeb-21-2019

In this article, we focus on distributed Apriori-based frequent itemsets mining. We present a new distributed approach which takes into account inherent characteristics of this algorithm. We study the distribution aspect of this algorithm and give a comparison of the proposed approach with a classical Apriori-like distributed algorithm, using both analytical and experimental studies. We find that under a wide range of conditions and datasets, the performance of a distributed Apriori-like algorithm is not related to global strategies of pruning since the performance of the local Apriori generation is usually characterized by relatively high success rates of candidate sets frequency at low levels which switch to very low rates at some stage, and often drops to zero. This means that the intermediate communication steps and remote support counts computation and collection in classical distributed schemes are computationally inefficient locally, and then constrains the global performance. Our performance evaluation is done on a large cluster of workstations using the Condor system and its workflow manager DAGMan. The results show that the presented approach greatly enhances the performance and achieves good scalability compared to a typical distributed Apriori founded algorithm.

algorithm, artificial intelligence, data mining, (17 more...)

arXiv.org Machine Learning

doi: 10.1007/s10115-009-0205-3

1903.03008

Country: Europe (0.46)

Genre: Research Report > New Finding (0.54)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Architecture > Distributed Systems (1.00)
Information Technology > Grid (0.93)

Add feedback

Next Generation Language Resources using GRID

Calzolari, Federico, Sassolini, Eva, Sassi, Manuela, Cucurullo, Sebastiana, Picchi, Eugenio, Bertagna, Francesca, Enea, Alessandro, Monachini, Monica, Soria, Claudia, Calzolari, Nicoletta

arXiv.org Artificial IntelligenceDec-1-2009

This paper presents a case study concerning the challenges and requirements posed by next generation language resources, realized as an overall model of open, distributed and collaborative language infrastructure. If a sort of "new paradigm" is required, we think that the emerging and still evolving technology connected to Grid computing is a very interesting and suitable one for a concrete realization of this vision. Given the current limitations of Grid computing, it is very important to test the new environment on basic language analysis tools, in order to get the feeling of what are the potentialities and possible limitations connected to its use in NLP. For this reason, we have done some experiments on a module of Linguistic Miner, i.e. the extraction of linguistic patterns from restricted domain corpora.

application, artificial intelligence, natural language, (18 more...)

arXiv.org Artificial Intelligence

cs/0611148

Country:

Europe (0.95)
North America > United States > California (0.46)

Genre: Research Report (0.40)

Technology:

Information Technology > Grid (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Architecture > Distributed Systems (1.00)

Add feedback