AITopics | guration

Training deep learning models, particularly Transformer-based architectures such as Large Language Models (LLMs), demands substantial computational resources and extended training periods. While optimal configuration and infrastructure selection can significantly reduce associated costs, this optimization requires preliminary analysis tools. This paper introduces PreNeT, a novel predictive framework designed to address this optimization challenge. PreNeT facilitates training optimization by integrating comprehensive computational metrics, including layer-specific parameters, arithmetic operations and memory utilization. A key feature of PreNeT is its capacity to accurately predict training duration on previously unexamined hardware infrastructures, including novel accelerator architectures. This framework employs a sophisticated approach to capture and analyze the distinct characteristics of various neural network layers, thereby enhancing existing prediction methodologies. Through proactive implementation of PreNeT, researchers and practitioners can determine optimal configurations, parameter settings, and hardware specifications to maximize cost-efficiency and minimize training duration. Experimental results demonstrate that PreNeT achieves up to 72% improvement in prediction accuracy compared to contemporary state-of-the-art frameworks.

artificial intelligence, machine learning, training time, (17 more...)

arXiv.org Artificial Intelligence

2412.15519

Country: Asia > Middle East > Qatar > Ad-Dawhah > Doha (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Information Technology > Services (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Learning to Classify Galaxy Shapes Using the EM Algorithm

Neural Information Processing SystemsFeb-16-2024, 19:21:53 GMT

The eld of astronomy is increasingly data-driven as new observing instruments permit the rapid collection of massive archives of sky image data. In this paper we investigate the problem of identifying bent-double radio galaxies in the FIRST (Faint Images of the Radio Sky at Twenty-cm) Survey data set [1]. FIRST produces large numbers of radio images of the deep sky using the Very Large Array at the National Radio Astronomy Observatory. It is scheduled to cover more that 10,000 square degrees of the northern and southern caps (skies). Of particular scienti c interest to astronomers is the identi cation and cataloging of sky objects with a "bent-double" morphology, indicating clusters of galaxies ([8], see Figure 1). Due to the very large number of observed deep-sky radio sources, (on the order of 106 so far) it is infeasible for the astronomers to label all of them manually. The data from the FIRST Survey (http://sundog.stsci.edu/) is available in both raw image format and in the form of a catalog of features that have been automatically derived from the raw images by an image analysis program [8]. Each entry corresponds to a single detectable "blob" of bright intensity relative to the sky background: these entries are called

algorithm, classify galaxy shape, galaxy, (12 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.70)

Add feedback

Towards Modeling Human Attention from Eye Movements for Neural Source Code Summarization

Bansal, Aakash, Sharif, Bonita, McMillan, Collin

arXiv.org Artificial IntelligenceMay-16-2023

These descriptions are called "summaries" and are a key component of software documentation for programmers. A programmer may read a short summary like "takes a screenshot" to quickly understand what a section of code does, without resorting to reading the source code. Despite the usefulness of these summaries, programmers often neglect to write or update them. The result is that automatic source code summarization has long been an appetizing target in software engineering research. The scienti c community has long sought to enable machines to understand code in the way people do, so that those machines can describe code like a person would. A con uence of recent advances in both software engineering and machine learning research is bearing fruit, such that automated code summarization seems almost within reach. In particular, neural source code summarization has held the vanguard of the state of the art since around 2017. Neural code summarization refers to approaches based on neural networks, namely the encoderdecoder architecture [61].

machine learning, natural language, programmer, (20 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3591136

2305.09773

Country:

North America > United States > Nebraska > Lancaster County > Lincoln (0.14)
North America > United States > Indiana > St. Joseph County > Notre Dame (0.04)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.04)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)
Research Report > Experimental Study (0.93)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Software Engineering (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Ergodic Annealing

Baldassi, Carlo, Maccheroni, Fabio, Marinacci, Massimo, Pirazzini, Marco

arXiv.org Artificial IntelligenceAug-1-2020

The recent years and events lead to a massive development of content-oriented cloud services. The most popular and voluminous content o¤ered in today's networks are videos that must be e¢ ciently delivered to end customers. The objective of the service provider (root) is to optimize the delivery of content to its costumers (terminals). In this optimization problem the cost is usually assumed to be known (left graph). Yet, in reality it is often unknown because it depends on many stochastic factors, such as the tra¢ c on the network, the level of demand, and so on (right graph). Figure 1: Graphical representation of networks where information travels from a root to a set of terminals over channels with known or unknown cost.

annealing, artificial intelligence, optimization problem, (17 more...)

arXiv.org Artificial Intelligence

2008.00234

Country:

Asia > Macao (0.07)
Europe > Italy (0.04)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.35)

Add feedback

Computer-Aided Algorithm Design: Automated Tuning, Conﬁguration, Selection, and Beyond

Hoos, Holger H. (University of British Columbia)

AAAI ConferencesMay-1-2010

In this talk, I will introduce computer-aided algorithm design and discuss its main ingredients: design patterns, which provide ways of structuring potentially large spaces of candidate algorithms, and meta-algorithmic optimisation procedures, which are used for ﬁnding good designs within these spaces. After explaining how this algorithm design approach differs from and complements related approaches in program synthesis, genetic programming and so-called hyperheuristics, I will illustrate its success using examples from our own work in SAT-based software veriﬁcation (Hutter et al. 2007), timetabling (Chiarandini, Fawcett, and Hoos 2008) and mixed integer programming (Hutter, Hoos, and Leyton-Brown 2010). Furthermore, I will argue why this approach can be expected to be particularly useful and effective for building better solvers for rich and diverse classes of combinatorial problems, such as planning and scheduling. Finally, I will outline out how programming by optimisation — a design paradigm that emphasises the automated construction of performance-optimised algorithm by means of searching large spaces of alternative designs — has the potential to transform the design of high-performance algorithm from a craft that is based primarily on experience and intuition into a principled and highly effective engineering effort.

algorithm, automated tuning, computer-aided algorithm design, (14 more...)

AAAI Conferences

Twentieth International Conference on Automated Planning and Scheduling

Country: North America > Canada > British Columbia (0.06)

Technology:

Information Technology > Software Engineering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.71)

Add feedback

Comparing Beliefs, Surveys, and Random Walks

Aurell, Erik, Gordon, Uri, Kirkpatrick, Scott

Neural Information Processing SystemsDec-31-2005

Survey propagation is a powerful technique from statistical physics that has been applied to solve the 3-SAT problem both in principle and in practice. We give, using only probability arguments, a common derivation of survey propagation, belief propagation and several interesting hybrid methods. We then present numerical experiments which use WSAT (a widely used random-walk based SAT solver) to quantify the complexity of the 3-SAT formulae as a function of their parameters, both as randomly generated and after simpli£cation, guided by survey propagation. Some properties of WSAT which have not previously been reported make it an ideal tool for this purpose - its mean cost is proportional to the number of variables in the formula (at a £xed ratio of clauses to variables) in the easy-SAT regime and slightly beyond, and its behavior in the hard-SAT regime appears to re¤ect the underlying structure of the solution space that has been predicted by replica symmetry-breaking arguments. An analysis of the tradeoffs between the various methods of search for satisfying assignments shows WSAT to be far more powerful than has been appreciated, and suggests some interesting new directions for practical algorithm development.

formula, probability, wsat, (16 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Israel > Jerusalem District > Jerusalem (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)

Add feedback

Comparing Beliefs, Surveys, and Random Walks

Aurell, Erik, Gordon, Uri, Kirkpatrick, Scott

Neural Information Processing SystemsDec-31-2005

Survey propagation is a powerful technique from statistical physics that has been applied to solve the 3-SAT problem both in principle and in practice. We give, using only probability arguments, a common derivation of survey propagation, belief propagation and several interesting hybrid methods. We then present numerical experiments which use WSAT (a widely used random-walk based SAT solver) to quantify the complexity of the 3-SAT formulae as a function of their parameters, both as randomly generated and after simpli£cation, guided by survey propagation. Some properties of WSAT which have not previously been reported make it an ideal tool for this purpose - its mean cost is proportional to the number of variables in the formula (at a £xed ratio of clauses to variables) in the easy-SAT regime and slightly beyond, and its behavior in the hard-SAT regime appears to re¤ect the underlying structure of the solution space that has been predicted by replica symmetry-breaking arguments. An analysis of the tradeoffs between the various methods of search for satisfying assignments shows WSAT to be far more powerful than has been appreciated, and suggests some interesting new directions for practical algorithm development.

formula, probability, wsat, (16 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Israel > Jerusalem District > Jerusalem (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)

Add feedback

Comparing Beliefs, Surveys, and Random Walks

Aurell, Erik, Gordon, Uri, Kirkpatrick, Scott

Neural Information Processing SystemsDec-31-2005

It consists of a ensemble of randomly generated logical expressions, each depending onN Boolean variablesx i, and constructed by taking the AND of M clauses. Each clausea consists of the OR of 3 "literals"y i,a .

artificial intelligence, decimation, probability, (17 more...)

Neural Information Processing Systems

Country:

Europe > Sweden (0.14)
Asia > Middle East > Israel (0.14)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)

Add feedback

Learning to Classify Galaxy Shapes Using the EM Algorithm

Kirshner, Sergey, Cadez, Igor V., Smyth, Padhraic, Kamath, Chandrika

Neural Information Processing SystemsDec-31-2003

We describe the application of probabilistic model-based learning to the problem of automatically identifying classes of galaxies, based on both morphological and pixel intensity characteristics. The EM algorithm can be used to learn how to spatially orient a set of galaxies so that they are geometrically aligned. We augment this "ordering-model" with a mixture model on objects, and demonstrate how classes of galaxies can be learned in an unsupervised manner using a two-level EM algorithm. The resulting models provide highly accurate classi£cation of galaxies in cross-validation experiments.

algorithm, galaxy, guration, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Orange County > Irvine (0.05)
North America > United States > California > Orange County > Laguna Hills (0.04)
North America > United States > California > Alameda County > Livermore (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.94)

Add feedback

Learning to Classify Galaxy Shapes Using the EM Algorithm

Kirshner, Sergey, Cadez, Igor V., Smyth, Padhraic, Kamath, Chandrika

Neural Information Processing SystemsDec-31-2003

We describe the application of probabilistic model-based learning to the problem of automatically identifying classes of galaxies, based on both morphological and pixel intensity characteristics. The EM algorithm can be used to learn how to spatially orient a set of galaxies so that they are geometrically aligned. We augment this "ordering-model" with a mixture model on objects, and demonstrate how classes of galaxies can be learned in an unsupervised manner using a two-level EM algorithm. The resulting models provide highly accurate classi£cation of galaxies in cross-validation experiments.

algorithm, artificial intelligence, machine learning, (14 more...)

Neural Information Processing Systems

Country: North America > United States > California > Orange County > Irvine (0.15)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.94)

Add feedback

Filters

Collaborating Authors

guration

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time

Learning to Classify Galaxy Shapes Using the EM Algorithm

Towards Modeling Human Attention from Eye Movements for Neural Source Code Summarization

Ergodic Annealing

Computer-Aided Algorithm Design: Automated Tuning, Conﬁguration, Selection, and Beyond

Comparing Beliefs, Surveys, and Random Walks

Comparing Beliefs, Surveys, and Random Walks

Comparing Beliefs, Surveys, and Random Walks

Learning to Classify Galaxy Shapes Using the EM Algorithm

Learning to Classify Galaxy Shapes Using the EM Algorithm