Telecommunications
Competitive Safety Analysis: Robust Decision-Making in Multi-Agent Systems
Much work in AI deals with the selection of proper actions in a given (known or unknown) environment. However, how to select a proper action when facing other agents is far less clear. Most work in AI adopts classical game-theoretic equilibrium analysis to predict agent behavior in such settings. This approach, however, does not provide the agent with any guarantee. In this paper we introduce competitive safety analysis. This approach bridges the gap between the desired normative AI approach, where a strategy should be selected in order to guarantee a desired payoff, and equilibrium analysis. We show that a safety-level strategy is able to guarantee the value obtained in a Nash equilibrium in several classical computer-science settings. We then discuss the concept of competitive safety strategies and illustrate their use in a decentralized load-balancing setting typical of network problems. In particular, we show that when there are many agents, it is possible to guarantee an expected payoff which is a factor of 8/9 of the payoff obtained in a Nash equilibrium. Our discussion of competitive safety analysis for decentralized load balancing is further developed to deal with many communication links and arbitrary speeds. Finally, we discuss the extension of the above concepts to Bayesian games and illustrate their use in a basic auction setup.
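To make the notion of a safety-level (maxmin) strategy concrete, here is a minimal sketch, assuming a simple two-row matrix game and a grid search over mixed strategies; the game and all parameters are illustrative and are not taken from the paper:

```python
# Illustrative sketch: the safety level of a 2 x n payoff matrix for the
# row agent is max over mixed strategies p of the worst-case expected
# payoff -- the value the agent can guarantee regardless of the others.

def safety_level(payoffs, steps=10_000):
    """payoffs[i][j]: row agent's payoff for row i against column j (2 rows)."""
    best_value, best_p = float("-inf"), 0.0
    for k in range(steps + 1):
        p = k / steps                      # probability of playing row 0
        worst = min(p * payoffs[0][j] + (1 - p) * payoffs[1][j]
                    for j in range(len(payoffs[0])))
        if worst > best_value:
            best_value, best_p = worst, p
    return best_value, best_p

# Matching Pennies: the safety level is 0, attained at p = 1/2.
value, p = safety_level([[1, -1], [-1, 1]])
print(round(value, 3), round(p, 3))        # -> 0.0 0.5
```

For the 8/9 load-balancing result, the corresponding safety level would be computed in the agents' congestion game rather than in a toy matrix game as above.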
Decomposition of Reinforcement Learning for Admission Control of Self-Similar Call Arrival Processes
In multi-service communications networks, such as Asynchronous Transfer Mode (ATM) networks, resource control is of crucial importance for the network operator as well as for the users. The objective is to maintain the service quality while maximizing the operator's revenue. At the call level, service quality (Grade of Service) is measured in terms of call blocking probabilities, and the key resource to be controlled is bandwidth. Network routing and call admission control (CAC) are two such resource control problems. Markov decision processes offer a framework for optimal CAC and routing [1]. By modelling the dynamics of the network and its traffic, and computing control policies using dynamic programming [2], resource control is optimized. A standard assumption in such models is that calls arrive according to Poisson processes. This makes the models of the dynamics relatively simple. Although the Poisson assumption is valid for most user-initiated requests in communications networks, a number of studies [3, 4, 5] indicate that many types of arrival processes in wide-area networks as well as in local-area networks are statistically self-similar.
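As a concrete illustration of the MDP framework mentioned above, here is a toy single-link admission-control model with one call class, Poisson arrivals, and plain value iteration under uniformization; all parameter values are assumed for illustration, and this is not the paper's decomposition method:

```python
# Toy CAC model: states count calls in progress on a C-circuit link, each
# accepted call earns reward R and holds one circuit for an exponential
# time. Rates are uniformized so the chain becomes a discrete-time MDP.
lam, mu, C, R = 5.0, 1.0, 10, 1.0   # arrival rate, service rate, capacity, reward
gamma = 0.99                         # discount per uniformized step
Lam = lam + C * mu                   # uniformization constant

V = [0.0] * (C + 1)                  # V[n]: value with n calls in progress
for _ in range(5_000):
    newV = []
    for n in range(C + 1):
        accept = R + V[n + 1] if n < C else float("-inf")
        arrival = max(accept, V[n])                   # admit or block the call
        depart = V[n - 1] if n > 0 else V[0]
        # Arrival w.p. lam/Lam, a departure w.p. n*mu/Lam, else self-loop.
        newV.append(gamma / Lam * (lam * arrival + n * mu * depart
                                   + (Lam - lam - n * mu) * V[n]))
    V = newV

policy = [("accept" if n < C and R + V[n + 1] >= V[n] else "reject")
          for n in range(C + 1)]
print(policy)
```

With a single call class, the computed policy accepts whenever a circuit is free; the MDP machinery starts to matter once several classes with different rewards compete for the same bandwidth, as in the CAC problems the paper targets.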
Analysis of Bit Error Probability of Direct-Sequence CDMA Multiuser Demodulators
We analyze the bit error probability of multiuser demodulators for the direct-sequence binary-phase-shift-keying (DS/BPSK) CDMA channel with additive Gaussian noise. The problem of multiuser demodulation is cast into the finite-temperature decoding problem, and replica analysis is applied to evaluate the performance of the resulting MPM (Marginal Posterior Mode) demodulators, which include the optimal demodulator and the MAP demodulator as special cases. An approximate implementation of demodulators is proposed using the analog-valued Hopfield model as a naive mean-field approximation to the MPM demodulators, and its performance is also evaluated by the replica analysis. Results of the performance evaluation show the effectiveness of the optimal demodulator and the mean-field demodulator compared with the conventional one, especially in the cases of small information bit rate and low noise level. 1 Introduction The CDMA (Code-Division-Multiple-Access) technique [1] is important as a fundamental technology of digital communications systems, such as cellular phones. Its important applications include the realization of spread-spectrum multipoint-to-point communications systems, in which multiple users share the same communication channel.
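A toy sketch of the naive mean-field (analog Hopfield) demodulator idea on a synthetic channel may help; the user count, spreading length, and noise level below are our own assumptions, and the paper's replica analysis is not reproduced here:

```python
# Synthetic DS/BPSK CDMA channel: N users spread +/-1 bits over L chips,
# the receiver sees their superposition plus Gaussian noise. The naive
# mean-field iteration approximates the posterior means m_i, subtracting
# estimated interference; compare with the conventional matched filter.
import math, random

random.seed(0)
N, L, sigma = 4, 64, 0.1
bits = [random.choice((-1, 1)) for _ in range(N)]
codes = [[random.choice((-1, 1)) for _ in range(L)] for _ in range(N)]

y = [sum(bits[i] * codes[i][t] for i in range(N)) / math.sqrt(L)
     + random.gauss(0, sigma) for t in range(L)]

h = [sum(codes[i][t] * y[t] for t in range(L)) / math.sqrt(L)  # matched filter
     for i in range(N)]
J = [[sum(codes[i][t] * codes[j][t] for t in range(L)) / L     # code overlaps
      for j in range(N)] for i in range(N)]

beta = 1.0 / sigma ** 2              # inverse temperature of the decoder
m = [0.0] * N
for _ in range(30):                  # sequential (asynchronous) sweeps
    for i in range(N):
        field = h[i] - sum(J[i][j] * m[j] for j in range(N) if j != i)
        m[i] = math.tanh(beta * field)

mf_bits = [1 if mi >= 0 else -1 for mi in m]
matched_bits = [1 if hi >= 0 else -1 for hi in h]
errs_mf = sum(b != d for b, d in zip(bits, mf_bits))
errs_matched = sum(b != d for b, d in zip(bits, matched_bits))
print(errs_mf, errs_matched)
```

Taking the sign of each posterior-mean estimate m_i is exactly the MPM (bitwise) decision that the mean-field iteration approximates; the matched filter ignores the interference term the Hopfield dynamics cancel.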
Experiments with Infinite-Horizon, Policy-Gradient Estimation
Baxter, J., Bartlett, P. L., Weaver, L.
In this paper, we present algorithms that perform gradient ascent of the average reward in a partially observable Markov decision process (POMDP). These algorithms are based on GPOMDP, an algorithm introduced in a companion paper (Baxter & Bartlett, this volume), which computes biased estimates of the performance gradient in POMDPs. The algorithm's chief advantages are that it uses only one free parameter, beta, which has a natural interpretation in terms of the bias-variance trade-off, that it requires no knowledge of the underlying state, and that it can be applied to infinite state, control, and observation spaces. We show how the gradient estimates produced by GPOMDP can be used to perform gradient ascent, both with a traditional stochastic-gradient algorithm and with a conjugate-gradient algorithm that utilizes gradient information to bracket maxima in line searches. Experimental results are presented illustrating both the theoretical results of (Baxter & Bartlett, this volume) on a toy problem and practical aspects of the algorithms on a number of more realistic problems.
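A minimal sketch of a GPOMDP-style estimator driving stochastic gradient ascent may make the role of beta concrete; the two-action problem, step sizes, and episode lengths below are our own assumptions, not the paper's experiments:

```python
# An eligibility trace z accumulates beta-discounted score-function terms
# grad log pi(a|theta); the running average Delta of r_t * z_t estimates
# the gradient of the average reward. The toy "POMDP" is a two-action
# bandit where action 0 pays 1 and action 1 pays 0.
import math, random

random.seed(1)

def softmax_probs(theta):
    m = max(theta)
    e = [math.exp(t - m) for t in theta]
    s = sum(e)
    return [x / s for x in e]

theta = [0.0, 0.0]                 # policy parameters, one per action
beta = 0.9                         # bias-variance trade-off parameter
step = 0.1                         # gradient-ascent step size
rewards = [1.0, 0.0]               # action 0 is better

for episode in range(200):
    z = [0.0, 0.0]                 # eligibility trace
    Delta = [0.0, 0.0]             # running gradient estimate
    for t in range(100):
        probs = softmax_probs(theta)
        a = 0 if random.random() < probs[0] else 1
        # grad_theta log pi(a | theta) for a softmax policy.
        score = [(1.0 if i == a else 0.0) - probs[i] for i in range(2)]
        z = [beta * zi + si for zi, si in zip(z, score)]
        Delta = [di + (rewards[a] * zi - di) / (t + 1)
                 for di, zi in zip(Delta, z)]
    theta = [th + step * d for th, d in zip(theta, Delta)]   # ascent step

print(round(softmax_probs(theta)[0], 2))   # probability of the better action
```

Larger beta lowers the bias of the estimate at the cost of higher variance; the conjugate-gradient line-search variant described in the abstract would replace the fixed `step` update with a bracketed maximization along the estimated gradient direction.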
Low Power Wireless Communication via Reinforcement Learning
This paper examines the application of reinforcement learning to a wireless communication problem. The problem requires that channel utility be maximized while simultaneously minimizing battery usage. We present a solution to this multi-criteria problem that is able to significantly reduce power consumption. The solution uses a variable discount factor to capture the effects of battery usage. 1 Introduction Reinforcement learning (RL) has been applied to resource allocation problems in telecommunications, e.g., channel allocation in wireless systems, network routing, and admission control in telecommunication networks [1, 2, 8, 10]. These applications have demonstrated that reinforcement learning can find good policies that significantly increase the application reward within the dynamics of the telecommunication problems.
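One way to make the variable-discount-factor idea concrete (our own toy formalization, not necessarily the paper's model) is to let the charge an action draws play the role of elapsed time, so an action consuming e units of battery discounts the future by gamma**e; value iteration on a tiny battery MDP then trades channel utility against power:

```python
# Toy battery MDP: states are remaining charge units; each transmission
# action yields some channel utility and drains some charge, and the
# discount applied to the future grows with the charge consumed. All
# numbers below are assumed example values.
gamma = 0.9
B = 10                                  # battery capacity, in charge units
actions = {"low_power": (1.0, 1),       # (utility per use, charge consumed)
           "high_power": (1.5, 2)}

V = [0.0] * (B + 1)                     # V[b]: value with b charge units left
for _ in range(100):
    for b in range(1, B + 1):
        V[b] = max(r + gamma ** e * V[b - e]
                   for r, e in actions.values() if e <= b)

policy = {b: max((a for a, (r, e) in actions.items() if e <= b),
                 key=lambda a: actions[a][0]
                 + gamma ** actions[a][1] * V[b - actions[a][1]])
          for b in range(1, B + 1)}
print(policy[B])                        # -> low_power
```

With these numbers the energy-aware discount makes the frugal action optimal (two low-power uses earn 1 + 0.9 = 1.9 per two charge units versus 1.5 for one high-power use); raising the high-power utility above that threshold would flip the policy.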