continuous part
Dissimilar Batch Decompositions of Random Datasets
Noname manuscript No. (will be inserted by the editor) Ghurumuruhan Ganesan IISER Bhopal Abstract For better learning, large datasets are often split into small batch es and fed sequentially to the predictive model. In this paper, we study suc h batch decompositions from a probabilistic perspective. We assume that data poin ts (possibly corrupted) are drawn independently from a given space and define a co ncept of similarity between two data points. We then consider decompositions that restrict the amount of similarity within each batch and obtain high probability bounds for the minimum size. We demonstrate an inherent tradeoff between relaxing the similarity constraint and the overall size and also use martingale methods to obtain bounds fo r the maximum size of data subsets with a given similarity.
Compressive Estimation of a Stochastic Process with Unknown Autocorrelation Function
Khalilsarai, Mahdi Barzegar, Haghighatshoar, Saeid, Caire, Giuseppe, Wunder, Gerhard
In this paper, we study the prediction of a circularly symmetric zero-mean stationary Gaussian process from a window of observations consisting of finitely many samples. This is a prevalent problem in a wide range of applications in communication theory and signal processing. Due to stationarity, when the autocorrelation function or equivalently the power spectral density (PSD) of the process is available, the Minimum Mean Squared Error (MMSE) predictor is readily obtained. In particular, it is given by a linear operator that depends on autocorrelation of the process as well as the noise power in the observed samples. The prediction becomes, however, quite challenging when the PSD of the process is unknown. In this paper, we propose a blind predictor that does not require the a priori knowledge of the PSD of the process and compare its performance with that of an MMSE predictor that has a full knowledge of the PSD. To design such a blind predictor, we use the random spectral representation of a stationary Gaussian process. We apply the well-known atomic-norm minimization technique to the observed samples to obtain a discrete quantization of the underlying random spectrum, which we use to predict the process. Our simulation results show that this estimator has a good performance comparable with that of the MMSE estimator.
Agent Based Approaches to Engineering Autonomous Space Software
Dennis, Louise A., Fisher, Michael, Lincoln, Nicholas, Lisitsa, Alexei, Veres, Sandor M.
Current approaches to the engineering of space software such as satellite control systems are based around the development of feedback controllers using packages such as MatLab's Simulink toolbox. These provide powerful tools for engineering real time systems that adapt to changes in the environment but are limited when the controller itself needs to be adapted. We are investigating ways in which ideas from temporal logics and agent programming can be integrated with the use of such control systems to provide a more powerful layer of autonomous decision making. This paper will discuss our initial approaches to the engineering of such systems.
On the Distribution of Penalized Maximum Likelihood Estimators: The LASSO, SCAD, and Thresholding
Potscher, Benedikt M., Leeb, Hannes
We study the distributions of the LASSO, SCAD, and thresholding estimators, in finite samples and in the large-sample limit. The asymptotic distributions are derived for both the case where the estimators are tuned to perform consistent model selection and for the case where the estimators are tuned to perform conservative model selection. Our findings complement those of Knight and Fu (2000) and Fan and Li (2001). We show that the distributions are typically highly nonnormal regardless of how the estimator is tuned, and that this property persists in large samples. The uniform convergence rate of these estimators is also obtained, and is shown to be slower than 1/root(n) in case the estimator is tuned to perform consistent model selection. An impossibility result regarding estimation of the estimators' distribution function is also provided.