Goto

Collaborating Authors

 Oceania


Algorithm-Dependent Generalization Bounds for Overparameterized Deep Residual Networks

arXiv.org Machine Learning

The skip-connections used in residual networks have become a standard architecture choice in deep learning due to the increased training stability and generalization performance with this architecture, although there has been limited theoretical understanding for this improvement. In this work, we analyze overparameterized deep residual networks trained by gradient descent following random initialization, and demonstrate that (i) the class of networks learned by gradient descent constitutes a small subset of the entire neural network function class, and (ii) this subclass of networks is sufficiently large to guarantee small training error. By showing (i) we are able to demonstrate that deep residual networks trained with gradient descent have a small generalization gap between training and test error, and together with (ii) this guarantees that the test error will be small. Our optimization and generalization guarantees require overparameterization that is only logarithmic in the depth of the network, while all known generalization bounds for deep non-residual networks have overparameterization requirements that are at least polynomial in the depth. This provides an explanation for why residual networks are preferable to non-residual ones.


Semantic Preserving Generative Adversarial Models

arXiv.org Machine Learning

Shahar Harel, Meir Maor †, Amir Ronen ‡ SparkBeyond L TD Israel Abstract We introduce generative adversarial models in which the discriminator is replaced by a calibrated (non-differentiable) classifier repeatedly enhanced by domain relevant features. The role of the classifier is to prove that the actual and generated data differ over a controlled semantic space. We demonstrate that such models have the ability to generate objects with strong guarantees on their properties in a wide range of domains. They require less data than ordinary GANs, provide natural stopping conditions, uncover important properties of the data, and enhance transfer learning. Our techniques can be combined with standard generative models. We demonstrate the usefulness of our approach by applying it to several unrelated domains: generating good locations for cellular antennae, molecule generation preserving key chemical properties, and generating and extrapolating lines from very few data points. Intriguing open problems are presented as well. 1 Introduction Generative adversarial networks (GANs) (Goodfellow et al. 2014) achieved many impressive results. Recent literature surveys as well as a large code repository can be found at (Creswell et al. 2017; Kurach et al. 2018; Hindupur). Arguably however, most of these results were obtained for generation of images, text, and videos. First, humans have very good judgment of the quality of the generated objects and hence can fine-tune the generative model until it is satisfactory. Second, there exists a huge amount of available data that can be used for model training. This is unlikely to be the case in a wide range of important domains (e.g.


Irregular Convolutional Auto-Encoder on Point Clouds

arXiv.org Machine Learning

We proposed a novel graph convolutional neural network that could construct a coarse, sparse latent point cloud from a dense, raw point cloud. With a novel non-isotropic convolution operation defined on irregular geometries, the model then can reconstruct the original point cloud from this latent cloud with fine details. Furthermore, we proposed that it is even possible to perform particle simulation using the latent cloud encoded from some simulated particle cloud (e.g. fluids), to accelerate the particle simulation process. Our model has been tested on ShapeNetCore dataset for Auto-Encoding with a limited latent dimension and tested on a synthesis dataset for fluids simulation. We also compare the model with other state-of-the-art models, and several visualizations were done to intuitively understand the model.


DSTL: Solution to Limitation of Small Corpus in Speech Emotion Recognition

Journal of Artificial Intelligence Research

Traditional machine learning methods share a common hypothesis: training and testing datasets must be in a common feature space with the same distribution. However, in reality, the labeled target data may be rare, so that target space does not share the same feature space or distribution as an available training set (source domain). To address the mismatch of domains, we propose a Dual-Subspace Transfer Learning (DSTL) framework that considers both the common and specific information of the two domains. In DSTL, a latent common subspace is first learned to preserve the data properties and reduce the discrepancy of domains. Then, we propose a mapping strategy to transfer the sourcespecific information to the target subspace. The integration of the domain-common and specific information constructs the proposed DSTL framework. In comparison to the stateart-of works, the main contribution of our work is that the DSTL framework not only considers the commonalities, but also exploits the specific information. Experiments on three emotional speech corpora verify the effectiveness of our approach. The results show that the methods which include both domain-common and specific information perform better than the baseline methods which only exploit the domain commonalities.


Why investing in AI is one of the biggest commercial opportunities for businesses

#artificialintelligence

This makes investment in AI one of the biggest commercial opportunities in today's fast-changing economy. However, recent Deloitte data reveals that while 82% of the UK's large businesses are pursuing some form of AI initiatives, only 15% can be considered'seasoned' or mature AI adopters – lower than the US (24%), Germany (22%), Canada (19%), Australia (17%) and France (16%). In this competitive international market, creating a favourable business environment for AI and investing in AI skills programmes is key for ensuring the long-term success of the UK's national AI strategy. While the UK has a thriving AI scene, it's important to ensure it continues to grow and that investment in AI is proportionally distributed across the country, both to make the most of existing AI talent and to encourage new talent development. Although 80% of the UK's top 50 AI start-ups are based in London, some notable AI success stories came from outside of the capital.


Prediction method for epileptic seizures developed: System designed to use data from non-surgical devices powered by AI and machine learning

#artificialintelligence

Now researchers at the University of Sydney have used advanced artificial intelligence and machine learning to develop a generalised method to predict when seizures will strike that will not require surgical implants. Dr Omid Kavehei from the Faculty of Engineering and IT and the University of Sydney Nano Institute said: "We are on track to develop an affordable, portable and non-surgical device that will give reliable prediction of seizures for people living with treatment-resistant epilepsy." In a paper published this month in Neural Networks, Dr Kavehei and his team have proposed a generalised, patient-specific, seizure-prediction method that can alert epilepsy sufferers within 30 minutes of the likelihood of a seizure. Dr Kavehei said there had been remarkable advances in artificial intelligence as well as micro- and nano-electronics that have allowed the development of such systems. Now it is completely accessible.


GE Healthcare, Fujitsu to develop AI to help spot brain aneurysms

#artificialintelligence

GE Healthcare signed on to a new Australian research collaboration to develop artificial intelligence tools to quickly and automatically diagnose brain aneurysms. GE will be contributing its Revolution CT scanners to the "co-creation" effort. The project is being led by the Australian division of Fujitsu, the Tokyo-based IT services firm, which will focus on developing the AI and digital solutions. Additionally, Sydney's Macquarie University and Macquarie Medical Imaging will provide clinical expertise for the product's development and testing. The group hopes to offer a commercial solution to radiology practices in Australia before going worldwide.


Data Science Certification Artificial Intelligence - EduAstrum

#artificialintelligence

We are in the era of the Fourth Industrial Revolution. The economy and businesses are being reshaped. We need to be prepared for the new world that lies ahead. We provide the best and most comprehensive Training in Data Science, Machine learning, Python, IoT in Kerala. EduAstrum has partnered with ExcelR Solutions to bring you world-class technical and managerial trainings in Kerala.


Top 5 Epic Artificial Intelligence Fails

#artificialintelligence

Artificial Intelligence over the years has done wonders in various sectors. And with time this sought-after technology is just getting better and better, making human tasks easier than ever. However, there is a bitter fact and that is it can make mistakes -- after all, it's just technology. As the contribution of AI to humanity has been monumental, its failures have also been equally hilarious. In this article, we are going to take a look at five epic instances when AI has failed to the core.


ASIC plan for AI snoops on insurance calls strains hearing

#artificialintelligence

Australia's financial watchdog might be dreaming of the day when call centre surveillance software automatically catches crooked insurance sales staff. But there's still a way to go before AI-powered voice analytics can decipher the verbage that bubbles out of a sales boiler room. That's the reality check bowled up to regulators and industry rapidly spitballing prototypes of new regtech solutions as banks, insurers and auditors all trying to find ways to automatically detect bad behaviour without creating a profit sapping compliance cost sinkhole in the process. At a closely watched regtech forum late last month, ASIC outlined its findings from a trial of voice analytics software applied to a sample of life insurance sales calls. With a freshly sharpened set of teeth, the watchdog says it sees "great potential" in using voice analytics to automatically identify instances of potential misconduct in life insurance sales calls - but there's a catch.