Goto

Collaborating Authors

 South America


Global Artificial Intelligence (AI) Market in Food and Beverage (F&B) Industry 2017-2021 Evolving Opportunities with Aboard Software and Ailytic Technavio

#artificialintelligence

The global artificial intelligence (AI) market in food and beverage (F&B) industry market is poised to grow by USD 275.34 million during 2017-2021, progressing at a CAGR of more than 42% during the forecast period. This press release features multimedia. The introduction of regulations to improve food safety and emergence of IIoT is anticipated to boost the growth of the market. Manufacturers in the F&B industry are increasingly adopting automation to meet regulations and guidelines set by industry associations for the maintenance of quality products. F&B manufacturers are required to have a safety system in place for analysis of hazards and risk-based preventive controls.


Uncovering differential equations from data with hidden variables

arXiv.org Machine Learning

Examples include meteorology, biology, and physics. The usual way to model deterministic dynamical systems is by using (partial) differential equations. Typically, differential equations models for a given dynamical system are derived using apriori insights into the problem at hand; then the model is validated using empirical observations. In an era in which massive data-sets pertaining to different fields of science are widely available, an interesting problem is whether it is possible for a useful differential equations model to be learned directly from data, without any major modeling effort required by the researcher. Our goal in this paper is to develop a general methodology for building such differential equations models in contexts in which not all relevant variables are observed, that is, in cases in which the main variable of interest depends on other variables of which no measurements are available. As a concrete example, consider the following problem. RTE, the electricity transmission system operator of France, uses high-level simulations of hourly temperature series to study the impact different climate scenarios have on electricity consumption, and hence on the French electrical power grid.


High Temporal Resolution Rainfall Runoff Modelling Using Long-Short-Term-Memory (LSTM) Networks

arXiv.org Machine Learning

Accurate and efficient models for rainfall runoff (RR) simulations are crucial for flood risk management. Most rainfall models in use today are process-driven; i.e. they solve either simplified empirical formulas or some variation of the St. Venant (shallow water) equations. With the development of machine-learning techniques, we may now be able to emulate rainfall models using, for example, neural networks. In this study, a data-driven RR model using a sequence-to-sequence Long-short-Term-Memory (LSTM) network was constructed. The model was tested for a watershed in Houston, TX, known for severe flood events. The LSTM network's capability in learning long-term dependencies between the input and output of the network allowed modeling RR with high resolution in time (15 minutes). Using 10-years precipitation from 153 rainfall gages and river channel discharge data (more than 5.3 million data points), and by designing several numerical tests the developed model performance in predicting river discharge was tested. The model results were also compared with the output of a process-driven model Gridded Surface Subsurface Hydrologic Analysis (GSSHA). Moreover, physical consistency of the LSTM model was explored. The model results showed that the LSTM model was able to efficiently predict discharge and achieve good model performance. When compared to GSSHA, the data-driven model was more efficient and robust in terms of prediction and calibration. Interestingly, the performance of the LSTM model improved (test Nash-Sutcliffe model efficiency from 0.666 to 0.942) when a selected subset of rainfall gages based on the model performance, were used as input instead of all rainfall gages.


Intelligent Arxiv: Sort daily papers by learning users topics preference

arXiv.org Machine Learning

Current daily paper releases are becoming increasingly large and areas of research are growing in diversity. This makes it harder for scientists to keep up to date with current state of the art and identify relevant work within their lines of interest. The goal of this article is to address this problem using Machine Learning techniques. We model a scientific paper to be built as a combination of different scientific knowledge from diverse topics into a new problem. In light of this, we implement the unsupervised Machine Learning technique of Latent Dirichlet Allocation (LDA) on the corpus of papers in a given field to: i) define and extract underlying topics in the corpus; ii) get the topics weight vector for each paper in the corpus; and iii) get the topics weight vector for new papers. By registering papers preferred by a user, we build a user vector of weights using the information of the vectors of the selected papers. Hence, by performing an inner product between the user vector and each paper in the daily Arxiv release, we can sort the papers according to the user preference on the underlying topics. We have created the website IArxiv.org where users can read sorted daily Arxiv releases (and more) while the algorithm learns each users preference, yielding a more accurate sorting every day. Current IArxiv.org version runs on Arxiv categories astro-ph, gr-qc, hep-ph and hep-th and we plan to extend to others. We propose several new useful and relevant implementations to be additionally developed as well as new Machine Learning techniques beyond LDA to further improve the accuracy of this new tool.


Application of independent component analysis and TOPSIS to deal with dependent criteria in multicriteria decision problems

arXiv.org Artificial Intelligence

A vast number of multicriteria decision making methods have been developed to deal with the problem of ranking a set of alternatives evaluated in a multicriteria fashion. Very often, these methods assume that the evaluation among criteria is statistically independent. However, in actual problems, the observed data may comprise dependent criteria, which, among other problems, may result in biased rankings. In order to deal with this issue, we propose a novel approach whose aim is to estimate, from the observed data, a set of independent latent criteria, which can be seen as an alternative representation of the original decision matrix. A central element of our approach is to formulate the decision problem as a blind source separation problem, which allows us to apply independent component analysis techniques to estimate the latent criteria. Moreover, we consider TOPSIS-based approaches to obtain the ranking of alternatives from the latent criteria. Results in both synthetic and actual data attest the relevance of the proposed approach.


Uncertainty Weighted Causal Graphs

arXiv.org Artificial Intelligence

Causality has traditionally been a scientific way to generate knowledge by relating causes to effects. From an imaginery point of view, causal graphs are a helpful tool for representing and infering new causal information. In previous works, we have generated automatically causal graphs associated to a given concept by analyzing sets of documents and extracting and representing the found causal information in that visual way. The retrieved information shows that causality is frequently imperfect rather than exact, feature gathered by the graph. In this work we will attempt to go a step further modelling the uncertainty in the graph through probabilistic improving the management of the imprecision in the quoted graph.


Natural Language Processing (NLP) Market to Reach USD 80.68 billion by 2026; Increasing Demand for Enhanced Algorithms to Boost Growth, says Fortune Business Insights

#artificialintelligence

Key Companies Covered in NLP Market Research Report are 3M Company, Adobe Systems Inc., Amazon Web Services Inc., Apple Inc., Google (Alphabet Inc.), Hewlett-Packard Enterprise Company, Intel Corporation, Microsoft Corporation, SAS Institute Inc., Other key market players The global Natural Language Processing (NLP) Market size is projected to reach USD 80.68 billion by 2026, thereby exhibiting a CAGR of 32.4% during the forecast period. This information is published by Fortune Business Insights, in a report, titled, "Natural Language Processing (NLP) Market Size, Share & Industry Analysis, By Deployment (On-Premises, Cloud, and Hybrid), By Technology (Interactive Voice Response (IVR), Optical Character Recognition (OCR), Text Analytics, Speech Analytics, Classification and Categorization, Pattern and Image Recognition, and Others), By Industry Vertical (Healthcare, Retail, High Tech and Telecom, BFSI, Automotive & Transportation, Advertising & Media, Manufacturing, and Others) and Regional Forecast, 2019-2026." The report further states that the market was USD 8.61 billion in 2018. It is set to gain momentum from the rising demand for big data, improved algorithms, and powerful computing. What Does the Report Contain?


Artificial Intelligence (AI) in battling the coronavirus - ELE Times

#artificialintelligence

Artificial Intelligence technology can today automatically mine through news reports and online content from around the world, helping experts recognize anomalies that could lead to a potential epidemic or, worse, a pandemic. In other words, our new AI overlords might actually help us survive the next plague. These new AI capabilities are on full display with the recent coronavirus outbreak, which was identified early by a Canadian firm called BlueDot, which is one of a number of companies that use data to evaluate public health risks. The company, which says it conducts "automated infectious disease surveillance," notified its customers about the new form of coronavirus at the end of December, days before both the US Centers for Disease Control and Prevention (CDC) and the World Health Organization (WHO) sent out official notices, as reported by Wired. Now nearing the end of January, the respiratory virus that's been linked to the city of Wuhan in China has already claimed the lives of more than 100 people.


A Speaker Verification Backend for Improved Calibration Performance across Varying Conditions

arXiv.org Machine Learning

In a recent work, we presented a discriminative backend for speaker verification that achieved good out-of-the-box calibration performance on most tested conditions containing varying levels of mismatch to the training conditions. This backend mimics the standard PLDA-based backend process used in most current speaker verification systems, including the calibration stage. All parameters of the backend are jointly trained to optimize the binary cross-entropy for the speaker verification task. Calibration robustness is achieved by making the parameters of the calibration stage a function of vectors representing the conditions of the signal, which are extracted using a model trained to predict condition labels. In this work, we propose a simplified version of this backend where the vectors used to compute the calibration parameters are estimated within the backend, without the need for a condition prediction model. We show that this simplified method provides similar performance to the previously proposed method while being simpler to implement, and having less requirements on the training data. Further, we provide an analysis of different aspects of the method including the effect of initialization, the nature of the vectors used to compute the calibration parameters, and the effect that the random seed and the number of training epochs has on performance. We also compare the proposed method with the trial-based calibration (TBC) method that, to our knowledge, was the state-of-the-art for achieving good calibration across varying conditions. We show that the proposed method outperforms TBC while also being several orders of magnitude faster to run, comparable to the standard PLDA baseline.


Privacy-Preserving Boosting in the Local Setting

arXiv.org Machine Learning

In machine learning, boosting is one of the most popular methods that designed to combine multiple base learners to a superior one. The well-known Boosted Decision Tree classifier, has been widely adopted in many areas. In the big data era, the data held by individual and entities, like personal images, browsing history and census information, are more likely to contain sensitive information. The privacy concern raises when such data leaves the hand of the owners and be further explored or mined. Such privacy issue demands that the machine learning algorithm should be privacy aware. Recently, Local Differential Privacy is proposed as an effective privacy protection approach, which offers a strong guarantee to the data owners, as the data is perturbed before any further usage, and the true values never leave the hands of the owners. Thus the machine learning algorithm with the private data instances is of great value and importance. In this paper, we are interested in developing the privacy-preserving boosting algorithm that a data user is allowed to build a classifier without knowing or deriving the exact value of each data samples. Our experiments demonstrate the effectiveness of the proposed boosting algorithm and the high utility of the learned classifiers.