AITopics

2102.10978

Country:

Asia > India (0.25)
North America > United States > New York (0.04)
Africa > Mozambique > Sofala Province > Beira (0.04)

Genre: Research Report (0.40)

Industry: Banking & Finance > Insurance (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Tran, Ngoc Mai, Buck, Johannes, Klüppelberg, Claudia

Causal Discovery of a River Network from its Extremes

arXiv.org Machine LearningFeb-11-2021

Causal inference for extremes has only be considered during the past few years. That observations of climate extremes such as floods, hurricanes, and droughts, but also man-made catastrophes like industry fire, terrorist attacks, or crashes of financial markets have been in the focus of research is convincingly documented in the journal Extremes. On the other hand, it is a fundamental problem to assess causality of risks. Often rare events are interconnected; for example, floods disseminate through a river network, and credit markets might fail due to some endogenous systemic risk propagation. Hence, it is necessary to not only understand dependencies between rare events, but also their causal structure.

colorado, node, qtree, (12 more...)

2102.06197

Country:

North America > United States > Colorado (0.12)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.40)

Industry:

Banking & Finance (0.74)
Law Enforcement & Public Safety > Terrorism (0.54)
Water & Waste Management > Water Management (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.48)
(2 more...)

Pavutnitskiy, Fedor, Ivanov, Sergei O., Abramov, Evgeny, Borovitskiy, Viacheslav, Klochkov, Artem, Vialov, Viktor, Zaikovskii, Anatolii, Petiushko, Aleksandr

Quadric hypersurface intersection for manifold learning in feature space

arXiv.org Machine LearningFeb-11-2021

The knowledge that data lies close to a particular submanifold of the ambient Euclidean space may be useful in a number of ways. For instance, one may want to automatically mark any point far away from the submanifold as an outlier, or to use its geodesic distance to measure similarity between points. Classical problems for manifold learning are often posed in a very high dimension, e.g. for spaces of images or spaces of representations of words. Today, with deep representation learning on the rise in areas such as computer vision and natural language processing, many problems of this kind may be transformed into problems of moderately high dimension, typically of the order of hundreds. Motivated by this, we propose a manifold learning technique suitable for moderately high dimension and large datasets. The manifold is learned from the training data in the form of an intersection of quadric hypersurfaces -- simple but expressive objects. At test time, this manifold can be used to introduce an outlier score for arbitrary new points and to improve a given similarity metric by incorporating learned geometric structure into it.

intersection, manifold, quadric hypersurface intersection, (14 more...)

2102.06186

Country:

Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
Asia > Russia (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.50)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.69)
Information Technology > Artificial Intelligence > Vision > Face Recognition (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Konstantinov, Nikola, Lampert, Christoph H.

Fairness-Aware Learning from Corrupted Data

arXiv.org Machine LearningFeb-11-2021

Addressing fairness concerns about machine learning models is a crucial step towards their long-term adoption in real-world automated systems. While many approaches have been developed for training fair models from data, little is known about the effects of data corruption on these methods. In this work we consider fairness-aware learning under arbitrary data manipulations. We show that an adversary can force any learner to return a biased classifier, with or without degrading accuracy, and that the strength of this bias increases for learning problems with underrepresented protected groups in the data. We also provide upper bounds that match these hardness results up to constant factors, by proving that two natural learning algorithms achieve order-optimal guarantees in terms of both accuracy and fairness under adversarial data manipulations.

adversary, fairness-aware learning, probability, (13 more...)

2102.06004

Country:

Europe > Austria (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.46)

Nassif, Ali Bou, Shahin, Ismail, Hamsa, Shibani, Nemmour, Nawel, Hirose, Keikichi

CASA-Based Speaker Identification Using Cascaded GMM-CNN Classifier in Noisy and Emotional Talking Conditions

arXiv.org Artificial IntelligenceFeb-11-2021

This work aims at intensifying text-independent speaker identification performance in real application situations such as noisy and emotional talking conditions. This is achieved by incorporating two different modules: a Computational Auditory Scene Analysis CASA based pre-processing module for noise reduction and cascaded Gaussian Mixture Model Convolutional Neural Network GMM-CNN classifier for speaker identification followed by emotion recognition. This research proposes and evaluates a novel algorithm to improve the accuracy of speaker identification in emotional and highly-noise susceptible conditions. Experiments demonstrate that the proposed model yields promising results in comparison with other classifiers when Speech Under Simulated and Actual Stress SUSAS database, Emirati Speech Database ESD, the Ryerson Audio-Visual Database of Emotional Speech and Song RAVDESS database and the Fluent Speech Commands database are used in a noisy environment.

classifier, identification, speaker identification, (14 more...)

doi: 10.1016/j.asoc.2021.107141

2102.05894

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Asia > Middle East > UAE > Sharjah Emirate > Sharjah (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Finland > North Karelia > Joensuu (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Speech > Acoustic Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)
(2 more...)

Barbano, Carlo Alberto, Perlo, Daniele, Tartaglione, Enzo, Fiandrotti, Attilio, Bertero, Luca, Cassoni, Paola, Grangetto, Marco

UniToPatho, a labeled histopathological dataset for colorectal polyps classification and adenoma dysplasia grading

arXiv.org Artificial IntelligenceFeb-10-2021

Histopathological characterization of colorectal polyps allows to tailor patients' management and follow up with the ultimate aim of avoiding or promptly detecting an invasive carcinoma. Colorectal polyps characterization relies on the histological analysis of tissue samples to determine the polyps malignancy and dysplasia grade. Deep neural networks achieve outstanding accuracy in medical patterns recognition, however they require large sets of annotated training images. We introduce UniToPatho, an annotated dataset of 9536 hematoxylin and eosin (H&E) stained patches extracted from 292 whole-slide images, meant for training deep neural networks for colorectal polyps classification and adenomas grading. We present our dataset and provide insights on how to tackle the problem of automatic colorectal polyps characterization.

artificial intelligence, classifier, machine learning, (17 more...)

doi: 10.1109/ICIP42928.2021.9506198

2101.09991

Country:

Europe > Italy > Piedmont > Turin Province > Turin (0.14)
Europe > United Kingdom > England (0.04)
Europe > Ireland (0.04)
Asia (0.04)

Genre: Research Report (0.64)

Industry: Health & Medicine > Therapeutic Area > Oncology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.95)

arXiv.org Machine LearningFeb-10-2021

Causal Inference for Time series Analysis: Problems, Methods and Evaluation

Moraffah, Raha, Sheth, Paras, Karami, Mansooreh, Bhattacharya, Anchit, Wang, Qianru, Tahir, Anique, Raglin, Adrienne, Liu, Huan

Time series data is a collection of chronological observations which is generated by several domains such as medical and financial fields. Over the years, different tasks such as classification, forecasting, and clustering have been proposed to analyze this type of data. Time series data has been also used to study the effect of interventions over time. Moreover, in many fields of science, learning the causal structure of dynamic systems and time series data is considered an interesting task which plays an important role in scientific discoveries. Estimating the effect of an intervention and identifying the causal relations from the data can be performed via causal inference. Existing surveys on time series discuss traditional tasks such as classification and forecasting or explain the details of the approaches proposed to solve a specific task. In this paper, we focus on two causal inference tasks, i.e., treatment effect estimation and causal discovery for time series data, and provide a comprehensive review of the approaches in each task. Furthermore, we curate a list of commonly used evaluation metrics and datasets for each task and provide in-depth insight. These metrics and datasets can serve as benchmarks for research in the field.

dataset, time series data, time sery, (11 more...)

2102.05829

Country:

North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > Wisconsin (0.04)
(15 more...)

Genre:

Research Report > Experimental Study (1.00)
Overview (1.00)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Government (1.00)
Transportation (0.67)
Banking & Finance > Economy (0.67)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(4 more...)

Singh, Bharat, Najibi, Mahyar, Sharma, Abhishek, Davis, Larry S.

Scale Normalized Image Pyramids with AutoFocus for Object Detection

arXiv.org Artificial IntelligenceFeb-10-2021

We present an efficient foveal framework to perform object detection. A scale normalized image pyramid (SNIP) is generated that, like human vision, only attends to objects within a fixed size range at different scales. Such a restriction of objects' size during training affords better learning of object-sensitive filters, and therefore, results in better accuracy. However, the use of an image pyramid increases the computational cost. Hence, we propose an efficient spatial sub-sampling scheme which only operates on fixed-size sub-regions likely to contain objects (as object locations are known during training). The resulting approach, referred to as Scale Normalized Image Pyramid with Efficient Resampling or SNIPER, yields up to 3 times speed-up during training. Unfortunately, as object locations are unknown during inference, the entire image pyramid still needs processing. To this end, we adopt a coarse-to-fine approach, and predict the locations and extent of object-like regions which will be processed in successive scales of the image pyramid. Intuitively, it's akin to our active human-vision that first skims over the field-of-view to spot interesting regions for further processing and only recognizes objects at the right resolution. The resulting algorithm is referred to as AutoFocus and results in a 2.5-5 times speed-up during inference when used with SNIP.

detection, detector, resolution, (13 more...)

2102.05646

Country:

North America > United States > Maryland > Prince George's County > College Park (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(2 more...)

Genre: Research Report > New Finding (0.67)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Hardt, Moritz, Recht, Benjamin

Patterns, predictions, and actions: A story about machine learning

arXiv.org Machine LearningFeb-9-2021

This graduate textbook on machine learning tells a story of how patterns in data support predictions and consequential actions. Starting with the foundations of decision making, we cover representation, optimization, and generalization as the constituents of supervised learning. A chapter on datasets as benchmarks examines their histories and scientific bases. Self-contained introductions to causality, the practice of causal inference, sequential decision making, and reinforcement learning equip the reader with concepts and tools to reason about actions and their consequences. Throughout, the text discusses historical context and societal impact. We invite readers from all backgrounds; some experience with probability, calculus, and linear algebra suffices.

law enforcement, simultaneous perturbation gradient approximation, vascular disease, (35 more...)

2102.05242

Country:

North America > United States > California (0.67)
Europe > United Kingdom > England (0.28)
North America > United States > Texas (0.13)
North America > United States > Pennsylvania (0.13)

Genre:

Workflow (1.00)
Summary/Review (1.00)
Research Report > New Finding (1.00)
(4 more...)

Industry:

Law > Criminal Law (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
(10 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
(10 more...)

Liua, Xiaofan, Zhanga, Zuoquan, Wanga, Di

Classification of Imbalanced Credit scoring data sets Based on Ensemble Method with the Weighted-Hybrid-Sampling

arXiv.org Machine LearningFeb-9-2021

In the era of big data, the utilization of credit-scoring models to determine the credit risk of applicants accurately becomes a trend in the future. The conventional machine learning on credit scoring data sets tends to have poor classification for the minority class, which may bring huge commercial harm to banks. In order to classify imbalanced data sets, we propose a new ensemble algorithm, namely, Weighted-Hybrid-Sampling-Boost (WHSBoost). In data sampling, we process the imbalanced data sets with weights by the Weighted-SMOTE method and the Weighted-Under-Sampling method, and thus obtain a balanced training sample data set with equal weight. In ensemble algorithm, each time we train the base classifier, the balanced data set is given by the method above. In order to verify the applicability and robustness of the WHSBoost algorithm, we performed experiments on the simulation data sets, real benchmark data sets and real credit scoring data sets, comparing WHSBoost with SMOTE, SMOTEBoost and HSBoost based on SVM, BPNN, DT and KNN.

algorithm, base classifier, classifier, (16 more...)

2102.04721

Country:

Asia > China > Beijing > Beijing (0.04)
South America > Uruguay > Maldonado > Maldonado (0.04)
North America > United States > Wisconsin (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry: Banking & Finance > Credit (1.00)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)