Goto

Collaborating Authors

 South America


Pull out all the stops: Textual analysis via punctuation sequences

arXiv.org Artificial Intelligence

Whether enjoying the lucid prose of a favorite author or slogging through some other writer's cumbersome, heavy-set prattle (full of parentheses, em dashes, compound adjectives, and Oxford commas), readers will notice stylistic signatures not only in word choice and grammar, but also in punctuation itself. Indeed, visual sequences of punctuation from different authors produce marvelously different (and visually striking) sequences. Punctuation is a largely overlooked stylistic feature in "stylometry", the quantitative analysis of written text. In this paper, we examine punctuation sequences in a corpus of literary documents and ask the following questions: Are the properties of such sequences a distinctive feature of different authors? Is it possible to distinguish literary genres based on their punctuation sequences? Do the punctuation styles of authors evolve over time? Are we on to something interesting in trying to do stylometry without words, or are we full of sound and fury (signifying nothing)?


Distributed, partially collapsed MCMC for Bayesian Nonparametrics

arXiv.org Machine Learning

Bayesian nonparametric (BNP) models provide elegant methods for discovering underlying latent features within a data set, but inference in such models can be slow. We exploit the fact that completely random measures, which commonly used models like the Dirichlet process and the beta-Bernoulli process can be expressed as, are decomposable into independent sub-measures. We use this decomposition to partition the latent measure into a finite measure containing only in-stantiated components, and an infinite measure containing all other components. We then select different inference algorithms for the two components: uncollapsed samplers mix well on the finite measure, while collapsed samplers mix well on the infinite, sparsely occupied tail. The resulting hybrid algorithm can be applied to a wide class of models, and can be easily distributed to allow scalable inference without sacrificing asymptotic convergence guarantees.


Hydra: Preserving Ensemble Diversity for Model Distillation

arXiv.org Machine Learning

Ensembles of models have been empirically shown to improve predictive performance and to yield robust measures of uncertainty. However, they are expensive in computation and memory. Therefore, recent research has focused on distilling ensembles into a single compact model, reducing the computational and memory burden of the ensemble while trying to preserve its predictive behavior. Most existing distillation formulations summarize the ensemble by capturing its average predictions. As a result, the diversity of the ensemble predictions, stemming from each individual member, is lost. Thus, the distilled model cannot provide a measure of uncertainty comparable to that of the original ensemble. To retain more faithfully the diversity of the ensemble, we propose a distillation method based on a single multi-headed neural network, which we refer to as Hydra. The shared body network learns a joint feature representation that enables each head to capture the predictive behavior of each ensemble member. We demonstrate that with a slight increase in parameter count, Hydra improves distillation performance on classification and regression settings while capturing the uncertainty behaviour of the original ensemble over both in-domain and out-of-distribution tasks.


Monte Carlo Game Solver

arXiv.org Artificial Intelligence

W e present a general algorithm to order moves so as to speedup exact game solvers. It uses online learning of playout policies and Monte Carlo Tree Search. The learned policy and the information in the Monte Carlo tree are used to order moves in game solvers. They improve greatly the solving time for multiple games.


Domain Adaption for Knowledge Tracing

arXiv.org Artificial Intelligence

With the rapid development of online education system, knowledge tracing which aims at predicting students' knowledge state is becoming a critical and fundamental task in personalized education. Traditionally, existing methods are domain-specified. However, there are a larger number of domains (e.g., subjects, schools) in the real world and the lacking of data in some domains, how to utilize the knowledge and information in other domains to help train a knowledge tracing model for target domains is increasingly important. We refer to this problem as domain adaptation for knowledge tracing (DAKT) which contains two aspects: (1) how to achieve great knowledge tracing performance in each domain. (2) how to transfer good performed knowledge tracing model between domains. To this end, in this paper, we propose a novel adaptable framework, namely adaptable knowledge tracing (AKT) to address the DAKT problem. Specifically, for the first aspect, we incorporate the educational characteristics (e.g., slip, guess, question texts) based on the deep knowledge tracing (DKT) to obtain a good performed knowledge tracing model. For the second aspect, we propose and adopt three domain adaptation processes. First, we pre-train an auto-encoder to select useful source instances for target model training. Second, we minimize the domain-specific knowledge state distribution discrepancy under maximum mean discrepancy (MMD) measurement to achieve domain adaptation. Third, we adopt fine-tuning to deal with the problem that the output dimension of source and target domain are different to make the model suitable for target domains. Extensive experimental results on two private datasets and seven public datasets clearly prove the effectiveness of AKT for great knowledge tracing performance and its superior transferable ability.


Spectroscopy and Chemometrics News Weekly #2, 2020

#artificialintelligence

NIRSpectroscopy NIRS Sensors NearInfrared Analyzers DigitalTransformation QualityControl foodtech machinelearning AI datascience LINK SAFE COST IN MAINTAINING NIR-SPECTROSCOPY METHODS NIRSpectroscopy NIRS Spectroscopy DigitalTransformation Analysis Lab Laboratory Application Quantitative Analysis Methods Measurements Analytical Parameters Spectrometer Quality Accuracy LINK Do you develop NIR / NIRS calibrations by yourself? Check out their product page … link Get the Chemometrics and Spectroscopy News in real time on Twitter @ CalibModel and follow us. Near Infrared "Study of chemical compound spatial distribution in biodegradable active films using NIR hyperspectral imaging and multivariate curve resolution" LINK "Advances in Near Infrared Spectroscopy and Related Computational Methods" MDPI Books – Pages: 496 OpenAccess NIRSpectroscopy NIRS NIR LINK " Ampliación de una librería espectral de mezclas unifeed analizadas en un instrumento NIRS de laboratorio" LINK "Applied Sciences, Vol. 9, Pages 5058: Single-Kernel FT-NIR Spectroscopy for Detecting Maturity of Cucumber Seeds Using a Multiclass Hierarchical Classification Strategy" LINK " Visible-near Infrared (VIS-NIR) Spectroscopy as a Rapid Measurement Tool to Assess the Effect of Tillage on Oil Contaminated Sites" LINK "Non-invasive measurements of'Yunhe'pears by vis-NIRS technology coupled with deviation fusion modeling approach" LINK "Standard Analytical Methods, Sensory Evaluation, NIRS and Electronic Tongue for Sensing Taste Attributes of Different Melon Varieties." LINK "Control of ascorbic acid in fortified powdered soft drinks using near-infrared spectroscopy (NIRS) and multivariate analysis" LINK "Prediction Model of the Key Components for Lodging Resistance in Rapeseed Stalk Using Near-Infrared Reflectance Spectroscopy (NIRS)" LINK "NIR spectroscopic determination of urine components in spot urine: preliminary investigation towards optical point-of-care test." LINK "O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression."


Tel Aviv start-up gets FDA approval for 'stroke of genius' AI package

#artificialintelligence

Tel-Aviv based start-up Aidoc, a leading provider of Artificial Intelligence solutions for radiologists, received US Food and Drug Administration (FDA) clearance for its AI solution that spots strokes (Large-Vessel Occlusion) in the brain during head CTA scans.An LVO is the blockage of vessels in the brain, and according to Ariella Shoham, Aidoc's vice president of marketing, the AI technology "uses deep learning to automatically look at every head CT before a patient has even left the imaging room. "It investigates the images to see if they show blocked blood vessels in the brain or bleeding (intracranial hemorrhages)," she explained. "If one of these time-critical conditions is found, Aidoc re-prioritizes the worklists of radiologists so that the urgent scan is looked at immediately and the patient can be treated quickly."Shoham said that Aidoc already received FDA clearances to identify and flag pulmonary embolism (blockages in the lungs) and cervical spine fractures (broken neck). "Other Aidoc solutions currently in clinical testing include identifying air in the abdomen," she continued. "Altogether, Aidoc is targeting the most common critical life-threatening conditions that make up 80% of all urgent cases on CT scans.


Brazil is emerging as a world-class AI innovation hub

#artificialintelligence

Brazil's government has big plans for AI, despite having come late to the party. In Oxford Insights' AI Readiness Index 2019, Brazil was ranked 40 out of 192 countries, a sign that the South American powerhouse is moving up in the AI world. The report looks at how ready countries are to take advantage of the AI technologies PwC forecasts will add $15 trillion to the global economy by 2030. The 2019 report also cautions that the "Global South could be left behind by the so-called fourth industrial revolution." But even as some of the planet's richest nations, including Canada, China, Germany, Japan, Singapore, and the U.S, have become recognized AI innovation hubs, according to studies by Deloitte and others, South America -- led by Brazil -- is rapidly emerging as a leader in AI-enabled businesses.


Applying Gene Expression Programming for Solving One-Dimensional Bin-Packing Problems

arXiv.org Artificial Intelligence

This work aims to study and explore the use of Gene Expression Programming (GEP) in solving the on-line Bin-Packing problem. The main idea is to show how GEP can automatically find acceptable heuristic rules to solve the problem efficiently and economically. One dimensional Bin-Packing problem is considered in the course of this work with the constraint of minimizing the number of bins filled with the given pieces. Experimental Data includes instances of benchmark test data taken from Falkenauer (1996) for One-dimensional Bin-Packing Problems. Results show that GEP can be used as a very powerful and flexible tool for finding interesting compact rules suited for the problem. The impact of functions is also investigated to show how they can affect and influence the success of rates when they appear in rules. High success rates are gained with smaller population size and fewer generations compared to previous work performed using Genetic Programming.


Maximal Closed Set and Half-Space Separations in Finite Closure Systems

arXiv.org Artificial Intelligence

We investigate some algorithmic properties of closed set and half-space separation in abstract closure systems. Assuming that the underlying closure system is finite and given by the corresponding closure operator, we show that the half-space separation problem is NP-complete. In contrast, for the relaxed problem of maximal closed set separation we give a greedy algorithm using linear number of queries (i.e., closure operator calls) and show that this bound is sharp. For a second direction to overcome the negative result above, we consider Kakutani closure systems and prove that they are algorithmically characterized by the greedy algorithm. As one of the major potential application fields, we then focus on Kakutani closure systems over graphs and generalize a fundamental characterization result based on the Pasch axiom to graph structured partitioning of finite sets. In addition, we give a sufficient condition for Kakutani closure systems over graphs in terms of graph minors. For a second application field, we consider closure systems over finite lattices, present an adaptation of the generic greedy algorithm to this kind of closure systems, and consider two potential applications. We show that for the special case of subset lattices over finite ground sets, e.g., for formal concept lattices, its query complexity is only logarithmic in the size of the lattice. The second application is concerned with finite subsumption lattices in inductive logic programming. We show that our method for separating two sets of first-order clauses from each other extends the traditional approach based on least general generalizations of first-order clauses. Though our primary focus is on the generality of the results obtained, we experimentally demonstrate the practical usefulness of the greedy algorithm on binary classification problems in Kakutani and non-Kakutani closure systems.