Country
Automatic Gesture Recognition in Robot-assisted Surgery with Reinforcement Learning and Tree Search
Gao, Xiaojie, Jin, Yueming, Dou, Qi, Heng, Pheng-Ann
Automatic surgical gesture recognition is fundamental for improving intelligence in robot-assisted surgery, such as conducting complicated tasks of surgery surveillance and skill evaluation. However, current methods treat each frame individually and produce the outcomes without effective consideration on future information. In this paper, we propose a framework based on reinforcement learning and tree search for joint surgical gesture segmentation and classification. An agent is trained to segment and classify the surgical video in a human-like manner whose direct decisions are re-considered by tree search appropriately. Our proposed tree search algorithm unites the outputs from two designed neural networks, i.e., policy and value network. With the integration of complementary information from distinct models, our framework is able to achieve the better performance than baseline methods using either of the neural networks. For an overall evaluation, our developed approach consistently outperforms the existing methods on the suturing task of JIGSAWS dataset in terms of accuracy, edit score and F1 score. Our study highlights the utilization of tree search to refine actions in reinforcement learning framework for surgical robotic applications.
Detecting Code Clones with Graph Neural Networkand Flow-Augmented Abstract Syntax Tree
Wang, Wenhan, Li, Ge, Ma, Bo, Xia, Xin, Jin, Zhi
Code clones are semantically similar code fragments pairs that are syntactically similar or different. Detection of code clones can help to reduce the cost of software maintenance and prevent bugs. Numerous approaches of detecting code clones have been proposed previously, but most of them focus on detecting syntactic clones and do not work well on semantic clones with different syntactic features. To detect semantic clones, researchers have tried to adopt deep learning for code clone detection to automatically learn latent semantic features from data. Especially, to leverage grammar information, several approaches used abstract syntax trees (AST) as input and achieved significant progress on code clone benchmarks in various programming languages. However, these AST-based approaches still can not fully leverage the structural information of code fragments, especially semantic information such as control flow and data flow. To leverage control and data flow information, in this paper, we build a graph representation of programs called flow-augmented abstract syntax tree (FA-AST). We construct FA-AST by augmenting original ASTs with explicit control and data flow edges. Then we apply two different types of graph neural networks (GNN) on FA-AST to measure the similarity of code pairs. As far as we have concerned, we are the first to apply graph neural networks on the domain of code clone detection. We apply our FA-AST and graph neural networks on two Java datasets: Google Code Jam and BigCloneBench. Our approach outperforms the state-of-the-art approaches on both Google Code Jam and BigCloneBench tasks.
Optimizing Black-box Metrics with Adaptive Surrogates
Jiang, Qijia, Adigun, Olaoluwa, Narasimhan, Harikrishna, Fard, Mahdi Milani, Gupta, Maya
We address the problem of training models with black-box and hard-to-optimize metrics by expressing the metric as a monotonic function of a small number of easy-to-optimize surrogates. We pose the training problem as an optimization over a relaxed surrogate space, which we solve by estimating local gradients for the metric and performing inexact convex projections. We analyze gradient estimates based on finite differences and local linear interpolations, and show convergence of our approach under smoothness assumptions with respect to the surrogates. Experimental results on classification and ranking problems verify the proposal performs on par with methods that know the mathematical formulation, and adds notable value when the form of the metric is unknown.
MEUZZ: Smart Seed Scheduling for Hybrid Fuzzing
Chen, Yaohui, Ahmadi, Mansour, farkhani, Reza Mirzazade, Wang, Boyu, Lu, Long
Seed scheduling is a prominent factor in determining the yields of hybrid fuzzing. Existing hybrid fuzzers schedule seeds based on fixed heuristics that aim to predict input utilities. However, such heuristics are not generalizable as there exists no one-size-fits-all rule applicable to different programs. They may work well on the programs from which they were derived, but not others. To overcome this problem, we design a Machine learning-Enhanced hybrid fUZZing system (MEUZZ), which employs supervised machine learning for adaptive and generalizable seed scheduling. MEUZZ determines which new seeds are expected to produce better fuzzing yields based on the knowledge learned from past seed scheduling decisions made on the same or similar programs. MEUZZ's learning is based on a series of features extracted via code reachability and dynamic analysis, which incurs negligible runtime overhead (in microseconds). Moreover, MEUZZ automatically infers the data labels by evaluating the fuzzing performance of each selected seed. As a result, MEUZZ is generally applicable to, and performs well on, various kinds of programs. Our evaluation shows MEUZZ significantly outperforms the state-of-the-art grey-box and hybrid fuzzers, achieving 27.1% more code coverage than QSYM. The learned models are reusable and transferable, which boosts fuzzing performance by 7.1% on average and improves 68% of the 56 cross-program fuzzing campaigns. MEUZZ discovered 47 deeply hidden and previously unknown bugs--with 21 confirmed and fixed by the developers--when fuzzing 8 well-tested programs with the same configurations as used in previous work.
A One-to-One Correspondence between Natural Numbers and Binary Trees
Skliar, Osvaldo, Gapper, Sherry, Monge, Ricardo E.
A characterization is provided for each natural number except one (1) by means of an ordered pair of elements. The first element is a natural number called the type of the natural number characterized, and the second is a natural number called the order of the number characterized within those of its type. A one-to-one correspondence is specified between the set of binary trees such that a) a given node has no child nodes (that is, it is a terminal node), or b) it has exactly two child nodes. Thus, binary trees such that one of their parent nodes has only one child node are excluded from the set considered here.
Adversarial Filters of Dataset Biases
Bras, Ronan Le, Swayamdipta, Swabha, Bhagavatula, Chandra, Zellers, Rowan, Peters, Matthew E., Sabharwal, Ashish, Choi, Yejin
Large neural models have demonstrated human-level performance on language and vision benchmarks such as ImageNet and Stanford Natural Language Inference (SNLI). Yet, their performance degrades considerably when tested on adversarial or out-of-distribution samples. This raises the question of whether these models have learned to solve a dataset rather than the underlying task by overfitting on spurious dataset biases. We investigate one recently proposed approach, AFLite, which adversarially filters such dataset biases, as a means to mitigate the prevalent overestimation of machine performance. We provide a theoretical understanding for AFLite, by situating it in the generalized framework for optimum bias reduction. Our experiments show that as a result of the substantial reduction of these biases, models trained on the filtered datasets yield better generalization to out-of-distribution tasks, especially when the benchmarks used for training are over-populated with biased samples. We show that AFLite is broadly applicable to a variety of both real and synthetic datasets for reduction of measurable dataset biases and provide extensive supporting analyses. Finally, filtering results in a large drop in model performance (e.g., from 92% to 63% for SNLI), while human performance still remains high. Our work thus shows that such filtered datasets can pose new research challenges for robust generalization by serving as upgraded benchmarks.
`Why not give this work to them?' Explaining AI-Moderated Task-Allocation Outcomes using Negotiation Trees
Zahedi, Zahra, Sengupta, Sailik, Kambhampati, Subbarao
The problem of multi-agent task allocation arises in a variety of scenarios involving human teams. In many such settings, human teammates may act with selfish motives and try to minimize their cost metrics. In the absence of (1) complete knowledge about the reward of other agents and (2) the team's overall cost associated with a particular allocation outcome, distributed algorithms can only arrive at sub-optimal solutions within a reasonable amount of time. To address these challenges, we introduce the notion of an AI Task Allocator (AITA) that, with complete knowledge, comes up with fair allocations that strike a balance between the individual human costs and the team's performance cost. To ensure that AITA is explicable to the humans, we allow each human agent to question AITA's proposed allocation with counterfactual allocations. In response, we design AITA to provide a replay negotiation tree that acts as an explanation showing why the counterfactual allocation, with the correct costs, will eventually result in a sub-optimal allocation. This explanation also updates a human's incomplete knowledge about their teammate's and the team's actual costs. We then investigate whether humans are (1) able to understand the explanations provided and (2) convinced by it using human factor studies. Finally, we show the effect of various kinds of incompleteness on the length of explanations. We conclude that underestimation of other's costs often leads to the need for explanations and in turn, longer explanations on average.
Precise neural network computation with imprecise analog devices
Binas, Jonathan, Neil, Daniel, Indiveri, Giacomo, Liu, Shih-Chii, Pfeiffer, Michael
The operations used for neural network computation map favorably onto simple analog circuits, which outshine their digital counterparts in terms of compactness and efficiency. Nevertheless, such implementations have been largely supplanted by digital designs, partly because of device mismatch effects due to material and fabrication imperfections. We propose a framework that exploits the power of deep learning to compensate for this mismatch by incorporating the measured device variations as constraints in the neural network training process. This eliminates the need for mismatch minimization strategies and allows circuit complexity and power-consumption to be reduced to a minimum. Our results, based on largescale simulations as well as a prototype VLSI chip implementation indicate a processing efficiency comparable to current state-of-art digital implementations. This method is suitable for future technology based on nanodevices with large variability, such as memristive arrays. The growing need for computing power has led to the exploration of computing technologies beyond the predominant von Neumann architecture. In particular, due to the separation of memory and processing elements, traditional computing systems experience a bottleneck when dealing with problems involving great amounts of high-dimensional data [4, 25], such as image processing, probabilistic inference, or speech recognition. These problems are often best tackled by conceptually simple but powerful and highly parallel models, such as deep neural networks (DNNs), which have delivered state-of-the-art performance on exactly those applications [29]. The fact that DNNs are characterized by stereotypical and simple operations at each unit, which often can be performed in parallel, makes them compatible with the processing style of graphics processing units (GPUs) [46]. The large computational demands of DNNs have simultaneously sparked interest in methods that make neural network inference faster and more power efficient, whether through new algorithmic inventions [19, 22, 12], dedicated digital hardware implementations [6, 17, 9, 34], or by taking inspiration from real nervous systems [15, 38, 33, 24, 40].
One week of unhealthy eating could 'damage a part of the brain which normally stops us eating MORE'
Eating a diet of junk food for just one week was enough to damage part of the brain that stops us eating more when we are already full, research suggests. Study participants who ate an abundance of fast food and high-fat milkshakes had increased cravings for more after seven days. They performed worse on cognitive tests, with results suggesting an area of the brain called the hippocampus was impaired. The hippocampus normally stops us from gorging on more food when we are full by suppressing memories of how tasty it is. When it's not working properly, the memories are more powerful and we are left unable to resist more cake, chocolates and crisps in front of us, the researchers believe.
Leiden University's computer algorithm spots ELEVEN asteroids that could hit Earth
A computer algorithm from Leiden University in the Netherlands has spotted eleven asteroids that could eventually hit Earth and cause'unprecedented devastation'. All were missed by NASA software thanks to their chaotic orbits, which are difficult for current techniques to predict and identify as being potentially dangerous. Each are more than 328 feet (100 metres) in diameter and will pass closer to our planet than ten times the distance between the Earth and the moon. For comparison, the Tunguska object which flattened 772 square miles of forest in Siberia had a diameter of around 164–262 feet (50–80 metres). However, these space rocks won't pose a threat in our lifetime, however -- for they will only get worryingly near to Earth between the years 2131 and 2923.