Goto

Collaborating Authors

 time slower


1704fe7aaff33a54802b83a016050ab8-Supplemental-Conference.pdf

Neural Information Processing Systems

Neural Machine Translation: Fairseq has MITLicense. All experiments are implemented on Pytorch which has BSDLicense. Other assets that we use have no license. Image Classification: Here we provide some extra details of our experiments. From the results in Table 3, we can see that SGDHess achieves the best accuracy among all optimizers.




Efficient algorithms for implementing incremental proximal-point methods

arXiv.org Artificial Intelligence

Model training algorithms which observe a small portion of the training set in each computational step are ubiquitous in practical machine learning, and include both stochastic and online optimization methods. In the vast majority of cases, such algorithms typically observe the training samples via the gradients of the cost functions the samples incur. Thus, these methods exploit are the slope of the cost functions via their first-order approximations. To address limitations of gradient-based methods, such as sensitivity to step-size choice in the stochastic setting, or inability to use small function variability in the online setting, several streams of research attempt to exploit more information about the cost functions than just their gradients via the well-known proximal operators. However, implementing such methods in practice poses a challenge, since each iteration step boils down to computing the proximal operator, which may not be easy. In this work we devise a novel algorithmic framework, which exploits convex duality theory to achieve both algorithmic efficiency and software modularity of proximal operator implementations, in order to make experimentation with incremental proximal optimization algorithms accessible to a larger audience of researchers and practitioners, by reducing the gap between their theoretical description in research papers and their use in practice. We provide a reference Python implementation for the framework developed in this paper as an open source library at on https://github.com/alexshtf/inc_prox_pt/releases/tag/prox_pt_paper, along with examples which demonstrate our implementation on a variety of problems, and reproduce the numerical experiments in this paper. The pure Python reference implementation is not necessarily the most efficient, but is a basis for creating efficient implementations by combining Python with a native backend.


3D Imaging Shows How Shark Guts Work Like a Tesla Valve

WIRED

In 1920, Serbian-born inventor Nikola Tesla designed and patented what he called a "valvular conduit": a pipe whose internal design ensures that fluid will flow in one preferred direction, with no need for moving parts, making it ideal for microfluidics applications, among other uses. According to a recent paper published in the Proceedings of the Royal Society B, the Tesla valve also provides a useful model for how food moves through the digestive system of many species of shark. Based on new CT scans of shark intestines, scientists have concluded that the intestines are naturally occurring Tesla valves. This story originally appeared on Ars Technica, a trusted source for technology news, tech policy analysis, reviews, and more. Ars is owned by WIRED's parent company, Condรฉ Nast.


DJI Mavic Air 2 Review: The Best Drone for Taking Photos and Videos

WIRED

DJI's new Mavic Air 2 folding-style drone is a huge improvement over the previous model--so much so that for most people, this is the perfect drone. The Mavic Air 2 is the middle child in DJI's consumer drone lineup, sitting between the smaller, lighter, but less capable Mavic Mini, and the more powerful, more capable, but also more expensive, Mavic 2. If you're just getting started with drones, the less expensive Mavic Mini (8/10 WIRED Recommends)--my previous top pick for most people--might be a better buy. That said, the Air 2 offers better collision avoidance systems, higher quality photos and video, and a wide assortment of automated flight features that newcomers and seasoned vets alike can appreciate. The Mavic Air 2 is slightly bigger than its predecessor, at least on paper. The folding design remains compact, and at 1.3 pounds, the drone is plenty portable.


Customer churn classification using predictive machine learning models - WebSystemer.no

#artificialintelligence

Metis Data Science Bootcamp has been rigorous, and this is my third project. The goal is to predict customer churn in a Telecommunication company. Customer attrition, customer turnover, or customer defection -- they all refer to the loss of clients or customers, ie, churn. This can be due to voluntary reasons (by choice) or involuntary reasons (for example relocation). In this article, we will explore 8 predictive analytic models to assess customers' propensity or risk to churn.


The Stanford Natural Language Processing Group

@machinelearnbot

That model is fairly slow. Essentially, that model is trying to pull out all stops to maximize tagger accuracy. Speed consequently suffers due to choices like using 4th order bidirectional tag conditioning. It's nearly as accurate (96.97% accuracy vs. 97.32% on the standard WSJ22-24 test set) and is an order of magnitude faster. Compared to MXPOST, the Stanford POS Tagger with this model is both more accurate and considerably faster. It all depends, but on a 2008 nothing-special Intel server, it tags about 15000 words per second.


Why is Vlookup (in Excel) 1,000 times slower than hash tables in Python?

@machinelearnbot

The easy answer is that excel is a resource hog & despite being a tremendously powerful tool for mining small datasets, when you start to push past the traditional 65565 rows, you start to move into realms where Microsoft is traditionally not good (memory handling, i/o management, efficient processing). First a few questions: 1. Cardinality: Were both tables internally unique? Excel bogs down on cartesian products in my experience -- you need one to many or one to one matches. Am assuming you ran a pivot table on both counting the unique instances of each email address, comparing the row count in the table with the grand total (both should be the same). Am assuming that this answer is going to be yes across the board, since you're probably a really good excel jockey.