Collaborating Authors


Multiple Patients Behavior Detection in Real-time using mmWave Radar and Deep CNNs Machine Learning

To address potential gaps noted in patient monitoring in the hospital, a novel patient behavior detection system using mmWave radar and deep convolution neural network (CNN), which supports the simultaneous recognition of multiple patients' behaviors in real-time, is proposed. In this study, we use an mmWave radar to track multiple patients and detect the scattering point cloud of each one. For each patient, the Doppler pattern of the point cloud over a time period is collected as the behavior signature. A three-layer CNN model is created to classify the behavior for each patient. The tracking and point clouds detection algorithm was also implemented on an mmWave radar hardware platform with an embedded graphics processing unit (GPU) board to collect Doppler pattern and run the CNN model. A training dataset of six types of behavior were collected, over a long duration, to train the model using Adam optimizer with an objective to minimize cross-entropy loss function. Lastly, the system was tested for real-time operation and obtained a very good inference accuracy when predicting each patient's behavior in a two-patient scenario.

Multi-class Hierarchical Question Classification for Multiple Choice Science Exams Artificial Intelligence

Prior work has demonstrated that question classification (QC), recognizing the problem domain of a question, can help answer it more accurately. However, developing strong QC algorithms has been hindered by the limited size and complexity of annotated data available. To address this, we present the largest challenge dataset for QC, containing 7,787 science exam questions paired with detailed classification labels from a fine-grained hierarchical taxonomy of 406 problem domains. We then show that a BERT-based model trained on this dataset achieves a large (+0.12 MAP) gain compared with previous methods, while also achieving state-of-the-art performance on benchmark open-domain and biomedical QC datasets. Finally, we show that using this model's predictions of question topic significantly improves the accuracy of a question answering system by +1.7% P@1, with substantial future gains possible as QC performance improves.

Which Machine Learning Platform Performs Best?


The advent of automated machine learning platforms has expanded the access and availability of algorithmic interpretation over the past several years. But how do the different machine learning platforms stack up from a performance perspective? That's the question that researchers from Arizona State University sought to answer. As the market for machine learning platforms expands, users are naturally inclined to seek sources of information to rank and rate the various options that are available to them. Which systems are the easiest to use?

Wrapping up Python into a Cloud-based PostgreSQL


Specifically, using PL/Python, one can bring in countless Python libraries to process data close to the database. Here I will talk about my efforts to bring in the functionality of PySAL, a spatial analytics library written in Python and developed largely by Serge Rey, et al. at Arizona State University. PySAL makes available robust exploratory spatial data analysis related to spatial cluster and outlier detection, hotspot detection, spatial regression, and much more. Besides the wrappers we wrote for PySAL, we have written classes for bringing in machine learning methods such as random forest, linear regression, support vector machines, and neural networks from scikit-learn and Tensorflow. This talk will specifically cover the challenges we encountered programming in the PL/Python environment, collaborations with some of the PySAL developers, and the power of having spatial statistics and machine learning capabilities baked right into a cloud database.

You Don't Have to Be a Rocket Scientist to Hire the Right Data Scientist


Join us live with Marcus Lemonis at Thought Leaders Arizona on May 24. We've all heard it by now: Data scientists have the century's sexiest job and they're here to save your business with their big data expertise. Everyone wants to hire one. But what are the chances you'll stumble across an actual data scientist unicorn who just happens to be a perfect fit for your business? The thing about data science is that it draws from dozens of fields, including machine learning, data mining, analytics and artificial intelligence.

Online Tensor Methods for Learning Latent Variable Models Machine Learning

We introduce an online tensor decomposition based approach for two latent variable modeling problems namely, (1) community detection, in which we learn the latent communities that the social actors in social networks belong to, and (2) topic modeling, in which we infer hidden topics of text articles. We consider decomposition of moment tensors using stochastic gradient descent. We conduct optimization of multilinear operations in SGD and avoid directly forming the tensors, to save computational and storage costs. We present optimized algorithm in two platforms. Our GPU-based implementation exploits the parallelism of SIMD architectures to allow for maximum speed-up by a careful optimization of storage and data transfer, whereas our CPU-based implementation uses efficient sparse matrix computations and is suitable for large sparse datasets. For the community detection problem, we demonstrate accuracy and computational efficiency on Facebook, Yelp and DBLP datasets, and for the topic modeling problem, we also demonstrate good performance on the New York Times dataset. We compare our results to the state-of-the-art algorithms such as the variational method, and report a gain of accuracy and a gain of several orders of magnitude in the execution time.