Goto

Collaborating Authors

 seb




Subword Embedding from Bytes Gains Privacy without Sacrificing Accuracy and Complexity

Zhang, Mengjiao, Xu, Jia

arXiv.org Artificial Intelligence

While NLP models significantly impact our lives, there are rising concerns about privacy invasion. Although federated learning enhances privacy, attackers may recover private training data by exploiting model parameters and gradients. Therefore, protecting against such embedding attacks remains an open challenge. To address this, we propose Subword Embedding from Bytes (SEB) and encode subwords to byte sequences using deep neural networks, making input text recovery harder. Importantly, our method requires a smaller memory with $256$ bytes of vocabulary while keeping efficiency with the same input length. Thus, our solution outperforms conventional approaches by preserving privacy without sacrificing efficiency or accuracy. Our experiments show SEB can effectively protect against embedding-based attacks from recovering original sentences in federated learning. Meanwhile, we verify that SEB obtains comparable and even better results over standard subword embedding methods in machine translation, sentiment analysis, and language modeling with even lower time and space complexity.


The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding

Enevoldsen, Kenneth, Kardos, Márton, Muennighoff, Niklas, Nielbo, Kristoffer Laigaard

arXiv.org Artificial Intelligence

The evaluation of English text embeddings has transitioned from evaluating a handful of datasets to broad coverage across many tasks through benchmarks such as MTEB. However, this is not the case for multilingual text embeddings due to a lack of available benchmarks. To address this problem, we introduce the Scandinavian Embedding Benchmark (SEB). SEB is a comprehensive framework that enables text embedding evaluation for Scandinavian languages across 24 tasks, 10 subtasks, and 4 task categories. Building on SEB, we evaluate more than 26 models, uncovering significant performance disparities between public and commercial solutions not previously captured by MTEB. We open-source SEB and integrate it with MTEB, thus bridging the text embedding evaluation gap for Scandinavian languages.


Streamlined Empirical Bayes Fitting of Linear Mixed Models in Mobile Health

Menictas, Marianne, Tomkins, Sabina, Murphy, Susan A

arXiv.org Machine Learning

To effect behavior change a successful algorithm must make high-quality decisions in real-time. For example, a mobile health (mHealth) application designed to increase physical activity must make contextually relevant suggestions to motivate users. While machine learning offers solutions for certain stylized settings, such as when batch data can be processed offline, there is a dearth of approaches which can deliver high-quality solutions under the specific constraints of mHealth. We propose an algorithm which provides users with contextualized and personalized physical activity suggestions. This algorithm is able to overcome a challenge critical to mHealth that complex models be trained efficiently. We propose a tractable streamlined empirical Bayes procedure which fits linear mixed effects models in large-data settings. Our procedure takes advantage of sparsity introduced by hierarchical random effects to efficiently learn the posterior distribution of a linear mixed effects model. A key contribution of this work is that we provide explicit updates in order to learn both fixed effects, random effects and hyper-parameter values. We demonstrate the success of this approach in a mobile health (mHealth) reinforcement learning application, a domain in which fast computations are crucial for real time interventions. Not only is our approach computationally efficient, it is also easily implemented with closed form matrix algebraic updates and we show improvements over state of the art approaches both in speed and accuracy of up to 99% and 56% respectively.


Stagewise Enlargement of Batch Size for SGD-based Learning

Zhao, Shen-Yi, Xie, Yin-Peng, Li, Wu-Jun

arXiv.org Machine Learning

Existing research shows that the batch size can seriously affect the performance of stochastic gradient descent~(SGD) based learning, including training speed and generalization ability. A larger batch size typically results in less parameter updates. In distributed training, a larger batch size also results in less frequent communication. However, a larger batch size can make a generalization gap more easily. Hence, how to set a proper batch size for SGD has recently attracted much attention. Although some methods about setting batch size have been proposed, the batch size problem has still not been well solved. In this paper, we first provide theory to show that a proper batch size is related to the gap between initialization and optimum of the model parameter. Then based on this theory, we propose a novel method, called \underline{s}tagewise \underline{e}nlargement of \underline{b}atch \underline{s}ize~(\mbox{SEBS}), to set proper batch size for SGD. More specifically, \mbox{SEBS} adopts a multi-stage scheme, and enlarges the batch size geometrically by stage. We theoretically prove that, compared to classical stagewise SGD which decreases learning rate by stage, \mbox{SEBS} can reduce the number of parameter updates without increasing generalization error. SEBS is suitable for \mbox{SGD}, momentum \mbox{SGD} and AdaGrad. Empirical results on real data successfully verify the theories of \mbox{SEBS}. Furthermore, empirical results also show that SEBS can outperform other baselines.


TuNet: End-to-end Hierarchical Brain Tumor Segmentation using Cascaded Networks

Vu, Minh H., Nyholm, Tufve, Löfstedt, Tommy

arXiv.org Machine Learning

Glioma is one of the most common types of brain tumors arising in the glial cells in the human brain and spinal cord. In addition to the threat of death, glioma treatment is also very costly. Hence, automatic and accurate segmentation and measurement from the early stages are critical in order to prolong the survival rates of the patients and to reduce the costs of health care. In the present work, we propose a novel end-to-end cascaded network for semantic segmentation that utilizes the hierarchical structure of the tumor sub-regions with ResNet-like blocks and Squeeze-and-Excitation modules after each convolution and concatenation block. By utilizing cross-validation, an average ensemble technique, and a simple post-processing technique, we obtained dice scores of 90.34, 81.12, and 78.42 and Hausdorff Distances (95th percentile) of 4.32, 6.28, and 3.70 for the whole tumor, tumor core, and enhancing tumor, respectively, on the online validation set.


The achievement gap and AI augmented online tutoring

#artificialintelligence

The achievement gap between students who come from different socio-economic backgrounds is a well-known and persistent problem in education. Disparities in achievements between high and low socio-economic groups are evident in children as young as age 3 years and seem to be a problem the world over. Despite pupils' overall attainment scores rising over the last decade or so, the gap between students from different socio-economic groups remains intractably present and widespread. Family finances play a big part amongst the various reasons for this disparity. Pupils from low socio-economic backgrounds (SEBs) are often only able to attend a few, if any of the extra-curricular activities enjoyed by their more affluent peers. Access to good schools for pupils from SEBs is often reduced as is their access to the educational and occupational aspirations which can impact children's academic achievement.


Banking on AI to Offer Better Customer Service

#artificialintelligence

Artificial intelligence enables virtual agents to learn by observation. That adaptive ability is what a Swedish bank is counting on to serve its customers. In early 2016, SEB, one of Sweden's largest banks with a presence in 20 countries around the globe, started integrating Amelia, an artificial intelligence (AI) platform from IPsoft, into its help desk. Amelia is represented by a blond female avatar and is always referred to as "she" rather than "it." The artificial intelligence platform is built on semantic understanding, which enables Amelia to interact with users through natural language to determine what actions to take in order to answer a question, fulfill a request or solve a problem.


Swedish bank uses Amelia the robot for customer services

#artificialintelligence

SEB in Sweden is the first bank to use IPsoft's cognitive technology for customer services after the software robot proved successful in an internal IT service desk project. A collection of our most popular articles for IT leaders from the first few months of 2016, including: - Corporate giants recruit digitally-minded outsiders to drive transformation - Analytics platforms to drive strategy in 2016 - Next generation: The changing role of IT leaders. This email address is already registered. By submitting my Email address I confirm that I have read and accepted the Terms of Use and Declaration of Consent. By submitting your personal information, you agree that TechTarget and its partners may contact you regarding relevant content, products and special offers.