 merlion


MERLION: Marine ExploRation with Language guIded Online iNformative Visual Sampling and Enhancement

Thengane, Shrutika Vishal, Prasetyo, Marcel Bartholomeus, Tan, Yu Xiang, Meghjani, Malika

arXiv.org Artificial Intelligence

Autonomous and targeted underwater visual monitoring and exploration using Autonomous Underwater Vehicles (AUVs) can be a challenging task due to both online and offline constraints. The online constraints comprise limited onboard storage capacity and communication bandwidth to the surface, whereas the offline constraints entail the time and effort required for the selection of desired key frames from the video data. An example use case of targeted underwater visual monitoring is finding the most interesting visual frames of fish in a long sequence of an AUV's visual experience. This challenge of targeted informative sampling is further aggravated in murky waters with poor visibility. In this paper, we present MERLION, a novel framework that provides semantically aligned and visually enhanced summaries for murky underwater marine environment monitoring and exploration. Specifically, our framework integrates (a) an image-text model for semantically aligning the visual samples to the users' needs, (b) an image enhancement model for murky water visual data and (c) an informative sampler for summarizing the monitoring experience. We validate our proposed MERLION framework on real-world data with user studies and present qualitative and quantitative results using our evaluation metric and show improved results compared to the state-of-the-art approaches. We have open-sourced the code for MERLION at the following link https://github.com/MARVL-Lab/MERLION.git.


Simple yet Effective Code-Switching Language Identification with Multitask Pre-Training and Transfer Learning

Li, Shuyue Stella, Xiao, Cihan, Li, Tianjian, Odoom, Bismarck

arXiv.org Artificial Intelligence

Code-switching, also called code-mixing, is the linguistic phenomenon in which multilingual speakers, in casual settings, mix words from different languages in one utterance. Due to its spontaneous nature, code-switching is extremely low-resource, which makes it a challenging problem for language and speech processing tasks. In such contexts, Code-Switching Language Identification (CSLID) becomes a difficult but necessary task if we want to maximally leverage existing monolingual tools for other tasks. In this work, we propose two novel approaches toward improving language identification accuracy on an English-Mandarin child-directed speech dataset. Our methods include a stacked Residual CNN+GRU model and a multitask pre-training approach that uses Automatic Speech Recognition (ASR) as an auxiliary task for CSLID. Due to the low-resource nature of code-switching, we also employ careful silver data creation using monolingual corpora in both languages and up-sampling as data augmentation. We focus on English-Mandarin code-switched data, but our method works on any language pair. Our best model achieves a balanced accuracy of 0.781 on a real English-Mandarin code-switching child-directed speech corpus and outperforms the previous baseline by 55.3%.
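The abstract reports balanced accuracy rather than plain accuracy. As a minimal sketch, assuming the standard definition (the unweighted mean of per-class recall), the following shows why the two can differ on imbalanced language labels. The labels and predictions are made-up illustrative data, not results from the paper, and the function name is hypothetical.

```python
from collections import defaultdict


def balanced_accuracy(y_true, y_pred):
    """Unweighted mean of per-class recall (assumed standard definition)."""
    correct = defaultdict(int)  # correctly predicted items per true class
    total = defaultdict(int)    # all items per true class
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


# Hypothetical, imbalanced toy labels: 8 English segments, 2 Mandarin segments.
y_true = ["en"] * 8 + ["zh"] * 2
y_pred = ["en"] * 8 + ["en", "zh"]  # one Mandarin segment is misclassified

plain = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
score = balanced_accuracy(y_true, y_pred)
print(plain, score)
```

In this toy case the minority class has a recall of only one half, which plain accuracy largely hides and balanced accuracy exposes.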


GitHub - salesforce/Merlion: Merlion: A Machine Learning Framework for Time Series Intelligence

#artificialintelligence

Merlion is a Python library for time series intelligence. It provides an end-to-end machine learning framework that includes loading and transforming data, building and training models, post-processing model outputs, and evaluating model performance. It supports various time series learning tasks, including forecasting and anomaly detection for both univariate and multivariate time series. This library aims to provide engineers and researchers a one-stop solution to rapidly develop models for their specific time series needs, and benchmark them across multiple time series datasets. The table below provides a visual overview of how Merlion's key features compare to other libraries for time series anomaly detection and/or forecasting.


Merlion: A Machine Learning Library for Time Series

Bhatnagar, Aadyot, Kassianik, Paul, Liu, Chenghao, Lan, Tian, Yang, Wenzhuo, Cassius, Rowan, Sahoo, Doyen, Arpit, Devansh, Subramanian, Sri, Woo, Gerald, Saha, Amrita, Jagota, Arun Kumar, Gopalakrishnan, Gokulakrishnan, Singh, Manpreet, Krithika, K C, Maddineni, Sukumar, Cho, Daeki, Zong, Bo, Zhou, Yingbo, Xiong, Caiming, Savarese, Silvio, Hoi, Steven, Wang, Huan

arXiv.org Machine Learning

We introduce Merlion, an open-source machine learning library for time series. It features a unified interface for many commonly used models and datasets for anomaly detection and forecasting on both univariate and multivariate time series, along with standard pre/post-processing layers. It has several modules to improve ease-of-use, including visualization, anomaly score calibration to improve interpretability, AutoML for hyperparameter tuning and model selection, and model ensembling. Merlion also provides a unique evaluation framework that simulates the live deployment and re-training of a model in production. This library aims to provide engineers and researchers with a one-stop solution to rapidly develop models for their specific time series needs and benchmark them across multiple time series datasets. In this technical report, we highlight Merlion's architecture and major functionalities, and we report benchmark numbers across different baseline models and ensembles.
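The idea of an evaluation that simulates live deployment with periodic re-training can be illustrated with a small, library-free sketch. This is not Merlion's API: `MeanForecaster`, `simulate_deployment`, and the parameters `initial_train` and `retrain_every` are hypothetical names chosen for illustration, and the toy model simply predicts the mean of its training history.

```python
from statistics import mean


class MeanForecaster:
    """Toy stand-in for a forecasting model: predicts the mean of its training data."""

    def fit(self, history):
        self.level = mean(history)
        return self

    def predict(self):
        return self.level


def simulate_deployment(series, initial_train, retrain_every):
    """Walk forward through the series, re-training on all data seen so far
    every `retrain_every` steps, and return the mean absolute error of the
    one-step-ahead predictions."""
    model = MeanForecaster().fit(series[:initial_train])
    last_trained_at = initial_train
    errors = []
    for t in range(initial_train, len(series)):
        if t - last_trained_at >= retrain_every:
            model.fit(series[:t])  # only data observed before time t is used
            last_trained_at = t
        errors.append(abs(series[t] - model.predict()))
    return mean(errors)


# Hypothetical series with a level shift halfway through.
series = [1.0, 1.0, 1.0, 1.0, 3.0, 3.0, 3.0, 3.0]
mae_static = simulate_deployment(series, initial_train=4, retrain_every=100)
mae_retrained = simulate_deployment(series, initial_train=4, retrain_every=2)
print(mae_static, mae_retrained)
```

Comparing the two runs shows the kind of question such an evaluation is meant to answer: how much a model trained once degrades after the data shifts, relative to one that is re-trained on a schedule.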