Goto

Collaborating Authors

 Atlantic Ocean


Drone footage shows shark circling man and small child at Alabama beach

FOX News

The Gulf of Mexico has around 50 species of sharks, with around 20 to 30 species that beachgoers and fishermen can encounter. A shark was captured on drone footage Monday circling a man and a child swimming at a popular beach in Alabama. The footage, taken by 15-year-old Jackson Silvio and obtained by Fox News Digital, shows the man and child wading further out into the water at Orange Beach. At one point the shark appeared to swim just within a few feet of the man. The shark can be seen following them, swimming in a circle as it gets closer.


Wagner boss blasts Russia's elite following Moscow drone attack

Al Jazeera

The head of Russia's Wagner mercenary force has again criticised the Russian military and political elite following the drone attack on Moscow that injured two people, damaged property and left some furious the Kremlin had not better protected the capital city. In an expletive-drenched statement posted on Telegram by his press service on Tuesday, Yevgeny Prigozhin – whose mercenary fighters have played a key role in the war in Ukraine – blamed the drone attack on out-of-touch officials living in Moscow's affluent suburb of Rublyovka. "You, the Defence Ministry, have done nothing to launch an offensive," Prigozhin said in the statement. "How dare you allow the drones to reach Moscow?" "And what do ordinary people do when drones with explosives crash into their windows?" Focusing his ire on powerful residents of the upmarket Rublyovka area in Moscow's western suburbs, Prigozhin spoke of the "scum" and "swine" who sat quietly while Moscow was attacked.


Brain-Inspired Spiking Neural Network for Online Unsupervised Time Series Prediction

arXiv.org Artificial Intelligence

Energy and data-efficient online time series prediction for predicting evolving dynamical systems are critical in several fields, especially edge AI applications that need to update continuously based on streaming data. However, current DNN-based supervised online learning models require a large amount of training data and cannot quickly adapt when the underlying system changes. Moreover, these models require continuous retraining with incoming data making them highly inefficient. To solve these issues, we present a novel Continuous Learning-based Unsupervised Recurrent Spiking Neural Network Model (CLURSNN), trained with spike timing dependent plasticity (STDP). CLURSNN makes online predictions by reconstructing the underlying dynamical system using Random Delay Embedding by measuring the membrane potential of neurons in the recurrent layer of the RSNN with the highest betweenness centrality. We also use topological data analysis to propose a novel methodology using the Wasserstein Distance between the persistence homologies of the predicted and observed time series as a loss function. We show that the proposed online time series prediction methodology outperforms state-of-the-art DNN models when predicting an evolving Lorenz63 dynamical system.


QUEST: A Retrieval Dataset of Entity-Seeking Queries with Implicit Set Operations

arXiv.org Artificial Intelligence

Formulating selective information needs results in queries that implicitly specify set operations, such as intersection, union, and difference. For instance, one might search for "shorebirds that are not sandpipers" or "science-fiction films shot in England". To study the ability of retrieval systems to meet such information needs, we construct QUEST, a dataset of 3357 natural language queries with implicit set operations, that map to a set of entities corresponding to Wikipedia documents. The dataset challenges models to match multiple constraints mentioned in queries with corresponding evidence in documents and correctly perform various set operations. The dataset is constructed semi-automatically using Wikipedia category names. Queries are automatically composed from individual categories, then paraphrased and further validated for naturalness and fluency by crowdworkers. Crowdworkers also assess the relevance of entities based on their documents and highlight attribution of query constraints to spans of document text. We analyze several modern retrieval systems, finding that they often struggle on such queries. Queries involving negation and conjunction are particularly challenging and systems are further challenged with combinations of these operations.


Titanic remains reveal lost gold necklace made from the tooth of a megalodon

Daily Mail - Science & tech

A necklace'made from the tooth of a megalodon shark' is revealed in new images from the wreckage of RMS Titanic. The stunning artefact – which has not been worn since the ship's sinking in April 1912 – was identified in footage taken last summer by Guernsey-based firm Magellan Ltd. The footage was shot during efforts to capture the first digital scans of the shipwreck, which present the wreck almost as if it's been retrieved from the water. Other objects surrounding the necklace have not been identified, although it appears to be surrounded by small ring-shaped beads. Magellan Ltd, which is working with Atlantic Productions on a documentary about last year's expedition, is prohibited from taking them from the sea floor, however.


One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale

arXiv.org Artificial Intelligence

This paper proposes a unified diffusion framework (dubbed UniDiffuser) to fit all distributions relevant to a set of multi-modal data in one model. Our key insight is -- learning diffusion models for marginal, conditional, and joint distributions can be unified as predicting the noise in the perturbed data, where the perturbation levels (i.e. timesteps) can be different for different modalities. Inspired by the unified view, UniDiffuser learns all distributions simultaneously with a minimal modification to the original diffusion model -- perturbs data in all modalities instead of a single modality, inputs individual timesteps in different modalities, and predicts the noise of all modalities instead of a single modality. UniDiffuser is parameterized by a transformer for diffusion models to handle input types of different modalities. Implemented on large-scale paired image-text data, UniDiffuser is able to perform image, text, text-to-image, image-to-text, and image-text pair generation by setting proper timesteps without additional overhead. In particular, UniDiffuser is able to produce perceptually realistic samples in all tasks and its quantitative results (e.g., the FID and CLIP score) are not only superior to existing general-purpose models but also comparable to the bespoken models (e.g., Stable Diffusion and DALL-E 2) in representative tasks (e.g., text-to-image generation).


GPT4GEO: How a Language Model Sees the World's Geography

arXiv.org Artificial Intelligence

Large language models (LLMs) have shown remarkable capabilities across a broad range of tasks involving question answering and the generation of coherent text and code. Comprehensively understanding the strengths and weaknesses of LLMs is beneficial for safety, downstream applications and improving performance. In this work, we investigate the degree to which GPT-4 has acquired factual geographic knowledge and is capable of using this knowledge for interpretative reasoning, which is especially important for applications that involve geographic data, such as geospatial analysis, supply chain management, and disaster response. To this end, we design and conduct a series of diverse experiments, starting from factual tasks such as location, distance and elevation estimation to more complex questions such as generating country outlines and travel networks, route finding under constraints and supply chain analysis. We provide a broad characterisation of what GPT-4 (without plugins or Internet access) knows about the world, highlighting both potentially surprising capabilities but also limitations.


Applications of Machine Learning in Chemical and Biological Oceanography

arXiv.org Artificial Intelligence

Machine learning (ML) refers to computer algorithms that predict a meaningful output or categorize complex systems based on a large amount of data. ML is applied in various areas including natural science, engineering, space exploration, and even gaming development. This review focuses on the use of machine learning in the field of chemical and biological oceanography. In the prediction of global fixed nitrogen levels, partial carbon dioxide pressure, and other chemical properties, the application of ML is a promising tool. Machine learning is also utilized in the field of biological oceanography to detect planktonic forms from various images (i.e., microscopy, FlowCAM, and video recorders), spectrometers, and other signal processing techniques. Moreover, ML successfully classified the mammals using their acoustics, detecting endangered mammalian and fish species in a specific environment. Most importantly, using environmental data, the ML proved to be an effective method for predicting hypoxic conditions and harmful algal bloom events, an essential measurement in terms of environmental monitoring. Furthermore, machine learning was used to construct a number of databases for various species that will be useful to other researchers, and the creation of new algorithms will help the marine research community better comprehend the chemistry and biology of the ocean.


Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model

arXiv.org Artificial Intelligence

The immense scale of the recent large language models (LLM) allows many interesting properties, such as, instruction- and chain-of-thought-based fine-tuning, that has significantly improved zero- and few-shot performance in many natural language processing (NLP) tasks. Inspired by such successes, we adopt such an instruction-tuned LLM Flan-T5 as the text encoder for text-to-audio (TTA) generation -- a task where the goal is to generate an audio from its textual description. The prior works on TTA either pre-trained a joint text-audio encoder or used a non-instruction-tuned model, such as, T5. Consequently, our latent diffusion model (LDM)-based approach TANGO outperforms the state-of-the-art AudioLDM on most metrics and stays comparable on the rest on AudioCaps test set, despite training the LDM on a 63 times smaller dataset and keeping the text encoder frozen. This improvement might also be attributed to the adoption of audio pressure level-based sound mixing for training set augmentation, whereas the prior methods take a random mix.


Writing user personas with Large Language Models: Testing phase 6 of a Thematic Analysis of semi-structured interviews

arXiv.org Artificial Intelligence

The goal of this paper is establishing if we can satisfactorily perform a Thematic Analysis (TA) of semi-structured interviews using a Large Language Model (more precisely GPT3.5-Turbo). Building on previous work by the author, which established an embryonal process for conducting a TA with the model, this paper will perform a further analysis and then cover the last phase of a TA (phase 6), which entails the writing up of the result. This phase was not covered by the previous work. In particular, the focus will be on using the results of a TA done with the LLM on a dataset of user interviews, for writing user personas, with the model building on the TA to produce the personas narratives. User personas are models of real users, usually built from a data analysis like interviews with a sample of users. User personas are tools often used in User Centered Design processes. The paper shows that the model can build basic user personas with an acceptable quality deriving them from themes, and that the model can serve for the generation of ideas for user personas.