Atlantic Ocean
PRACTIQ: A Practical Conversational Text-to-SQL dataset with Ambiguous and Unanswerable Queries
Dong, Mingwen, Kumar, Nischal Ashok, Hu, Yiqun, Chauhan, Anuj, Hang, Chung-Wei, Chang, Shuaichen, Pan, Lin, Lan, Wuwei, Zhu, Henghui, Jiang, Jiarong, Ng, Patrick, Wang, Zhiguo
Previous text-to-SQL datasets and systems have primarily focused on user questions with clear intentions that can be answered. However, real user questions can often be ambiguous with multiple interpretations or unanswerable due to a lack of relevant data. In this work, we construct a practical conversational text-to-SQL dataset called PRACTIQ, consisting of ambiguous and unanswerable questions inspired by real-world user questions. We first identified four categories of ambiguous questions and four categories of unanswerable questions by studying existing text-to-SQL datasets. Then, we generate conversations with four turns: the initial user question, an assistant response seeking clarification, the user's clarification, and the assistant's clarified SQL response with the natural language explanation of the execution results. For some ambiguous queries, we also directly generate helpful SQL responses, that consider multiple aspects of ambiguity, instead of requesting user clarification. To benchmark the performance on ambiguous, unanswerable, and answerable questions, we implemented large language model (LLM)-based baselines using various LLMs. Our approach involves two steps: question category classification and clarification SQL prediction. Our experiments reveal that state-of-the-art systems struggle to handle ambiguous and unanswerable questions effectively. We will release our code for data generation and experiments on GitHub.
Unknown drone fleet breached US military base airspace in Virginia for 17 straight days: report
A mysterious fleet of drones entered restricted airspace and swarmed a U.S. military base along the Virginia coast for 17 days late last year, stumping the Pentagon, according to a new report. For several nights last December, U.S. military personnel reported witnessing a fleet of unknown unmanned aircraft breach restricted airspace over a stretch of land at Langley Air Force Base along Virginia's shore, the Wall Street Journal first reported. The drones would start to arrive about 45 minutes to an hour after sunset each night, one official reportedly told U.S. Air Force Gen. Mark Kelly, who joined several other officers responsible for the country's most advanced jet fighters, including F-22 Raptors, on a squadron rooftop. Kelly described the first drone he saw as roughly 20 feet long and flying at more than 100 miles an hour, at an altitude of roughly 3,000 to 4,000 feet. As many as a dozen or more drones followed, flying across Chesapeake Bay, and then traveling toward Norfolk, Virginia, and through a space overlooking the base for the Navy's SEAL Team Six and Naval Station Norfolk, the world's largest naval port, according to the Journal.
SpaceX 'catches' giant Starship rocket booster in fifth flight test
SpaceX has launched its fifth Starship test flight from Texas and returned the rocket's towering first-stage booster back to land for the first time, achieving a novel recovery method involving large metal arms. The rocket's Super Heavy first-stage booster lifted off at 7:25 am (12:25 GMT) on Sunday from SpaceX's launch facilities in Boca Chica, Texas, sending the second-stage Starship rocket on a path in space bound for the Indian Ocean west of Australia, where it will attempt atmospheric reentry followed by a water landing. The Super Heavy booster, after separating from the Starship booster some 74km (46 miles) in altitude, returned to the same area from which it was launched to make its landing attempt, aided by two robotic arms attached to the launch tower. "The tower has caught the rocket!!" SpaceX founder Elon Musk posted on X. Towering almost 121 metres (400 feet), the empty Starship arched over the Gulf of Mexico like the four Starships before it that ended up being destroyed, either soon after liftoff or while ditching into the sea. The last one in June was the most successful yet, completing its flight without exploding.
A Tidal Current Speed Forecasting Model based on Multiple Periodicity Learning
Cheng, Tengfei, Dong, Yunxuan, Huang, Yangdi
Tidal energy is one of the key components in increasing the penetration rate of renewable energy. The penetration of tidal energy in the electrical grid depends on the accuracy of tidal current speed forecasting. Modeling inaccuracies hinder forecast accuracy. Previous research has primarily used physical models to forecast tidal current speed. However, tidal current variations influenced by the orbital periods of celestial bodies make accurate physical modeling challenging. Researching the multiple periodicity of tides is crucial for accurately forecasting tidal current speed. In this article, we propose the Wavelet-Enhanced Convolutional Network (WCN) to learn multiple periodicity. The framework embeds intra-period and inter-period variations of one-dimensional tidal current data into the rows and columns of a two-dimensional tensor. Then, the two-dimensional variations of the sequence can be processed by convolutional kernels. We integrate a time-frequency analysis method into the framework to further address local periodic features. Additionally, to enhance the framework's stability, we optimize the framework's hyperparameters with the Tree-structured Parzen Estimator algorithm. The proposed framework avoids the lack of learning multiple periodicity. Compared with benchmarks, the proposed framework reduces the mean absolute error and mean square error in 10-step forecasting by, at most, 90.36% and 97.56%, respectively.
SlimSeiz: Efficient Channel-Adaptive Seizure Prediction Using a Mamba-Enhanced Network
Lu, Guorui, Peng, Jing, Huang, Bingyuan, Gao, Chang, Stefanov, Todor, Hao, Yong, Chen, Qinyu
Epileptic seizures cause abnormal brain activity, and their unpredictability can lead to accidents, underscoring the need for long-term seizure prediction. Although seizures can be predicted by analyzing electroencephalogram (EEG) signals, existing methods often require too many electrode channels or larger models, limiting mobile usability. This paper introduces a SlimSeiz framework that utilizes adaptive channel selection with a lightweight neural network model. SlimSeiz operates in two states: the first stage selects the optimal channel set for seizure prediction using machine learning algorithms, and the second stage employs a lightweight neural network based on convolution and Mamba for prediction. On the Children's Hospital Boston-MIT (CHB-MIT) EEG dataset, SlimSeiz can reduce channels from 22 to 8 while achieving a satisfactory result of 94.8% accuracy, 95.5% sensitivity, and 94.0% specificity with only 21.2K model parameters, matching or outperforming larger models' performance. We also validate SlimSeiz on a new EEG dataset, SRH-LEI, collected from Shanghai Renji Hospital, demonstrating its effectiveness across different patients. The code and SRH-LEI dataset are available at https://github.com/guoruilu/SlimSeiz.
Learning from the past: predicting critical transitions with machine learning trained on surrogates of historical data
Ma, Zhiqin, Zeng, Chunhua, Zhang, Yi-Cheng, Bury, Thomas M.
Complex systems can undergo critical transitions, where slowly changing environmental conditions trigger a sudden shift to a new, potentially catastrophic state. Early warning signals for these events are crucial for decision-making in fields such as ecology, biology and climate science. Generic early warning signals motivated by dynamical systems theory have had mixed success on real noisy data. More recent studies found that deep learning classifiers trained on synthetic data could improve performance. However, neither of these methods take advantage of historical, system-specific data. Here, we introduce an approach that trains machine learning classifiers directly on surrogate data of past transitions, namely surrogate data-based machine learning (SDML). The approach provides early warning signals in empirical and experimental data from geology, climatology, sociology, and cardiology with higher sensitivity and specificity than two widely used generic early warning signals -- variance and lag-1 autocorrelation. Since the approach is trained directly on surrogates of historical data, it is not bound by the restricting assumption of a local bifurcation like previous methods. This system-specific approach can contribute to improved early warning signals to help humans better prepare for or avoid undesirable critical transitions.
Testing GPT-4-o1-preview on math and science problems: A follow-up study
In August 2023, Scott Aaronson and I reported the results of testing GPT4 with the Wolfram Alpha and Code Interpreter plug-ins over a collection of 105 original high-school level and college-level science and math problems (Davis and Aaronson, 2023). In September 2024, I tested the recently released model GPT-4o1-preview on the same collection. Overall I found that performance had significantly improved, but was still considerably short of perfect. In particular, problems that involve spatial reasoning are often stumbling blocks. On September 12, OpenAI (2024) released two preliminary versions, "ChatGPT-o1-preview" and "ChatGPT-o1-mini" of a forthcoming product "ChatGPT-o1".
Russian strike kills seven in latest attack on Ukrainian port
Russia's overnight attacks on Ukraine also left several people wounded in the southern city of Zaporizhzhia. Meanwhile, Ukrainian drones targeted a military airfield in the Maikop region of southern Russia. Local officials evacuated 40 people from a nearby village. Russia's missile strike on the Odesa region hit a Panamanian-registered ship on Wednesday night, Oleh Kiper said - two days after a Palau-flagged ship was attacked, leaving one dead on board. Another ship, which was said to be carrying 6,000 tonnes of corn, was attacked on Sunday.
IceDiff: High Resolution and High-Quality Sea Ice Forecasting with Generative Diffusion Prior
Xu, Jingyi, Tu, Siwei, Yang, Weidong, Li, Shuhao, Liu, Keyi, Luo, Yeqi, Ma, Lipeng, Fei, Ben, Bai, Lei
Variation of Arctic sea ice has significant impacts on polar ecosystems, transporting routes, coastal communities, and global climate. Tracing the change of sea ice at a finer scale is paramount for both operational applications and scientific studies. Recent pan-Arctic sea ice forecasting methods that leverage advances in artificial intelligence has made promising progress over numerical models. However, forecasting sea ice at higher resolutions is still under-explored. To bridge the gap, we propose a two-staged deep learning framework, IceDiff, to forecast sea ice concentration at finer scales. IceDiff first leverages an independently trained vision transformer to generate coarse yet superior forecasting over previous methods at a regular 25km x 25km grid. This high-quality sea ice forecasting can be utilized as reliable guidance for the next stage. Subsequently, an unconditional diffusion model pre-trained on sea ice concentration maps is utilized for sampling down-scaled sea ice forecasting via a zero-shot guided sampling strategy and a patch-based method. For the first time, IceDiff demonstrates sea ice forecasting with the 6.25km x 6.25km resolution. IceDiff extends the boundary of existing sea ice forecasting models and more importantly, its capability to generate high-resolution sea ice concentration data is vital for pragmatic usages and research.
From Logits to Hierarchies: Hierarchical Clustering made Simple
Palumbo, Emanuele, Vandenhirtz, Moritz, Ryser, Alain, Daunhawer, Imant, Vogt, Julia E.
The structure of many real-world datasets is intrinsically hierarchical, making the modeling of such hierarchies a critical objective in both unsupervised and supervised machine learning. Recently, novel approaches for hierarchical clustering with deep architectures have been proposed. In this work, we take a critical perspective on this line of research and demonstrate that many approaches exhibit major limitations when applied to realistic datasets, partly due to their high computational complexity. In particular, we show that a lightweight procedure implemented on top of pre-trained non-hierarchical clustering models outperforms models designed specifically for hierarchical clustering. Our proposed approach is computationally efficient and applicable to any pre-trained clustering model that outputs logits, without requiring any fine-tuning. To highlight the generality of our findings, we illustrate how our method can also be applied in a supervised setup, recovering meaningful hierarchies from a pre-trained ImageNet classifier.