Goto

Collaborating Authors

 ottawa


Data-Driven Spectrum Demand Prediction: A Spatio-Temporal Framework with Transfer Learning

arXiv.org Artificial Intelligence

Accurate spectrum demand prediction is crucial for informed spectrum allocation, effective regulatory planning, and fostering sustainable growth in modern wireless communication networks. It supports governmental efforts, particularly those led by the international telecommunication union (ITU), to establish fair spectrum allocation policies, improve auction mechanisms, and meet the requirements of emerging technologies such as advanced 5G, forthcoming 6G, and the internet of things (IoT). This paper presents an effective spatio-temporal prediction framework that leverages crowdsourced user-side key performance indicators (KPIs) and regulatory datasets to model and forecast spectrum demand. The proposed methodology achieves superior prediction accuracy and cross-regional generalizability by incorporating advanced feature engineering, comprehensive correlation analysis, and transfer learning techniques. Unlike traditional ITU models, which are often constrained by arbitrary inputs and unrealistic assumptions, this approach exploits granular, data-driven insights to account for spatial and temporal variations in spectrum utilization. Comparative evaluations against ITU estimates, as the benchmark, underscore our framework's capability to deliver more realistic and actionable predictions. Experimental results validate the efficacy of our methodology, highlighting its potential as a robust approach for policymakers and regulatory bodies to enhance spectrum management and planning.


Canada warns of election threats from China, Russia, India and Pakistan

Al Jazeera

China and India are likely to attempt to interfere in upcoming elections, Canada's intelligence agency has warned, adding that Russia and Pakistan also pose a potential threat. The deputy director of operations for the Canadian Security Intelligence Service (CSIS) said on Tuesday that the agency is braced for efforts to meddle in the April 28 vote. Ottawa's relations with China and India in particular have been strained. Vanessa Lloyd told a media conference that such countries are increasingly using artificial intelligence (AI) to interfere in elections around the globe. China is "highly likely to use AI-enabled tools to attempt to interfere with Canada's democratic process in this current election," she said. India has the "intent and capability" to do likewise, she continued, adding that Russia and Pakistan could also potentially seek to interfere.


Enhancing Fake News Detection in Social Media via Label Propagation on Cross-modal Tweet Graph

arXiv.org Artificial Intelligence

Fake news detection in social media has become increasingly important due to the rapid proliferation of personal media channels and the consequential dissemination of misleading information. Existing methods, which primarily rely on multimodal features and graph-based techniques, have shown promising performance in detecting fake news. However, they still face a limitation, i.e., sparsity in graph connections, which hinders capturing possible interactions among tweets. This challenge has motivated us to explore a novel method that densifies the graph's connectivity to capture denser interaction better. Our method constructs a cross-modal tweet graph using CLIP, which encodes images and text into a unified space, allowing us to extract potential connections based on similarities in text and images. We then design a Feature Contextualization Network with Label Propagation (FCN-LP) to model the interaction among tweets as well as positive or negative correlations between predicted labels of connected tweets. The propagated labels from the graph are weighted and aggregated for the final detection. To enhance the model's generalization ability to unseen events, we introduce a domain generalization loss that ensures consistent features between tweets on seen and unseen events. We use three publicly available fake news datasets, Twitter, PHEME, and Weibo, for evaluation. Our method consistently improves the performance over the state-of-the-art methods on all benchmark datasets and effectively demonstrates its aptitude for generalizing fake news detection in social media.


OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection

arXiv.org Artificial Intelligence

Recently, there has been considerable attention on detecting hallucinations and omissions in Machine Translation (MT) systems. The two dominant approaches to tackle this task involve analyzing the MT system's internal states or relying on the output of external tools, such as sentence similarity or MT quality estimators. In this work, we introduce OTTAWA, a novel Optimal Transport (OT)-based word aligner specifically designed to enhance the detection of hallucinations and omissions in MT systems. Our approach explicitly models the missing alignments by introducing a "null" vector, for which we propose a novel one-side constrained OT setting to allow an adaptive null alignment. Our approach yields competitive results compared to state-of-the-art methods across 18 language pairs on the HalOmi benchmark. In addition, it shows promising features, such as the ability to distinguish between both error types and perform word-level detection without accessing the MT system's internal states.


Bringing Robots Home: The Rise of AI Robots in Consumer Electronics

arXiv.org Artificial Intelligence

On March 18, 2024, NVIDIA unveiled Project GR00T, a general-purpose multimodal generative AI model designed specifically for training humanoid robots. Preceding this event, Tesla's unveiling of the Optimus Gen 2 humanoid robot on December 12, 2023, underscored the profound impact robotics is poised to have on reshaping various facets of our daily lives. While robots have long dominated industrial settings, their presence within our homes is a burgeoning phenomenon. This can be attributed, in part, to the complexities of domestic environments and the challenges of creating robots that can seamlessly integrate into our daily routines.


MATK: The Meme Analytical Tool Kit

arXiv.org Artificial Intelligence

The rise of social media platforms has brought about a new digital culture called memes. Memes, which combine visuals and text, can strongly influence public opinions on social and cultural issues. As a result, people have become interested in categorizing memes, leading to the development of various datasets and multimodal models that show promising results in this field. However, there is currently a lack of a single library that allows for the reproduction, evaluation, and comparison of these models using fair benchmarks and settings. To fill this gap, we introduce the Meme Analytical Tool Kit (MATK), an open-source toolkit specifically designed to support existing memes datasets and cutting-edge multimodal models. MATK aims to assist researchers and engineers in training and reproducing these multimodal models for meme classification tasks, while also providing analysis techniques to gain insights into their strengths and weaknesses. To access MATK, please visit \url{https://github.com/Social-AI-Studio/MATK}.


CCMB: A Large-scale Chinese Cross-modal Benchmark

arXiv.org Artificial Intelligence

Vision-language pre-training (VLP) on large-scale datasets has shown premier performance on various downstream tasks. In contrast to plenty of available benchmarks with English corpus, large-scale pre-training datasets and downstream datasets with Chinese corpus remain largely unexplored. In this work, we build a large-scale high-quality Chinese Cross-Modal Benchmark named CCMB for the research community, which contains the currently largest public pre-training dataset Zero and five human-annotated fine-tuning datasets for downstream tasks. Zero contains 250 million images paired with 750 million text descriptions, plus two of the five fine-tuning datasets are also currently the largest ones for Chinese cross-modal downstream tasks. Along with the CCMB, we also develop a VLP framework named R2D2, applying a pre-Ranking + Ranking strategy to learn powerful vision-language representations and a two-way distillation method (i.e., target-guided Distillation and feature-guided Distillation) to further enhance the learning capability. With the Zero and the R2D2 VLP framework, we achieve state-of-the-art performance on twelve downstream datasets from five broad categories of tasks including image-text retrieval, image-text matching, image caption, text-to-image generation, and zero-shot image classification. The datasets, models, and codes are available at https://github.com/yuxie11/R2D2


Semi-supervised Deep Multi-view Stereo

arXiv.org Artificial Intelligence

Significant progress has been witnessed in learning-based Multi-view Stereo (MVS) under supervised and unsupervised settings. To combine their respective merits in accuracy and completeness, meantime reducing the demand for expensive labeled data, this paper explores the problem of learning-based MVS in a semi-supervised setting that only a tiny part of the MVS data is attached with dense depth ground truth. However, due to huge variation of scenarios and flexible settings in views, it may break the basic assumption in classic semi-supervised learning, that unlabeled data and labeled data share the same label space and data distribution, named as semi-supervised distribution-gap ambiguity in the MVS problem. To handle these issues, we propose a novel semi-supervised distribution-augmented MVS framework, namely SDA-MVS. For the simple case that the basic assumption works in MVS data, consistency regularization encourages the model predictions to be consistent between original sample and randomly augmented sample. For further troublesome case that the basic assumption is conflicted in MVS data, we propose a novel style consistency loss to alleviate the negative effect caused by the distribution gap. The visual style of unlabeled sample is transferred to labeled sample to shrink the gap, and the model prediction of generated sample is further supervised with the label in original labeled sample. The experimental results in semi-supervised settings of multiple MVS datasets show the superior performance of the proposed method. With the same settings in backbone network, our proposed SDA-MVS outperforms its fully-supervised and unsupervised baselines.


Bridging Trustworthiness and Open-World Learning: An Exploratory Neural Approach for Enhancing Interpretability, Generalization, and Robustness

arXiv.org Machine Learning

As researchers strive to narrow the gap between machine intelligence Contemporary artificial intelligence (AI) continues to furnish benefits and human through the development of artificial intelligence to real-society from economic and environmental perspectives, technologies, it is imperative that we recognize the critical among others [12, 33]. As AI gradually penetrates into high-risk importance of trustworthiness in open-world, which has become fields such as healthcare, finance and medicine, which are closely ubiquitous in all aspects of daily life for everyone. However, several related to human attributes, there is growing consensus awareness challenges may create a crisis of trust in current artificial intelligence that people urgently expect these AI solutions to be trustworthy systems that need to be bridged: 1) Insufficient explanation of [8, 16]. For instance, lenders expect the system to provide credible predictive results; 2) Inadequate generalization for learning models; explanations for rejecting their applications; engineers wish to develop 3) Poor adaptability to uncertain environments. Consequently, we common system interfaces to adapt to wider environments; explore a neural program to bridge trustworthiness and open-world businesspeople desire that the system can still operate effectively learning, extending from single-modal to multi-modal scenarios under various complex conditions, among other expectations.


India's foreign minister says he briefed US officials on Canada row

Al Jazeera

India's foreign minister has confirmed that he discussed his country's row with Canada over the killing of a Canadian Sikh leader with top United States government officials during a visit to Washington, DC, this week. Subrahmanyam Jaishankar said on Friday that he laid out India's concerns about Sikh separatist movement supporters in Canada during talks a day earlier with US Secretary of State Antony Blinken and US National Security Adviser Jake Sullivan. Canadian Prime Minister Justin Trudeau said on September 18 that his government was investigating "credible allegations of a potential link" between Indian government agents and the June killing of Hardeep Singh Nijjar, a prominent Sikh leader in western Canada. "They [Blinken and Sullivan] obviously shared US views and assessments on this whole situation and I explained to them … the concerns which I had," Jaishankar said during an event on Friday morning at the Hudson Institute, a conservative US think tank. "Hopefully we both came out of those meetings better informed."