AITopics

FOX NewsDec-14-2024, 13:30:12 GMT

Fox News AI Newsletter: Chatbot's deadly prompt

Artificial intelligence is being used to power the personalization of popular sports betting apps to tailor experiences to users' preferences. SUITS MOUNTING: Two Texas parents filed a lawsuit this week against the makers of Character.AI, claiming the artificial intelligence chatbot is a "clear and present danger to minors," with one plaintiff alleging it encouraged their teen to kill his parents. GENERATION AT RISK: Senate lawmakers unanimously passed the bipartisan-led Take It Down Act that would force social media companies to speedily remove sexually explicit deepfakes, prevent them from being posted and criminalize the act. 'WHAT WILL BE LEFT?': Lisa Kudrow fears an uncertain future as artificial intelligence becomes more and more prevalent in Hollywood. FUTURISTIC ROBOCOP: Footage from the streets of China captured a scene straight from a science fiction novel – spherical drones alongside patrolling law enforcement.

artificial intelligence, chatbot, natural language, (7 more...)

FOX News

Country:

North America > United States > Texas (0.27)
Asia > China (0.27)

Industry:

Law > Litigation (1.00)
Information Technology (1.00)
Media > News (0.69)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)

Qiao, Yaqiong, Huang, Guojun

A Novel End-To-End Event Geolocation Method Leveraging Hyperbolic Space and Toponym Hierarchies

Abstract: Timely detection and geolocation of events based on social data can provide critical information for applications such as crisis response and resource allocation. However, most existing methods are greatly affected by event detection errors, leading to insufficient geolocation accuracy. To this end, this paper proposes a novel end-to-end event geolocation method (GTOP) leveraging Hyperbolic space and toponym hierarchies. Specifically, the proposed method contains one event detection module and one geolocation module. The event detection module constructs a heterogeneous information networks based on social data, and then constructs a homogeneous message graph and combines it with the text and time feature of the message to learning initial features of nodes. Node features are updated in Hyperbolic space and then fed into a classifier for event detection. To reduce the geolocation error, this paper proposes a noise toponym filtering algorithm (HIST) based on the hierarchical structure of toponyms. HIST analyzes the hierarchical structure of toponyms mentioned in the event cluster, taking the highly frequent city-level locations as the coarsegrained locations for events. To further improve the geolocation accuracy, we propose a fine-grained pseudo toponyms generation algorithm (FIT) based on the output of HIST, and combine generated pseudo toponyms with filtered toponyms to locate events based on the geographic center points of the combined toponyms. Extensive experiments are conducted on the Chinese dataset constructed in this paper and another public English dataset. The experimental results show that the proposed method is superior to the state-of-the-art baselines.

data mining, machine learning, toponym, (19 more...)

2412.1087

Country:

North America > United States > Texas > Dallas County > Dallas (0.04)
Asia > China > Zhejiang Province > Hangzhou (0.04)
South America > Brazil > Ceará > Fortaleza (0.04)
(15 more...)

Genre: Research Report > New Finding (0.66)

Industry:

Information Technology (0.68)
Law (0.68)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
(3 more...)

SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report Generation

Wu, Qilong, Xiang, Xiaoneng, Huang, Hejia, Wang, Xuan, Jie, Yeo Wei, Satapathy, Ranjan, Filho, Ricardo Shirota, Veeravalli, Bharadwaj

The rapid growth of the financial sector and the rising focus on Environmental, Social, and Governance (ESG) considerations highlight the need for advanced NLP tools. However, open-source LLMs proficient in both finance and ESG domains remain scarce. To address this gap, we introduce SusGen-30K, a category-balanced dataset comprising seven financial NLP tasks and ESG report generation, and propose TCFD-Bench, a benchmark for evaluating sustainability report generation. Leveraging this dataset, we developed SusGen-GPT, a suite of models achieving state-of-the-art performance across six adapted and two off-the-shelf tasks, trailing GPT-4 by only 2% despite using 7-8B parameters compared to GPT-4's 1,700B. Based on this, we propose the SusGen system, integrated with Retrieval-Augmented Generation (RAG), to assist in sustainability report generation. This work demonstrates the efficiency of our approach, advancing research in finance and ESG.

large language model, machine learning, natural language, (20 more...)

2412.10906

Country:

North America > United States > Louisiana (0.04)
North America > United States > Mississippi (0.04)
North America > United States > Arkansas (0.04)
(5 more...)

Genre:

Research Report (0.82)
Public Relations > Community Relations (0.81)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Banking & Finance > Trading (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Torka, Simon, Albayrak, Sahin

Optimizing AI-Assisted Code Generation

In recent years, the rise of AI-assisted code-generation tools has significantly transformed software development. While code generators have mainly been used to support conventional software development, their use will be extended to powerful and secure AI systems. Systems capable of generating code, such as ChatGPT, OpenAI Codex, GitHub Copilot, and AlphaCode, take advantage of advances in machine learning (ML) and natural language processing (NLP) enabled by large language models (LLMs). However, it must be borne in mind that these models work probabilistically, which means that although they can generate complex code from natural language input, there is no guarantee for the functionality and security of the generated code. However, to fully exploit the considerable potential of this technology, the security, reliability, functionality, and quality of the generated code must be guaranteed. This paper examines the implementation of these goals to date and explores strategies to optimize them. In addition, we explore how these systems can be optimized to create safe, high-performance, and executable artificial intelligence (AI) models, and consider how to improve their accessibility to make AI development more inclusive and equitable.

code generator, large language model, machine learning, (18 more...)

2412.10953

Country:

North America > United States > New Jersey > Middlesex County > Piscataway (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
(3 more...)

Genre: Research Report > Experimental Study (0.67)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Government (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.48)

Qiao, Yaqiong, Peng, Shixuan

CRENER: A Character Relation Enhanced Chinese NER Model

Chinese Named Entity Recognition (NER) is an important task in information extraction, which has a significant impact on downstream applications. Due to the lack of natural separators in Chinese, previous NER methods mostly relied on external dictionaries to enrich the semantic and boundary information of Chinese words. However, such methods may introduce noise that affects the accuracy of named entity recognition. To this end, we propose a character relation enhanced Chinese NER model (CRENER). This model defines four types of tags that reflect the relationships between characters, and proposes a fine-grained modeling of the relationships between characters based on three types of relationships: adjacency relations between characters, relations between characters and tags, and relations between tags, to more accurately identify entity boundaries and improve Chinese NER accuracy. Specifically, we transform the Chinese NER task into a character-character relationship classification task, ensuring the accuracy of entity boundary recognition through joint modeling of relation tags. To enhance the model's ability to understand contextual information, WRENER further constructed an adapted transformer encoder that combines unscaled direction-aware and distance-aware masked self-attention mechanisms. Moreover, a relationship representation enhancement module was constructed to model predefined relationship tags, effectively mining the relationship representations between characters and tags. Experiments conducted on four well-known Chinese NER benchmark datasets have shown that the proposed model outperforms state-of-the-art baselines. The ablation experiment also demonstrated the effectiveness of the proposed model.

artificial intelligence, machine learning, natural language, (18 more...)

2412.10858

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Canada > Ontario > Toronto (0.04)
Europe > Portugal > Lisbon > Lisbon (0.04)
(14 more...)

Genre: Research Report (1.00)

Industry: Law (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

EngadgetDec-13-2024, 22:56:14 GMT

OpenAI published more of Elon Musk's emails if that's something you want to read

OpenAI published receipts, in the form of a long timeline of emails, texts and legal filings, illustrating that Elon Musk's injunction to prevent OpenAI from converting into a for-profit company runs counter to what he wanted in 2017. Essentially, OpenAI is providing even more evidence to the fact that its former co-founder wanted the AI startup to become a for-profit company and make him CEO. You should read the whole blog to get all of the details (and get a sense for how billionaires email) but the gist is that in 2017, Musk and OpenAI came to an understanding that the then non-profit needed to become a for-profit to "advance its mission" and seemingly capitalize on the public interest earned from its AI beating professional Dota 2 players in one-on-one matches. According to OpenAI, Musk proposed a new board structure where he "would unequivocally have initial control of the company," which OpenAI was opposed to. That led to the disagreements between Musk and OpenAI leadership, and him ultimately leaving the nonprofit's board in 2018.

large language model, machine learning, natural language, (8 more...)

Engadget

Industry: Law (0.61)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

WIREDDec-13-2024, 14:00:00 GMT

Human Misuse Will Make Artificial Intelligence More Dangerous

OpenAI CEO Sam Altman expects AGI, or artificial general intelligence--AI that outperforms humans at most tasks--around 2027 or 2028. Elon Musk's prediction is either 2025 or 2026, and he has claimed that he was "losing sleep over the threat of AI danger." As the limitations of current AI become increasingly clear, most AI researchers have come to the view that simply building bigger and more powerful chatbots won't lead to AGI. This story is from the WIRED World in 2025, our annual trends briefing. However, in 2025, AI will still pose a massive risk: not from artificial superintelligence, but from human misuse.

deepfake, machine learning, natural language, (16 more...)

WIRED

Country:

North America > United States > New York (0.05)
North America > United States > Colorado (0.05)
North America > Canada > British Columbia (0.05)
Europe > Netherlands (0.05)

Industry: Law (1.00)

Technology:

Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.92)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.76)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.39)

FOX NewsDec-13-2024, 09:00:08 GMT

Drone mystery: New Jersey homeowners threaten to take matters into their own hands if government doesn't act

New Jersey residents frustrated with a lack of answers regarding dozens of potential drone sightings in the skies above their homes are threatening to take action on their own if the government doesn't start providing answers. James Ward, a Jersey Shore Realtor, shared video on Facebook that he said shows "SUV-size drones" above Island Beach State Park taken Sunday. It's difficult to judge their size in the clip, which shows a number of lights hovering in the sky. "Dozens of SUV-size drones in all directions," the caption says. "Emerging at same time and flying over the ocean and then heading in different directions – what do you think?" "A good shotgun will fix that problem," one commenter replied.

artificial intelligence, drone, social media, (11 more...)

FOX News

Country:

Europe > Jersey (0.88)
North America > United States > California (0.15)
North America > United States > New Jersey > Morris County > Randolph (0.05)
(2 more...)

Genre: Personal > Interview (0.55)

Industry:

Law (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Government > Military (0.99)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.98)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)
Information Technology > Communications > Social Media (0.76)

Xue, Zhiyu, Liu, Guangliang, Chen, Bocheng, Johnson, Kristen Marie, Pedarsani, Ramtin

No Free Lunch for Defending Against Prefilling Attack by In-Context Learning

arXiv.org Artificial IntelligenceDec-13-2024

The security of Large Language Models (LLMs) has become an important research topic since the emergence of ChatGPT. Though there have been various effective methods to defend against jailbreak attacks, prefilling attacks remain an unsolved and popular threat against open-sourced LLMs. In-Context Learning (ICL) offers a computationally efficient defense against various jailbreak attacks, yet no effective ICL methods have been developed to counter prefilling attacks. In this paper, we: (1) show that ICL can effectively defend against prefilling jailbreak attacks by employing adversative sentence structures within demonstrations; (2) characterize the effectiveness of this defense through the lens of model size, number of demonstrations, over-defense, integration with other jailbreak attacks, and the presence of safety alignment. Given the experimental results and our analysis, we conclude that there is no free lunch for defending against prefilling jailbreak attacks with ICL. On the one hand, current safety alignment methods fail to mitigate prefilling jailbreak attacks, but adversative structures within ICL demonstrations provide robust defense across various model sizes and complex jailbreak attacks. On the other hand, LLMs exhibit similar over-defensiveness when utilizing ICL demonstrations with adversative structures, and this behavior appears to be independent of model size.

demonstration, large language model, machine learning, (19 more...)

2412.12192

Country:

North America > United States > Michigan (0.04)
North America > United States > California > Santa Barbara County > Santa Barbara (0.04)

Genre: Research Report (0.50)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Government (1.00)
Health & Medicine (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)