AITopics

Genre: Research Report (0.59)

Industry: Information Technology (0.59)

Technology:

Information Technology > Software (0.59)
Information Technology > Artificial Intelligence > Machine Learning (0.42)

Neural Information Processing SystemsFeb-19-2026, 05:46:38 GMT

2663c994c84a79b338bca613fe1ae223-Paper-Datasets_and_Benchmarks_Track.pdf

conference, dataset, refuse, (17 more...)

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Texas > Travis County > Austin (0.04)
Europe > Italy > Piedmont > Turin Province > Turin (0.04)
(11 more...)

Genre: Research Report > New Finding (0.93)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

Takahashi, Hayami, Takahashi, Kensuke

Logic of Montage

arXiv.org Artificial IntelligenceNov-25-2025

In expressing emotions, as an expression form separate from natural language, we propose an alternative form that complements natural language, acting as a proxy or window for emotional states. First, we set up an expression form "Effect of Contradictory Structure." "Effect of Contradictory Structure" is not static but dynamic. Effect in "Effect of Contradictory Structure" is unpleasant or pleasant, and the orientation to avoid that unpleasantness is considered pseudo-expression of will. Second, "Effect of Contradictory Structure" can be overlapped with each other. This overlapping operation is called "montage." A broader "Structure" that includes related "Effect of Contradictory Structure" and "Effect of Structure" are set up. Montage produces "Effect of Structure". In montage, it is necessary to set something like "strength," so we adopted Deleuze and Deleuze/Guattari's word "intensity" and set it as an element of our model. We set up a general theoretical framework - Word Import Between Systems (Models) and justified the import of "intensity" through Austin's use of the word "force." "Effect of Structure" process is demonstrated using the example of proceeding to the next level of education.

artificial intelligence, intensity, natural language, (18 more...)

2511.19063

Country:

North America > United States > Minnesota (0.28)
Europe > United Kingdom > England (0.28)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
North America > United States > Illinois > Cook County (0.14)

Genre: Personal (0.67)

Industry:

Leisure & Entertainment (0.46)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Emotion (0.46)

arXiv.org Artificial IntelligenceNov-19-2025

GeoPl@ntNet: A Platform for Exploring Essential Biodiversity Variables

Picek, Lukas, Leblanc, César, Joly, Alexis, Bonnet, Pierre, Palard, Rémi, Servajean, Maximilien

This paper describes GeoPl@ntNet, an interactive web application designed to make Essential Biodiversity V ariables accessible and understandable to everyone through dynamic maps and fact sheets. Its core purpose is to allow users to explore high-resolution AI-generated maps of species distributions, habitat types, and biodiversity indicators across Europe. These maps, developed through a cascading pipeline involving convolutional neural networks and large language models, provide an intuitive yet information-rich interface to better understand biodiversity, with resolutions as precise as 50 50 meters. The website also enables exploration of specific regions, allowing users to select areas of interest on the map (e.g., urban green spaces, protected areas, or riverbanks) to view local species and their coverage. Additionally, GeoPl@ntNet generates comprehensive reports for selected regions, including insights into the number of protected species, invasive species, and endemic species.

artificial intelligence, machine learning, natural language, (20 more...)

doi: 10.3233/FAIA251430

2511.1379

Country: Europe (1.00)

Genre:

Research Report (0.50)
Overview (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)

Neural Information Processing SystemsOct-9-2025, 21:15:55 GMT

2663c994c84a79b338bca613fe1ae223-Paper-Datasets_and_Benchmarks_Track.pdf

dataset, detection, proceedings, (17 more...)

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.05)
North America > United States > Texas > Travis County > Austin (0.04)
(12 more...)

Genre: Research Report > New Finding (0.93)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Software (1.00)
Information Technology > Security & Privacy (1.00)
Information Technology > Data Science > Data Mining (1.00)
(3 more...)

Popular ScienceJun-5-2025, 15:00:00 GMT

Worm towers are all around us

Breakthroughs, discoveries, and DIY tips sent every weekday. Biologists estimate that four out of five animals on Earth are nematodes (AKA roundworms).The tiny, wriggling, transparent invertebrates are the most abundant creatures on the planet and are found nearly everywhere–from permafrost to the deep ocean. More than one million species make up this ubiquitous group, which includes parasites, decomposers, predators, and more. "They're not about to take over the world, because they already did," says Serena Ding, a biologist at the Max Planck Institute of Animal Behavior in Konstanz, Germany tells Popular Science. "Global worming has already happened."

artificial intelligence, nematode, worm, (13 more...)

Popular Science

Country: Europe > Germany (0.25)

Genre: Research Report > New Finding (1.00)

Technology: Information Technology > Artificial Intelligence (0.35)

Neural Information Processing SystemsMay-27-2025, 04:18:40 GMT

Assemblage: Automatic Binary Dataset Construction for Machine Learning

artificial intelligence, assemblage, machine learning, (6 more...)

Genre: Research Report (0.42)

Industry: Information Technology (0.62)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.78)
Information Technology > Software (0.62)

Sadallah, Abdelrahman, Kotova, Daria, Kochmar, Ekaterina

What Makes Cryptic Crosswords Challenging for LLMs?

arXiv.org Artificial IntelligenceJan-14-2025

Cryptic crosswords are puzzles that rely on general knowledge and the solver's ability to manipulate language on different levels, dealing with various types of wordplay. Previous research suggests that solving such puzzles is challenging even for modern NLP models, including Large Language Models (LLMs). However, there is little to no research on the reasons for their poor performance on this task. In this paper, we establish the benchmark results for three popular LLMs: Gemma2, LLaMA3 and ChatGPT, showing that their performance on this task is still significantly below that of humans. We also investigate why these models struggle to achieve superior performance. We release our code and introduced datasets at https://github.com/bodasadallah/decrypting-crosswords.

anagram, llm, wordplay type, (13 more...)

2412.09012

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
North America > United States > New York > New York County > New York City (0.04)
(6 more...)

Genre: Research Report > New Finding (1.00)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceOct-29-2024

Is Function Similarity Over-Engineered? Building a Benchmark

Saul, Rebecca, Liu, Chang, Fleischmann, Noah, Zak, Richard, Micinski, Kristopher, Raff, Edward, Holt, James

Binary analysis is a core component of many critical security tasks, including reverse engineering, malware analysis, and vulnerability detection. Manual analysis is often time-consuming, but identifying commonly-used or previously-seen functions can reduce the time it takes to understand a new file. However, given the complexity of assembly, and the NP-hard nature of determining function equivalence, this task is extremely difficult. Common approaches often use sophisticated disassembly and decompilation tools, graph analysis, and other expensive pre-processing steps to perform function similarity searches over some corpus. In this work, we identify a number of discrepancies between the current research environment and the underlying application need. To remedy this, we build a new benchmark, REFuSE-Bench, for binary function similarity detection consisting of high-quality datasets and tests that better reflect real-world use cases. In doing so, we address issues like data duplication and accurate labeling, experiment with real malware, and perform the first serious evaluation of ML binary function similarity models on Windows data. Our benchmark reveals that a new, simple basline, one which looks at only the raw bytes of a function, and requires no disassembly or other pre-processing, is able to achieve state-of-the-art performance in multiple settings. Our findings challenge conventional assumptions that complex models with highly-engineered features are being used to their full potential, and demonstrate that simpler approaches can provide significant value.

data mining, machine learning, programming language, (22 more...)

2410.22677

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.05)
North America > United States > Texas > Travis County > Austin (0.04)
(12 more...)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

arXiv.org Artificial IntelligenceMay-7-2024

Assemblage: Automatic Binary Dataset Construction for Machine Learning

Liu, Chang, Saul, Rebecca, Sun, Yihao, Raff, Edward, Fuchs, Maya, Pantano, Townsend Southard, Holt, James, Micinski, Kristopher

Binary code is pervasive, and binary analysis is a key task in reverse engineering, malware classification, and vulnerability discovery. Unfortunately, while there exist large corpuses of malicious binaries, obtaining high-quality corpuses of benign binaries for modern systems has proven challenging (e.g., due to licensing issues). Consequently, machine learning based pipelines for binary analysis utilize either costly commercial corpuses (e.g., VirusTotal) or open-source binaries (e.g., coreutils) available in limited quantities. To address these issues, we present Assemblage: an extensible cloud-based distributed system that crawls, configures, and builds Windows PE binaries to obtain high-quality binary corpuses suitable for training state-of-the-art models in binary analysis. We have run Assemblage on AWS over the past year, producing 890k Windows PE and 428k Linux ELF binaries across 29 configurations. Assemblage is designed to be both reproducible and extensible, enabling users to publish "recipes" for their datasets, and facilitating the extraction of a wide array of features. We evaluated Assemblage by using its data to train modern learning-based pipelines for compiler provenance and binary function similarity. Our results illustrate the practical need for robust corpuses of high-quality Windows PE binaries in training modern learning-based binary analyses. Assemblage can be downloaded from https: //assemblage-dataset.net/.

assemblage, dataset, repository, (16 more...)

2405.03991

Country:

North America > United States > New York > New York County > New York City (0.05)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(2 more...)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Software (1.00)
Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)