AITopics | html file

Collaborating Authors

html file

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

e2cfb719f58585f779d0a4f9f07bd618-Supplemental-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsApr-30-2026, 02:17:07 GMT

A.1 Creation of the Multimodal Web Document Dataset A.1.1 Collecting of a Large Number of HTMLFiles Our data collection process begins by considering the 25 most recent Common Crawl6 dumps available at the time of dataset creation. It contains webpages spanning from February 2020 to January/February 2023. We use a modified version of readability-lxml7 to extract the main text from the pages, discarding any pages that contain text of excessively high perplexity. This process yields a total of 41.2 billion documents. Selection of English content To identify non-English content, we apply the FastText classifier (Joulin et al., 2017) to the extracted text, e ectively filtering out 63.6% of the documents. Early text deduplication Often, a set of URLs is crawled repeatedly across di erent Common Crawl snapshots. However, the content of these websites may vary as web administrators make changes over time. Hence, at this stage, we refrain from deduplicating documents based on their URLs. Instead, we perform MinHash (Broder, 1997) deduplication with 16 hashes calculated over 5-grams. To further refine the data, we eliminate documents containing substantial proportions of repeated paragraphs and n-grams, employing the methodology described in MassiveText (Rae et al., 2022).

artificial intelligence, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country:

North America > United States (1.00)
Africa (1.00)
North America > Canada (0.93)
(2 more...)

Genre: Research Report > Experimental Study (0.46)

Industry:

Leisure & Entertainment > Sports > Martial Arts (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Law (1.00)
(14 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Communications > Mobile (0.68)
(2 more...)

Add feedback

e2cfb719f58585f779d0a4f9f07bd618-Supplemental-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsFeb-17-2026, 14:56:24 GMT

artificial intelligence, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country:

Asia > Russia (0.14)
Asia > Armenia (0.14)
Africa > Tanzania (0.14)
(82 more...)

Genre: Research Report > Experimental Study (0.46)

Industry:

Leisure & Entertainment > Sports > Martial Arts (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Law (1.00)
(13 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Communications > Mobile (0.68)
(2 more...)

Add feedback

Computer-Use Agents as Judges for Generative User Interface

Lin, Kevin Qinghong, Hu, Siyuan, Li, Linjie, Yang, Zhengyuan, Wang, Lijuan, Torr, Philip, Shou, Mike Zheng

arXiv.org Artificial IntelligenceNov-20-2025

Computer-Use Agents (CUA) are becoming increasingly capable of autonomously operating digital environments through Graphical User Interfaces (GUI). Yet, most GUI remain designed primarily for humans--prioritizing aesthetics and usability--forcing agents to adopt human-oriented behaviors that are unnecessary for efficient task execution. At the same time, rapid advances in coding-oriented language models (Coder) have transformed automatic GUI design. This raises a fundamental question: Can CUA as judges to assist Coder for automatic GUI design? To investigate, we introduce AUI-Gym, a benchmark for Automatic GUI development spanning 52 applications across diverse domains. Using language models, we synthesize 1560 tasks that simulate real-world scenarios. To ensure task reliability, we further develop a verifier that programmatically checks whether each task is executable within its environment. Building on this, we propose a Coder-CUA in Collaboration framework: the Coder acts as Designer, generating and revising websites, while the CUA serves as Judge, evaluating functionality and refining designs. Success is measured not by visual appearance, but by task solvability and CUA navigation success rate. To turn CUA feedback into usable guidance, we design a CUA Dashboard that compresses multi-step navigation histories into concise visual summaries, offering interpretable guidance for iterative redesign. By positioning agents as both designers and judges, our framework shifts interface design toward agent-native efficiency and reliability. Our work takes a step toward shifting agents from passive use toward active participation in digital environments. Our code and dataset are available at https://github.com/showlab/AUI.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2511.15567

Genre: Research Report (0.82)

Technology:

Information Technology > Human Computer Interaction > Interfaces (1.00)
Information Technology > Graphics (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
(2 more...)

Add feedback

WebChecker: A Versatile EVL Plugin for Validating HTML Pages with Bootstrap Frameworks

Cherukuri, Milind

arXiv.org Artificial IntelligenceFeb-11-2025

WebChecker is a plugin for Epsilon Validation Language (EVL), designed to validate both static and dynamic HTML pages utilizing frameworks like Bootstrap. By employing configurable EVL constraints, WebChecker enforces implicit rules governing HTML and CSS frameworks. The effectiveness of the plugin is demonstrated through its application on Bootstrap, the widely adopted HTML, CSS, and JavaScript framework. WebChecker comes with a set of EVL constraints to assess Bootstrap based web pages. To substantiate our claims, I present an illustrative example featuring two solutions that effectively enforce implicit rules.

artificial intelligence, programming language, webchecker, (15 more...)

arXiv.org Artificial Intelligence

2502.07479

Genre: Research Report (0.50)

Technology:

Information Technology > Software > Programming Languages (0.51)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.47)
Information Technology > Communications > Web (0.37)

Add feedback

Resources - Second Edition -- An Introduction to Statistical Learning

#artificialintelligenceApr-18-2023, 18:51:39 GMT

The original Chapter 10 lab made use of keras, an R package for deep learning that relies on Python. Getting keras to work on your computer can be a bit of a challenge. Installation instructions are available here. RStudio has recently released a new R package for deep learning, called torch, that does not require a Python installation. Daniel Falbel and Sigrid Keydana, two of the torch developers, translated our keras version of the Chapter 10 lab to torch.

chapter 10, rmd file, statistical learning, (15 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.40)

Add feedback

GitHub - bocheng97/TRGN510_Final: FIrst_taste_of_Machine_learning_in_R

#artificialintelligenceMar-15-2023, 12:22:54 GMT

In this project, I try to use a machine learning package, MLSeq from Bioconductor to find out the best model that predicts breast cancer subtype. I use 28 data sets from TCGA to train and test this model using 12 datasets. So I may do many copy and paste, but I will give my understanding and opinions in the R notebook. According to the Vignette, I've input the data and converted the data to be right data frames which are ready to do MLSeq. And the next step is to choose a model, do the Normalization and transformation, and use the normalized data to train model.

dataset, mlseq, possible biomarker, (10 more...)

#artificialintelligence

Industry: Health & Medicine > Therapeutic Area > Oncology (0.57)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Being A Game Developer Without Being A Game Developer With ChatGPT

#artificialintelligenceMar-10-2023, 14:13:31 GMT

When ChatGPT came out at the end of 2022, It caused a huge surprise for millions of people around the world due to its capabilities. I experienced the rise of it firsthand: I have personally seen the reactions of people using it and have had numerous interactions with GPT. For some people, It may even sound surreal to build a game with ChatGPT without writing code, but it is possible now. In case you have never heard it before, ChatGPT is a super powerful AI tool created by OpenAI, a company founded in San Francisco in late 2015 by Sam Altman, Elon Musk, and others. You can take a look here to learn its capabilities briefly.

chatgpt, extension, game developer, (9 more...)

#artificialintelligence

Country: North America > United States > California > San Francisco County > San Francisco (0.25)

Industry:

Leisure & Entertainment > Games > Computer Games (0.76)
Information Technology > Software (0.76)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

GitHub - cdpierse/transformers-interpret: Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

#artificialintelligenceDec-2-2022, 18:56:31 GMT

Transformers Interpret is a model explainability tool designed to work exclusively with the transformers package. In line with the philosophy of the Transformers package Transformers Interpret allows any transformers model to be explained in just two lines. Explainers are available for both text and computer vision models. Visualizations are also available in notebooks and as savable png and html files. Positive attribution numbers indicate a word contributes positively towards the predicted class, while negative numbers indicate a word contributes negatively towards the predicted class.

attribution, explainer, visualize, (15 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Vision (0.88)

Add feedback

DeviantArt is launching its own AI art generator

EngadgetNov-11-2022, 14:00:06 GMT

While not everyone's convinced that AI art is actual art, the generators used to whip them up are likely here to stay. DeviantArt is now getting into the space with a generator of its own called DreamUp, promising "safe and fair" generation for creators. The website says one of artists' main concerns about AI art is that their work may be used to train artificial intelligence models, which means the generator could spit out pieces in their style without their consent. In an attempt to give artists control over their work, DeviantArt is giving them the ability to choose whether or not the tool can use their style for direct inspiration. Further, the website is giving them the power to declare whether or not to allow their work to be used in datasets used to train third-party AI models.

deviantart, generator, own ai art generator, (9 more...)

Engadget

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

My First Impression Trying Python on Browser

#artificialintelligenceMay-12-2022, 05:36:46 GMT

Whenever we debate with other devs about the best programming language, we talk about JavaScript and Python for hours. Both are powerful, flexible languages that are dominating the world today. But a dead end to Python is its inability to run on browsers. JavaScript (JS), with the discovery of Node, runs on almost any platform. It even has modules to build machine learning algorithms.

browser, pyscript, python, (15 more...)

#artificialintelligence

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.30)

Add feedback