AITopics | albany

The Order Effect: Investigating Prompt Sensitivity in Closed-Source LLMs

Guan, Bryan, Roosta, Tanya, Passban, Peyman, Rezagholizadeh, Mehdi

arXiv.org Artificial Intelligence, Feb-6-2025

As large language models (LLMs) become integral to diverse applications, ensuring their reliability under varying input conditions is crucial. One key issue affecting this reliability is order sensitivity, wherein slight variations in input arrangement can lead to inconsistent or biased outputs. Although recent advances have reduced this sensitivity, the problem remains unresolved. This paper investigates the extent of order sensitivity in closed-source LLMs by conducting experiments across multiple tasks, including paraphrasing, relevance judgment, and multiple-choice questions. Our results show that input order significantly affects performance across tasks, with shuffled inputs leading to measurable declines in output accuracy. Few-shot prompting demonstrates mixed effectiveness: it offers partial mitigation but fails to fully resolve the problem. These findings highlight persistent risks, particularly in high-stakes applications, and point to the need for more robust LLMs or improved input-handling techniques in future development.

In recent years, large language models (LLMs) have become essential across various applications, helping users complete tasks in diverse domains, thanks to their remarkable abilities in understanding, analyzing, and generating text (Shen et al., 2023a; Yu et al., 2023). However, LLMs are not without their problems and risks. Many of these issues, such as bias (Talat et al., 2022; Motoki et al., 2023), hallucination (Chen et al., 2023; Sadat et al., 2023), consistency (Tam et al., 2023; Ye et al., 2023), and reliability (Shen et al., 2023b), have been extensively discussed in the literature. However, a more fundamental challenge to the long-term success of LLMs is their ability to reason: the distinguishing factor between probabilistic pattern matching and logical understanding. This distinction has significant implications for the future of LLMs and how we employ these models in decision-making. One necessary requirement for reasoning is order independence.
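To make the notion of order sensitivity concrete, the following is a minimal sketch of how it could be measured on a single multiple-choice question: present the options in every possible order and check how often the model settles on the same option. This is not the paper's evaluation code. The function `order_consistency`, the `ask_model` callback, and the two toy models are hypothetical stand-ins for a real LLM call.

```python
from collections import Counter
from itertools import permutations
from typing import Callable, Sequence


def order_consistency(
    question: str,
    options: Sequence[str],
    ask_model: Callable[[str, Sequence[str]], int],
) -> float:
    """Fraction of option orderings on which the model picks its modal answer.

    `ask_model` is a hypothetical stand-in for an LLM call: it receives the
    question and the options in some order and returns the index it chose.
    """
    picks = Counter()
    n_orderings = 0
    for ordering in permutations(options):
        chosen_index = ask_model(question, ordering)
        # Record the chosen option by content, not by position.
        picks[ordering[chosen_index]] += 1
        n_orderings += 1
    return picks.most_common(1)[0][1] / n_orderings


def position_biased_model(question: str, options: Sequence[str]) -> int:
    # Toy model that always answers with the first option it sees.
    return 0


def content_based_model(question: str, options: Sequence[str]) -> int:
    # Toy model that finds the same option regardless of its position.
    return list(options).index("Paris")


question = "What is the capital of France?"
options = ["Paris", "Rome", "Madrid"]
biased_score = order_consistency(question, options, position_biased_model)
stable_score = order_consistency(question, options, content_based_model)
```

A score of 1.0 means the answer never changes with option order. The position-biased toy model scores lower because its answer depends only on where an option appears.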

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2502.04134

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Minnesota > Stearns County (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Evolving Deeper LLM Thinking

Lee, Kuang-Huei, Fischer, Ian, Wu, Yueh-Hua, Marwood, Dave, Baluja, Shumeet, Schuurmans, Dale, Chen, Xinyun

arXiv.org Artificial Intelligence, Jan-16-2025

We explore an evolutionary search strategy for scaling inference time compute in Large Language Models. The proposed approach, Mind Evolution, uses a language model to generate, recombine and refine candidate responses. The proposed approach avoids the need to formalize the underlying inference problem whenever a solution evaluator is available. Controlling for inference cost, we find that Mind Evolution significantly outperforms other inference strategies such as Best-of-N and Sequential Revision in natural language planning tasks. In the TravelPlanner and Natural Plan benchmarks, Mind Evolution solves more than 98% of the problem instances using Gemini 1.5 Pro without the use of a formal solver.
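The generate, recombine, and refine loop described in this abstract can be illustrated with a toy evolutionary search. The sketch below is not the Mind Evolution implementation. It assumes that candidates are short lists of digits, that simple random operators stand in for the language model, and that the evaluator counts matches against a fixed target. All names (`evolve`, `generate`, `recombine`, `refine`, `evaluate`, `TARGET`) are hypothetical.

```python
import random
from typing import Callable, List

Candidate = List[int]


def evolve(
    generate: Callable[[random.Random], Candidate],
    recombine: Callable[[Candidate, Candidate, random.Random], Candidate],
    refine: Callable[[Candidate, random.Random], Candidate],
    evaluate: Callable[[Candidate], float],
    population_size: int = 8,
    generations: int = 20,
    seed: int = 0,
) -> Candidate:
    """Toy evolutionary search driven only by a solution evaluator."""
    rng = random.Random(seed)
    population = [generate(rng) for _ in range(population_size)]
    for _ in range(generations):
        # Keep the better half as parents (elitist selection).
        population.sort(key=evaluate, reverse=True)
        parents = population[: population_size // 2]
        children = []
        while len(parents) + len(children) < population_size:
            a, b = rng.sample(parents, 2)
            children.append(refine(recombine(a, b, rng), rng))
        population = parents + children
    return max(population, key=evaluate)


# Hypothetical toy problem: recover a hidden digit sequence.
TARGET = [3, 1, 4, 1, 5, 9]


def generate(rng: random.Random) -> Candidate:
    return [rng.randrange(10) for _ in TARGET]


def recombine(a: Candidate, b: Candidate, rng: random.Random) -> Candidate:
    cut = rng.randrange(1, len(a))  # one-point crossover
    return a[:cut] + b[cut:]


def refine(candidate: Candidate, rng: random.Random) -> Candidate:
    revised = list(candidate)
    revised[rng.randrange(len(revised))] = rng.randrange(10)  # single edit
    return revised


def evaluate(candidate: Candidate) -> float:
    return sum(x == t for x, t in zip(candidate, TARGET))


best = evolve(generate, recombine, refine, evaluate)
```

The search only ever calls `evaluate`, so nothing about the problem has to be formalized beyond a way to score a candidate. That mirrors the abstract's point about needing only a solution evaluator.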

evolutionary algorithm, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2501.09891

Country:

North America > Canada > Alberta (0.14)
North America > United States > Illinois > Cook County > Chicago (0.05)
Europe > Spain > Galicia > Madrid (0.05)
(8 more...)

Genre: Research Report > New Finding (0.46)

Industry: Consumer Products & Services > Travel (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

Stochastic Online AUC Maximization

Department of Mathematics and Statistics, SUNY at Albany, Albany, NY, 12222, USA
Department of Computer Science, SUNY at Albany, Albany, NY, 12222, USA

Neural Information Processing Systems, Mar-12-2024, 17:15:30 GMT

Area under the ROC curve (AUC) is a metric that is widely used for measuring classification performance on imbalanced data. It is of theoretical and practical interest to develop online learning algorithms that maximize AUC for large-scale data. A specific challenge in developing an online AUC maximization algorithm is that the learning objective function is usually defined over a pair of training examples of opposite classes, and existing methods achieve online processing with higher space and time complexity. In this work, we propose a new stochastic online algorithm for AUC maximization. In particular, we show that AUC optimization can be equivalently formulated as a convex-concave saddle point problem. From this saddle representation, a stochastic online algorithm (SOLAM) is proposed which has time and space complexity of one datum. We establish theoretical convergence of SOLAM with high probability and demonstrate its effectiveness on standard benchmark datasets.
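To see why the AUC objective is defined over pairs of examples from opposite classes, here is a minimal sketch of the empirical AUC computed directly from its pairwise definition. It is not SOLAM and performs no optimization. The function name `pairwise_auc` and the sample data are illustrative assumptions.

```python
from typing import Sequence


def pairwise_auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Empirical AUC: fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a correct pair.
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y != 1]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one example of each class")
    correct = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                correct += 1.0
            elif p == n:
                correct += 0.5
    return correct / (len(positives) * len(negatives))


scores = [0.9, 0.8, 0.35, 0.6, 0.2]
labels = [1, 0, 1, 0, 0]
auc = pairwise_auc(scores, labels)
```

Every positive example has to be compared with every negative one, so a naive online method must keep earlier examples in memory. That is the cost a formulation with per-datum time and space complexity avoids.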

artificial intelligence, inductive learning, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > Albany County > Albany (0.76)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Industry: Education > Educational Setting > Online (0.36)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Towards Uncertainty-Aware Language Agent

Han, Jiuzhou, Buntine, Wray, Shareghi, Ehsan

arXiv.org Artificial Intelligence, Feb-7-2024

While Language Agents have achieved promising success by placing Large Language Models at the core of a more versatile design that dynamically interacts with the external world, the existing approaches neglect the notion of uncertainty during these interactions. We present the Uncertainty-Aware Language Agent (UALA), a framework that orchestrates the interaction between the agent and the external world using uncertainty quantification. Compared with other well-known counterparts like ReAct, our extensive experiments across 3 representative tasks (HotpotQA, StrategyQA, MMLU) and various LLM sizes demonstrate that UALA brings a significant improvement of performance, while having a substantially lower reliance on the external world (i.e., reduced number of tool calls and tokens). Our analyses provide various insights including the great potential of UALA compared with agent fine-tuning, and underscore the unreliability of verbalised confidence of LLMs as a proxy for uncertainty.

calibration, estimation, hotpotqa, (15 more...)

arXiv.org Artificial Intelligence

2401.14016

Country:

North America > United States > Colorado (0.05)
North America > United States > New York > Albany County > Albany (0.04)
North America > United States > Georgia > Dougherty County > Albany (0.04)
(12 more...)

Genre:

Research Report > New Finding (0.67)
Research Report > Experimental Study (0.46)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)

FireAct: Toward Language Agent Fine-tuning

Chen, Baian, Shu, Chang, Shareghi, Ehsan, Collier, Nigel, Narasimhan, Karthik, Yao, Shunyu

arXiv.org Artificial Intelligence, Oct-9-2023

Recent efforts have augmented language models (LMs) with external tools or environments, leading to the development of language agents that can reason and act. However, most of these agents rely on few-shot prompting techniques with off-the-shelf LMs. In this paper, we investigate and argue for the overlooked direction of fine-tuning LMs to obtain language agents. Using a setup of question answering (QA) with a Google search API, we explore a variety of base LMs, prompting methods, fine-tuning data, and QA tasks, and find language agents are consistently improved after fine-tuning their backbone LMs. For example, fine-tuning Llama2-7B with 500 agent trajectories generated by GPT-4 leads to a 77% HotpotQA performance increase. Furthermore, we propose FireAct, a novel approach to fine-tuning LMs with trajectories from multiple tasks and prompting methods, and show having more diverse fine-tuning data can further improve agents. Along with other findings regarding scaling effects, robustness, generalization, efficiency and cost, our work establishes comprehensive benefits of fine-tuning LMs for agents, and provides an initial set of experimental designs, insights, as well as open questions toward language agent fine-tuning.

arxiv, fine-tuning, language model, (16 more...)

arXiv.org Artificial Intelligence

2310.05915

Country:

North America > United States > New York > Albany County > Albany (0.05)
North America > United States > Georgia > Dougherty County > Albany (0.05)
North America > United States > Colorado (0.04)
(12 more...)

Genre: Research Report (1.00)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Government > Regional Government > North America Government > United States Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)

Learning to Decompose: Hypothetical Question Decomposition Based on Comparable Texts

Zhou, Ben, Richardson, Kyle, Yu, Xiaodong, Roth, Dan

arXiv.org Artificial Intelligence, Oct-30-2022

Explicit decomposition modeling, which involves breaking down complex tasks into more straightforward and often more interpretable sub-tasks, has long been a central theme in developing robust and interpretable NLU systems. However, despite the many datasets and resources built as part of this effort, the majority have small-scale annotations and limited scope, which is insufficient to solve general decomposition tasks. In this paper, we look at large-scale intermediate pre-training of decomposition-based transformers using distant supervision from comparable texts, particularly large-scale parallel news. We show that with such intermediate pre-training, developing robust decomposition-based models for a diverse range of tasks becomes more feasible. For example, on semantic parsing, our model, DecompT5, improves 20% to 30% on two datasets, Overnight and TORQUE, over the baseline language model. We further use DecompT5 to build a novel decomposition-based QA system named DecompEntail, improving over state-of-the-art models, including GPT-3, on both HotpotQA and StrategyQA by 8% and 4%, respectively.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2210.16865

Country:

Africa > South Africa (0.29)
Europe > Italy (0.14)
North America > United States > Georgia > Dougherty County > Albany (0.05)
(11 more...)

Genre: Research Report > Promising Solution (0.34)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Government > Military (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.87)

Classification of Misinformation in New Articles using Natural Language Processing and a Recurrent Neural Network

Cunha, Brendan, Manikonda, Lydia

arXiv.org Artificial Intelligence, Oct-24-2022

Misinformation in news articles has been one of the main topics for discussion over the past few years. There have been several organizations that developed methods for assessing reliability and personal bias of news coverage. In today's day and age, it is unnatural to arbitrarily trust the news outlets that claim to be truly objective and unbiased because the term "bias" is relative. What one person perceives as …

One of the first issues to address with these labels is the inconsistency of scales used. For example, some labels are scaled from 0-3 in terms of level of misinformation, others are scaled in a binary manner with 0 and 1, and some have 4 categorical values based on levels of media bias. So there is quite a bit of processing that needed to be done to normalize everything and transform the qualitative variables into quantitative variables.
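One way to bring such mixed label scales onto a common footing is to map each label to its rank within its own ordered scale and rescale that rank to the range [0, 1]. The sketch below assumes this approach; it is not the authors' preprocessing. The function `normalize_ordinal`, the `SCHEMES` table, and in particular the four bias category names are hypothetical placeholders.

```python
from typing import Dict, Hashable, List, Sequence


def normalize_ordinal(label: Hashable, ordered_levels: Sequence[Hashable]) -> float:
    """Map a label to [0, 1] by its rank within an ordered list of levels."""
    rank = list(ordered_levels).index(label)
    return rank / (len(ordered_levels) - 1)


# Three label schemes like those described in the text. The category names in
# "bias_level" are hypothetical placeholders, not the dataset's actual values.
SCHEMES: Dict[str, List[Hashable]] = {
    "scale_0_3": [0, 1, 2, 3],
    "binary": [0, 1],
    "bias_level": ["least", "some", "moderate", "most"],
}

normalized = {
    "scale_0_3": normalize_ordinal(2, SCHEMES["scale_0_3"]),
    "binary": normalize_ordinal(1, SCHEMES["binary"]),
    "bias_level": normalize_ordinal("some", SCHEMES["bias_level"]),
}
```

After this step every label is a number between 0 and 1, so labels from different sources can be compared or combined. Whether equal spacing between categories is appropriate is a modeling assumption that would need checking against the actual data.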

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2210.13534

Country:

Asia > Russia (0.14)
North America > United States > New York > Rensselaer County > Troy (0.04)
Europe > Russia (0.04)

Genre: Research Report (0.40)

Industry:

Media > News (1.00)
Government > Regional Government > North America Government > United States Government (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.87)

Artificial Intelligence Coming to University at Albany

#artificialintelligence, Jun-23-2022, 08:35:18 GMT

In a press release on Tuesday, Governor Kathy Hochul announced that the University at Albany will become the home of a new artificial intelligence supercomputing initiative. The $200 million project will turn the building which was formerly Albany High School into an engineering college capable of housing a supercomputer that can reach a quintillion computations per second. It would be the first university-based supercomputer capable of reaching that kind of production. In the press release, Governor Hochul said "My administration is steadfast in its commitment to transform SUNY into a globally renowned, 21st century education leader. This funding will help drive economic revenue by attracting companies to New York's emerging advanced research centers, creating jobs and strengthening communities for decades to come."

albany, artificial intelligence, university, (4 more...)

#artificialintelligence

Country: North America > United States > New York (0.28)

Industry: Education (0.58)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Scientific Computing (0.91)

Machine Learning Models Predict COVID-19 Impact in Smaller Cities

#artificialintelligence, Apr-17-2020, 14:24:07 GMT

According to a robust machine learning model that can predict pandemic impact even in smaller cities, with 75% of the population in the Capital Region in New York remaining at home, the COVID-19 pandemic will peak locally in the second half of May. If the rate of people staying home drops to 50%, it will peak in early June. Rensselaer Polytechnic Institute researcher Malik Magdon-Ismail tailored the models he is developing to work with sparse data points, like those available during the early phase in a pandemic or in smaller cities, which ordinarily make trend-spotting difficult. "There are no simple, robust, general tools that, for example, officials in Albany could use to make projections," said Magdon-Ismail, a professor of computer science, and expert in machine learning, data mining, and pattern recognition. "These models show that the projections vary enormously from one city to another. This knowledge could relieve some of the uncertainty that is around in developing policy."

infection, learning model predict covid-19 impact, magdon-ismail, (6 more...)

#artificialintelligence

Country: North America > United States > New York > Schenectady County (0.06)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.62)
Health & Medicine > Therapeutic Area > Immunology (0.62)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Capitol Watch: New York to take on artificial intelligence

#artificialintelligence, Jul-28-2019, 07:43:34 GMT

In New York government news, state officials are examining the opportunities -- and risks -- posed by artificial intelligence. Gov. Andrew Cuomo, a Democrat, signed legislation this month that creates a 13-member commission tasked with reviewing the emerging technology and what it will mean for New Yorkers. Meanwhile, the ongoing scourge of opioid abuse is getting some attention with lawmakers announcing a series of public hearings to identify ways the state could do a better job of addressing the problem. While no one is predicting a robot uprising any time soon, state officials say they are concerned by how the rise of artificial intelligence and robotics could affect jobs, the delivery of government services and personal privacy. The New York State Artificial Intelligence, Robotics and Automation Commission, approved by lawmakers earlier this year, will also look at how A.I. could be used "in unlawful or unsafe ways."

artificial intelligence, hinchey, new york, (10 more...)

#artificialintelligence

Country: North America > United States > New York (1.00)

Industry:

Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.57)
