AITopics | customizability

Collaborating Authors

customizability

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Archilles' Heel in Semi-open LLMs: Hiding Bottom against Recovery Attacks

Huang, Hanbo, Li, Yihan, Jiang, Bowen, Liu, Lin, Sun, Ruoyu, Liu, Zhuotao, Liang, Shiyu

arXiv.org Artificial IntelligenceOct-14-2024

Closed-source large language models deliver strong performance but have limited downstream customizability. Semi-open models, combining both closed-source and public layers, were introduced to improve customizability. However, parameters in the closed-source layers are found vulnerable to recovery attacks. In this paper, we explore the design of semi-open models with fewer closed-source layers, aiming to increase customizability while ensuring resilience to recovery attacks. We analyze the contribution of closed-source layer to the overall resilience and theoretically prove that in a deep transformer-based model, there exists a transition layer such that even small recovery errors in layers before this layer can lead to recovery failure. SCARA employs a fine-tuning-free metric to estimate the maximum number of layers that can be publicly accessible for customization. We apply it to five models (1.3B to 70B parameters) to construct semi-open models, validating their customizability on six downstream tasks and assessing their resilience against various recovery attacks on sixteen benchmarks. We compare SCARA to baselines and observe that it generally improves downstream customization performance and offers similar resilience with over 10 times fewer closed-source parameters. We empirically investigate the existence of transition layers, analyze the effectiveness of our scheme and finally discuss its limitations. Open-sourcing more parameters and structure details apparently enhances downstream customizability. However, Zanella-Beguelin et al. (2021) showed that semi-open LLMs with only a few closed-source parameters are vulnerable to model recovery attacks. Recovery attackers query the closed-source module and then train a new module that imitates its functionality. This can lead to the full replication and theft of closed-source modules (Solaiman, 2023). Recovery attackers targeting fully closed-source models seek to fine-tune a new model that precisely replicates the closed-source model (Tamber et al., 2024; Dubiński et al., 2024). In contrast, attackers in semi-open settings are not required to exactly replicate the closed-source module. Instead, they can fine-tune the closed-source module alongside the public module to reconstruct the overall functionality. While open-sourcing more layers enhances downstream flexibility, it also facilitates easier replication.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2410.11182

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
(6 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military (0.67)
Law (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors

Zhang, Zhexin, Lu, Yida, Ma, Jingyuan, Zhang, Di, Li, Rui, Ke, Pei, Sun, Hao, Sha, Lei, Sui, Zhifang, Wang, Hongning, Huang, Minlie

arXiv.org Artificial IntelligenceFeb-26-2024

The safety of Large Language Models (LLMs) has gained increasing attention in recent years, but there still lacks a comprehensive approach for detecting safety issues within LLMs' responses in an aligned, customizable and explainable manner. In this paper, we propose ShieldLM, an LLM-based safety detector, which aligns with general human safety standards, supports customizable detection rules, and provides explanations for its decisions. To train ShieldLM, we compile a large bilingual dataset comprising 14,387 query-response pairs, annotating the safety of responses based on various safety standards. Through extensive experiments, we demonstrate that ShieldLM surpasses strong baselines across four test sets, showcasing remarkable customizability and explainability. Besides performing well on standard detection datasets, ShieldLM has also been shown to be effective in real-world situations as a safety evaluator for advanced LLMs. We release ShieldLM at \url{https://github.com/thu-coai/ShieldLM} to support accurate and explainable safety detection under various safety standards, contributing to the ongoing efforts to enhance the safety of LLMs.

dataset, llm, shieldlm, (16 more...)

arXiv.org Artificial Intelligence

2402.16444

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Connecticut (0.04)
Asia > Singapore (0.04)
(4 more...)

Genre: Research Report (0.82)

Industry: Law (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

Add feedback

Optimizing Machine Translation through Prompt Engineering: An Investigation into ChatGPT's Customizability

Yamada, Masaru

arXiv.org Artificial IntelligenceAug-2-2023

This paper explores the influence of integrating the purpose of the translation and the target audience into prompts on the quality of translations produced by ChatGPT. Drawing on previous translation studies, industry practices, and ISO standards, the research underscores the significance of the pre-production phase in the translation process. The study reveals that the inclusion of suitable prompts in large-scale language models like ChatGPT can yield flexible translations, a feat yet to be realized by conventional Machine Translation (MT). The research scrutinizes the changes in translation quality when prompts are used to generate translations that meet specific conditions. The evaluation is conducted from a practicing translator's viewpoint, both subjectively and qualitatively, supplemented by the use of OpenAI's word embedding API for cosine similarity calculations. The findings suggest that the integration of the purpose and target audience into prompts can indeed modify the generated translations, generally enhancing the translation quality by industry standards. The study also demonstrates the practical application of the "good translation" concept, particularly in the context of marketing documents and culturally dependent idioms.

customizability, optimizing machine translation, prompt engineering, (2 more...)

arXiv.org Artificial Intelligence

2308.01391

Genre: Research Report > New Finding (0.53)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

AutoML in The Wild: Obstacles, Workarounds, and Expectations

Sun, Yuan, Song, Qiurong, Gui, Xinning, Ma, Fenglong, Wang, Ting

arXiv.org Artificial IntelligenceFeb-21-2023

Automated machine learning (AutoML) is envisioned to make ML While machine learning (ML) has been successfully applied to solve techniques accessible to ordinary users. Recent work has investigated many challenging tasks across various domains, building performant the role of humans in enhancing AutoML functionality ML solutions still requires substantial resources and extensive throughout a standard ML workflow. However, it is also critical to human expertise [34]. Automated machine learning (AutoML), a understand how users adopt existing AutoML solutions in complex, novel concept for automating the whole ML pipeline without (or real-world settings from a holistic perspective. To fill this gap, this as little as possible) human intervention [39], has emerged as a study conducted semi-structured interviews of AutoML users ( way to significantly reduce expensive development costs [75]. As = 19) focusing on understanding (1) the limitations of AutoML encountered illustrated in Figure 1, envisioned to enable domain experts without by users in their real-world practices, (2) the strategies considerable ML backgrounds (e.g., marketing and business analysts) users adopt to cope with such limitations, and (3) how the limitations to build ML solutions more easily, AutoML holds the promise and workarounds impact their use of AutoML.

artificial intelligence, automl, machine learning, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3544548.3581082

2302.10827

Country:

Europe > Germany > Hamburg (0.05)
North America > United States > Pennsylvania (0.04)
North America > United States > Virginia (0.04)
(6 more...)

Genre:

Research Report > New Finding (1.00)
Personal > Interview (0.66)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)
Banking & Finance (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.46)

Add feedback

PyTorch Adapt

Musgrave, Kevin, Belongie, Serge, Lim, Ser-Nam

arXiv.org Artificial IntelligenceNov-28-2022

PyTorch Adapt is a library for domain adaptation, a type of machine learning algorithm that re-purposes existing models to work in new domains. It is a fully-featured toolkit, allowing users to create a complete train/test pipeline in a few lines of code. It is also modular, so users can import just the parts they need, and not worry about being locked into a framework. One defining feature of this library is its customizability. In particular, complex training algorithms can be easily modified and combined, thanks to a system of composable, lazily-evaluated hooks. In this technical report, we explain in detail these features and the overall design of the library.

adapter, artificial intelligence, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2211.15673

Country: Europe > Denmark > Capital Region > Copenhagen (0.05)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.77)

Add feedback

Smart Home, The Next Generation: Closing the Gap between Users and Technology

Hwang, Amy (Univeristy of Toronto) | Hoey, Jesse (University of Waterloo)

AAAI ConferencesNov-5-2012

In this paper we discuss the gap that exists between the caregivers of older adults attempting to age-in-place and sophisticated ”smart-home” systems that can sense the environment and provide assistance when needed. We argue that smart-home systems need to be customizable by end-users, and we present a general-purpose model for cognitive assistive technology that can be adapted to suit many different tasks, users and environments. Al- though we can provide mechanisms for engineers and designers to build and adapt smart-home systems based on this general-purpose model, these mechanisms are not easily understood by or sufficiently user-friendly for actual end users such as older adults and their care- givers. Our goal is therefore to study how to bridge the gap between the end-users and this technology. In this paper, we discuss our work on this problem from both sides: developing technology that is customizable and general-purpose, and studying user’s abilities and needs when it comes to building smart-home systems to help with activities of daily living. We show how a large gap still exists, and propose ideas for how to bridge the gap.

artificial intelligence, human computer interaction, machine learning, (18 more...)

AAAI Conferences

2012 AAAI Fall Symposium Series

Country:

North America > Canada > Ontario > Toronto (0.28)
North America > United States > New Jersey > Hudson County > Hoboken (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
Europe > Poland (0.04)

Genre: Research Report > New Finding (0.68)

Industry:

Information Technology > Smart Houses & Appliances (1.00)
Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (0.46)

Technology:

Information Technology > Internet of Things (1.00)
Information Technology > Human Computer Interaction (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

Add feedback