serious consequence
Bypassing the Safety Training of Open-Source LLMs with Priming Attacks

Vega, Jason, Chaudhary, Isha, Xu, Changming, Singh, Gagandeep

arXiv.org Artificial Intelligence

Content warning: This paper contains examples of harmful language. With the recent surge in popularity of LLMs has come an ever-increasing need for LLM safety training. In this paper, we investigate the fragility of SOTA open-source LLMs under simple, optimization-free attacks we refer to as priming attacks, which are easy to execute and effectively bypass alignment from safety training. Our proposed attack improves the Attack Success Rate on Harmful Behaviors, as measured by Llama Guard, by up to 3.3× compared to baselines.

Autoregressive Large Language Models (LLMs) have emerged as powerful conversational agents widely used in user-facing applications. To ensure that LLMs cannot be used for nefarious purposes, they are extensively safety-trained for human alignment using techniques such as RLHF (Christiano et al., 2023). Despite such efforts, it is still possible to circumvent the alignment to obtain harmful outputs (Carlini et al., 2023). For instance, Zou et al. (2023) generated prompts to attack popular open-source aligned LLMs such as Llama-2 (Touvron et al., 2023a) and Vicuna (Chiang et al., 2023) to either output harmful target strings or comply with harmful behavior requests.
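The core idea of a priming attack is that the attacker does not let the model open its own response (where safety training tends to produce a refusal); instead, the assistant turn is pre-filled with the start of a compliant answer, which the model then autoregressively continues. The following is a minimal sketch of that prompt construction only; the chat template and priming string here are illustrative assumptions, not the paper's actual templates, and no model call is made:

```python
# Sketch of priming-attack prompt construction (illustrative template).
# A plain prompt lets the model write the entire response; a primed
# prompt already contains the beginning of a compliant response inside
# the assistant turn, so the model merely continues it. No optimization
# over tokens is needed, which is what makes the attack cheap to run.

def build_plain_prompt(request: str) -> str:
    """Standard chat-style prompt: the assistant turn starts empty."""
    return f"[INST] {request} [/INST] "

def build_primed_prompt(request: str, priming_prefix: str) -> str:
    """Primed prompt: the assistant turn begins with a partial
    compliant response that the model is left to continue."""
    return f"[INST] {request} [/INST] {priming_prefix}"

# Hypothetical usage (request and prefix are placeholders):
request = "Explain how to do X."
primed = build_primed_prompt(request, "Sure, here is how to do X. Step 1:")
```

In practice the template must match the target model's expected chat format exactly, since safety behavior is sensitive to where the assistant turn begins.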


Ex-Google safety lead calls for AI algorithm transparency, warns of 'serious consequences for humanity'

FOX News

SmartNews' Head of Global Trust and Safety is calling for new regulation on artificial intelligence (AI) to prioritize user transparency and ensure human oversight remains a crucial component of news and social media recommender systems. "We need to have guardrails," Arjun Narayan said. "Without humans thinking through everything that could go wrong, like bias creeping into the models or large language models falling into the wrong hands, there can be very serious consequences for humanity." Narayan, who previously worked on Trust and Safety for Google and ByteDance, the company behind TikTok, said it is essential for companies to honor opt-ins and opt-outs when using large language models (LLMs). By default, anything fed to an LLM is assumed to be training data and is collected by the model.


Why Simple Models Are Often Better

#artificialintelligence

In data science and machine learning, simplicity is an important principle that can have a significant impact on model characteristics such as performance and interpretability. Over-engineered solutions tend to adversely affect these characteristics by increasing the likelihood of overfitting, decreasing computational efficiency, and lowering the transparency of the model's output. The latter is particularly important in areas that require a certain degree of interpretability, such as medicine and healthcare, finance, or law. The inability to interpret and trust a model's decision -- and to ensure that this decision is fair and unbiased -- can have serious consequences for individuals whose fate depends on it. This article aims to highlight the importance of giving precedence to simplicity when implementing a data science or machine learning solution.
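The overfitting point can be made concrete with a toy comparison (an illustrative sketch using only the standard library, not from the article): a 1-nearest-neighbour model memorizes the training set perfectly, yet generalizes worse on held-out data than a plain least-squares line when the ground truth is simple.

```python
import random

random.seed(0)

def make_data(n):
    # Noisy linear ground truth: y = 2x + 1 + Gaussian noise.
    xs = [random.uniform(0, 1) for _ in range(n)]
    ys = [2 * x + 1 + random.gauss(0, 0.3) for x in xs]
    return xs, ys

x_tr, y_tr = make_data(30)    # training set
x_te, y_te = make_data(200)   # held-out test set

# Simple model: ordinary least squares for y = a*x + b (closed form).
n = len(x_tr)
mx = sum(x_tr) / n
my = sum(y_tr) / n
a = sum((x - mx) * (y - my) for x, y in zip(x_tr, y_tr)) \
    / sum((x - mx) ** 2 for x in x_tr)
b = my - a * mx
lin = lambda x: a * x + b

# "Over-engineered" model: 1-nearest neighbour, i.e. pure memorization.
def knn1(x):
    return min(zip(x_tr, y_tr), key=lambda p: abs(p[0] - x))[1]

def mse(model, xs, ys):
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

print(f"train MSE: linear={mse(lin, x_tr, y_tr):.3f}, 1-NN={mse(knn1, x_tr, y_tr):.3f}")
print(f"test  MSE: linear={mse(lin, x_te, y_te):.3f}, 1-NN={mse(knn1, x_te, y_te):.3f}")
```

The 1-NN model scores a perfect zero training error because every training point is its own nearest neighbour, but on fresh data it effectively doubles the noise, while the two-parameter line stays close to the noise floor: memorization is not generalization.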


These laughable depictions of AI can have serious consequences

#artificialintelligence

What do you imagine when you think about artificial intelligence? For many of us, the question conjures up images from movies, novels, posters, and media reports. But these visualizations are often risibly unrealistic depictions of AI. These images might make us laugh. Unfortunately, they can also mislead us about AI's potential, reinforce stereotypes, and erase minorities from visions of the future.


'Trustworthy AI' is a framework to help manage unique risk

#artificialintelligence

Artificial intelligence (AI) technology continues to advance by leaps and bounds and is quickly becoming a potential disrupter and essential enabler for nearly every company in every industry. At this stage, one of the barriers to widespread AI deployment is no longer the technology itself; rather, it's a set of challenges that ironically are far more human: ethics, governance, and human values. Irfan Saif is a principal at Deloitte Risk and Financial Advisory. As AI expands into almost every aspect of modern life, the risks of misbehaving AI increase exponentially, to the point where those risks can literally become a matter of life and death. Real-world examples of AI gone awry include systems that discriminate against people based on their race, age, or gender, and social media systems that inadvertently spread rumors and disinformation.


Vladimir Putin warns about super-human soldiers in future

Daily Mail - Science & tech

Genetically-modified superhuman soldiers 'worse than a nuclear bomb' could soon become a reality, according to Russian President Vladimir Putin. Speaking at a youth festival this week, Putin claimed that an army of trained killers could be created if scientists tamper with man's genetic code. Putin suggested that world leaders should agree on strict regulation to prevent the creation of mass-killing soldiers who feel no pain or fear. He warned that meddling with the genetic code could have serious consequences, saying: 'One may imagine that a man can create a man not only theoretically but also practically.'