AITopics | hir

Collaborating Authors

hir

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

The Wisdom of Hindsight Makes Language Models Better Instruction Followers

Zhang, Tianjun, Liu, Fangchen, Wong, Justin, Abbeel, Pieter, Gonzalez, Joseph E.

arXiv.org Artificial IntelligenceFeb-10-2023

Reinforcement learning has seen wide success in finetuning large language models to better align with instructions via human feedback. The so-called algorithm, Reinforcement Learning with Human Feedback (RLHF) demonstrates impressive performance on the GPT series models. However, the underlying Reinforcement Learning (RL) algorithm is complex and requires an additional training pipeline for reward and value networks. In this paper, we consider an alternative approach: converting feedback to instruction by relabeling the original one and training the model for better alignment in a supervised manner. Such an algorithm doesn't require any additional parameters except for the original language model and maximally reuses the pretraining pipeline. To achieve this, we formulate instruction alignment problem for language models as a goal-reaching problem in decision making. We propose Hindsight Instruction Relabeling (HIR), a novel algorithm for aligning language models with instructions. The resulting two-stage algorithm shed light to a family of reward-free approaches that utilize the hindsightly relabeled instructions based on feedback. We evaluate the performance of HIR extensively on 12 challenging BigBench reasoning tasks and show that HIR outperforms the baseline algorithms and is comparable to or even surpasses supervised finetuning.

large language model, machine learning, reinforcement learning, (17 more...)

arXiv.org Artificial Intelligence

2302.05206

Country:

North America > United States > California > Alameda County > Berkeley (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment (0.68)
Education (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

Boosting Neural Networks to Decompile Optimized Binaries

Cao, Ying, Liang, Ruigang, Chen, Kai, Hu, Peiwei

arXiv.org Artificial IntelligenceJan-3-2023

Decompilation aims to transform a low-level program language (LPL) (eg., binary file) into its functionally-equivalent high-level program language (HPL) (e.g., C/C++). It is a core technology in software security, especially in vulnerability discovery and malware analysis. In recent years, with the successful application of neural machine translation (NMT) models in natural language processing (NLP), researchers have tried to build neural decompilers by borrowing the idea of NMT. They formulate the decompilation process as a translation problem between LPL and HPL, aiming to reduce the human cost required to develop decompilation tools and improve their generalizability. However, state-of-the-art learning-based decompilers do not cope well with compiler-optimized binaries. Since real-world binaries are mostly compiler-optimized, decompilers that do not consider optimized binaries have limited practical significance. In this paper, we propose a novel learning-based approach named NeurDP, that targets compiler-optimized binaries. NeurDP uses a graph neural network (GNN) model to convert LPL to an intermediate representation (IR), which bridges the gap between source code and optimized binary. We also design an Optimized Translation Unit (OTU) to split functions into smaller code fragments for better translation performance. Evaluation results on datasets containing various types of statements show that NeurDP can decompile optimized binaries with 45.21% higher accuracy than state-of-the-art neural decompilation frameworks.

artificial intelligence, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3564625.3567998

2301.00969

Country:

North America > United States > Texas > Travis County > Austin (0.15)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > California > Santa Clara County > Palo Alto (0.14)
(23 more...)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Expressive Description Logic with Instantiation Metamodelling

Kubincová, Petra (Comenius University in Bratislava) | Kľuka, Ján (Comenius University in Bratislava) | Homola, Martin (Comenius University in Bratislava)

AAAI ConferencesApr-19-2016

We investigate a higher-order extension of the description logic (DL) SROIQ that provides a fixedly interpreted role semantically coupled with instantiation. It is useful to express interesting meta-level constraints on the modelled ontology. We provide a model-theoretic characterization of the semantics, and we show the decidability by means of reduction.

hir, instanceof, sroiq, (15 more...)

AAAI Conferences

Fifteenth International Conference on the Principles of Knowledge Representation and Reasoning

Country: Europe > Slovakia > Bratislava > Bratislava (0.05)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.92)
Information Technology > Artificial Intelligence > Representation & Reasoning > Description Logic (0.91)

Add feedback