AITopics | kale

Collaborating Authors

kale

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Awadalla, Anas, Xue, Le, Shu, Manli, Yan, An, Wang, Jun, Purushwalkam, Senthil, Shen, Sheng, Lee, Hannah, Lo, Oscar, Park, Jae Sung, Guha, Etash, Savarese, Silvio, Schmidt, Ludwig, Choi, Yejin, Xiong, Caiming, Xu, Ran

arXiv.org Artificial IntelligenceNov-11-2024

Table 1: Comparison of open-source synthetic image-text datasets: We compare various datasets in terms of scale (number of samples), density (average number of words per sample), whether they are knowledge-augmented (meaning that the caption includes information found in image's web scraped alt-text), and the size of the captioning model used to generate the descriptions. For KALE, we create an initial pool of 100M captions from a 17B parameter model and use it to distill a 2B parameter model that matches the performance of the larger 17B model. We introduce BLIP3-KALE, a dataset of 218 million image-text pairs that advances the state of knowledge-augmented image captioning. KALE builds upon recent work in this area, particularly CapsFusion [28], which pioneered the use of large language models to fuse synthetically generated captions with alt-text to incorporate real-world knowledge.

caption, dataset, semanticscholar, (16 more...)

arXiv.org Artificial Intelligence

2411.07461

Country:

North America > United States > California > Alameda County > Berkeley (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.41)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)

Add feedback

KALE: An Artwork Image Captioning System Augmented with Heterogeneous Graph

Jiang, Yanbei, Ehinger, Krista A., Lau, Jey Han

arXiv.org Artificial IntelligenceSep-17-2024

Exploring the narratives conveyed by fine-art paintings is a challenge in image captioning, where the goal is to generate descriptions that not only precisely represent the visual content but also offer a in-depth interpretation of the artwork's meaning. The task is particularly complex for artwork images due to their diverse interpretations and varied aesthetic principles across different artistic schools and styles. In response to this, we present KALE Knowledge-Augmented vision-Language model for artwork Elaborations), a novel approach that enhances existing vision-language models by integrating artwork metadata as additional knowledge. KALE incorporates the metadata in two ways: firstly as direct textual input, and secondly through a multimodal heterogeneous knowledge graph. To optimize the learning of graph representations, we introduce a new cross-modal alignment loss that maximizes the similarity between the image and its corresponding metadata. Experimental results demonstrate that KALE achieves strong performance (when evaluated with CIDEr, in particular) over existing state-of-the-art work across several artwork datasets. Source code of the project is available at https://github.com/Yanbei-Jiang/Artwork-Interpretation.

artwork, graph, metadata, (15 more...)

arXiv.org Artificial Intelligence

2409.10921

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Netherlands > North Holland > Amsterdam (0.04)
Europe > Italy > Trentino-Alto Adige/Südtirol > Trentino Province > Trento (0.04)

Genre:

Research Report > New Finding (0.48)
Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Quick Dense Retrievers Consume KALE: Post Training Kullback Leibler Alignment of Embeddings for Asymmetrical dual encoders

Campos, Daniel, Magnani, Alessandro, Zhai, ChengXiang

arXiv.org Artificial IntelligenceJun-1-2023

In this paper, we consider the problem of improving the inference latency of language model-based dense retrieval systems by introducing structural compression and model size asymmetry between the context and query encoders. First, we investigate the impact of pre and post-training compression on the MSMARCO, Natural Questions, TriviaQA, SQUAD, and SCIFACT, finding that asymmetry in the dual encoders in dense retrieval can lead to improved inference efficiency. Knowing this, we introduce Kullback Leibler Alignment of Embeddings (KALE), an efficient and accurate method for increasing the inference efficiency of dense retrieval methods by pruning and aligning the query encoder after training. Specifically, KALE extends traditional Knowledge Distillation after bi-encoder training, allowing for effective query encoder compression without full retraining or index generation. Using KALE and asymmetric training, we can generate models which exceed the performance of DistilBERT despite having 3x faster inference.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2304.01016

Country:

North America > United States > Washington > King County > Seattle (0.04)
North America > United States > Illinois > Champaign County > Urbana (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

Kale

AAAI ConferencesFeb-8-2022, 11:03:33 GMT

In the age of big data, data analytics expertise is increasingly valuable. This expertise includes not only formal knowledge, such as algorithms and statistics, but also practical skills that are learned through practice and are difficult to teach in classroom settings: management and preparation of data sets, feature design, and iterative exploratory analysis. Semantic workflows are a valuable tool for empowering non-expert users to carry out systematic analytics on large datasets using sophisticated machine learning methods captured in the workflows and their semantic constraints. In this paper we motivate and illustrate the role of visualizations in the usability of workflows by non-experts as well as their role in learning practical data analytics skills to gain interesting insights into data and methods. This capability is particularly important when confronting large datasets, where the selection of appropriate methods and their configuration with the best parameter and algorithm selections can be crucial in obtaining useful results.

expertise, selection, workflow, (2 more...)

AAAI Conferences

Technology:

Information Technology > Data Science (0.93)
Information Technology > Artificial Intelligence > Machine Learning (0.66)

Add feedback

The Future Of Artificial Intelligence Is Now - Liwaiwai

#artificialintelligenceJun-24-2021, 23:45:08 GMT

Imagine if doctors, nurses, and health care researchers had the ability to interrogate both the healthy and diseased states of a patient's biology and then use that data to uncover a network of causal relationships between historical, molecular, and other data types to approach treatment or develop the right type of drugs. BERG Health is using this information with a platform that uses artificial intelligence (AI) and machine learning to examine disparate sets of data from patient biology and electronic medical records. "Artificial intelligence has the potential to disrupt many industries, but perhaps most importantly is its impact on health care, where the unsolved challenge is getting the right treatments to the right patients by utilizing tremendous amounts of experimental and observational data," says Niven Narain, co-founder, president and CEO of BERG Health. "By comparing individual patient health data to the greater population health data, we can develop prescriptive analytics that can determine what treatments will work best for that patient, while also warning patients of potential side effects." AI is a set of complex algorithms and technologies that enables machines, systems and software to make human-like decisions.

ai application, artificial intelligence, information, (12 more...)

#artificialintelligence

Industry: Health & Medicine > Health Care Technology > Medical Record (0.55)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Biomedical Informatics > Clinical Informatics (0.76)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.41)

Add feedback

KALE Flow: A Relaxed KL Gradient Flow for Probabilities with Disjoint Support

Glaser, Pierre, Arbel, Michael, Gretton, Arthur

arXiv.org Machine LearningJun-16-2021

We study the gradient flow for a relaxed approximation to the Kullback-Leibler (KL) divergence between a moving source and a fixed target distribution. This approximation, termed the KALE (KL approximate lower-bound estimator), solves a regularized version of the Fenchel dual problem defining the KL over a restricted class of functions. When using a Reproducing Kernel Hilbert Space (RKHS) to define the function class, we show that the KALE continuously interpolates between the KL and the Maximum Mean Discrepancy (MMD). Like the MMD and other Integral Probability Metrics, the KALE remains well defined for mutually singular distributions. Nonetheless, the KALE inherits from the limiting KL a greater sensitivity to mismatch in the support of the distributions, compared with the MMD. These two properties make the KALE gradient flow particularly well suited when the target distribution is supported on a low-dimensional manifold. Under an assumption of sufficient smoothness of the trajectories, we show the global convergence of the KALE flow. We propose a particle implementation of the flow given initial samples from the source and the target distribution, which we use to empirically confirm the KALE's properties.

gradient flow, kale, kale flow, (14 more...)

arXiv.org Machine Learning

2106.08929

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Washington > King County > Seattle (0.04)
North America > United States > New York (0.04)
(3 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

Neural Network Gaussian Process Considering Input Uncertainty for Composite Structures Assembly

Lee, Cheolhei, Wu, Jianguo, Wang, Wenjia, Yue, Xiaowei

arXiv.org Machine LearningNov-21-2020

Developing machine learning enabled smart manufacturing is promising for composite structures assembly process. To improve production quality and efficiency of the assembly process, accurate predictive analysis on dimensional deviations and residual stress of the composite structures is required. The novel composite structures assembly involves two challenges: (i) the highly nonlinear and anisotropic properties of composite materials; and (ii) inevitable uncertainty in the assembly process. To overcome those problems, we propose a neural network Gaussian process model considering input uncertainty for composite structures assembly. Deep architecture of our model allows us to approximate a complex process better, and consideration of input uncertainty enables robust modeling with complete incorporation of the process uncertainty. Based on simulation and case study, the NNGPIU can outperform other benchmark methods when the response function is nonsmooth and nonlinear. Although we use composite structure assembly as an example, the proposed methodology can be applicable to other engineering systems with intrinsic uncertainties.

composite structure, kernel, nngpiu, (13 more...)

arXiv.org Machine Learning

2011.10861

Country:

North America > United States > Virginia > Montgomery County > Blacksburg (0.04)
North America > United States > New York (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(2 more...)

Genre: Research Report (0.40)

Industry: Materials (1.00)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

The way you version control your ML projects is wrong

#artificialintelligenceOct-4-2020, 11:45:43 GMT

A Data Scientist spends most of his time inside a Jupyter Notebook exploring the data and drafting ideas. Usually, when we try to version our work, we end up with a bunch of duplicated ipynb files, assuming different naming schemes. Can we have something that automatically snapshots our work, before and after every step in an ML pipeline? Moreover, can we get started using it without a ton of configuration needed? Just open a Notebook, do our thing and be sure that everything else will take care of itself.

artificial intelligence, machine learning, notebook, (17 more...)

#artificialintelligence

Country: Europe (0.15)

Industry: Government (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.94)
Information Technology > Communications (0.72)

Add feedback

Filters

Collaborating Authors

kale

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

neurips_2021_kale_flow(10).pdf

neurips_2021_kale_flow(10).pdf

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

KALE: An Artwork Image Captioning System Augmented with Heterogeneous Graph

Quick Dense Retrievers Consume KALE: Post Training Kullback Leibler Alignment of Embeddings for Asymmetrical dual encoders

Kale

The Future Of Artificial Intelligence Is Now - Liwaiwai

KALE Flow: A Relaxed KL Gradient Flow for Probabilities with Disjoint Support

Neural Network Gaussian Process Considering Input Uncertainty for Composite Structures Assembly

The way you version control your ML projects is wrong