AITopics | synthesized

Collaborating Authors

synthesized

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

When Tom Eats Kimchi: Evaluating Cultural Bias of Multimodal Large Language Models in Cultural Mixture Contexts

Kim, Jun Seong, Thu, Kyaw Ye, Ismayilzada, Javad, Park, Junyeong, Kim, Eunsu, Ahmad, Huzama, An, Na Min, Thorne, James, Oh, Alice

arXiv.org Artificial IntelligenceMar-20-2025

In a highly globalized world, it is important for multi-modal large language models (MLLMs) to recognize and respond correctly to mixed-cultural inputs. For example, a model should correctly identify kimchi (Korean food) in an image both when an Asian woman is eating it, as well as an African man is eating it. However, current MLLMs show an over-reliance on the visual features of the person, leading to misclassification of the entities. To examine the robustness of MLLMs to different ethnicity, we introduce MixCuBe, a cross-cultural bias benchmark, and study elements from five countries and four ethnicities. Our findings reveal that MLLMs achieve both higher accuracy and lower sensitivity to such perturbation for high-resource cultures, but not for low-resource cultures. GPT-4o, the best-performing model overall, shows up to 58% difference in accuracy between the original and perturbed cultural settings in low-resource cultures. Our dataset is publicly available at: https://huggingface.co/datasets/kyawyethu/MixCuBe.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2503.16826

Country:

Asia > Azerbaijan (0.07)
Asia > Myanmar (0.06)
Asia > South Korea (0.05)
(11 more...)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

Add feedback

Development and Validation of the Provider Documentation Summarization Quality Instrument for Large Language Models

Croxford, Emma, Gao, Yanjun, Pellegrino, Nicholas, Wong, Karen K., Wills, Graham, First, Elliot, Schnier, Miranda, Burton, Kyle, Ebby, Cris G., Gorskic, Jillian, Kalscheur, Matthew, Khalil, Samy, Pisani, Marie, Rubeor, Tyler, Stetson, Peter, Liao, Frank, Goswami, Cherodeep, Patterson, Brian, Afshar, Majid

arXiv.org Artificial IntelligenceJan-15-2025

As Large Language Models (LLMs) are integrated into electronic health record (EHR) workflows, validated instruments are essential to evaluate their performance before implementation. Existing instruments for provider documentation quality are often unsuitable for the complexities of LLM-generated text and lack validation on real-world data. The Provider Documentation Summarization Quality Instrument (PDSQI-9) was developed to evaluate LLM-generated clinical summaries. Multi-document summaries were generated from real-world EHR data across multiple specialties using several LLMs (GPT-4o, Mixtral 8x7b, and Llama 3-8b). Validation included Pearson correlation for substantive validity, factor analysis and Cronbach's alpha for structural validity, inter-rater reliability (ICC and Krippendorff's alpha) for generalizability, a semi-Delphi process for content validity, and comparisons of high- versus low-quality summaries for discriminant validity. Seven physician raters evaluated 779 summaries and answered 8,329 questions, achieving over 80% power for inter-rater reliability. The PDSQI-9 demonstrated strong internal consistency (Cronbach's alpha = 0.879; 95% CI: 0.867-0.891) and high inter-rater reliability (ICC = 0.867; 95% CI: 0.867-0.868), supporting structural validity and generalizability. Factor analysis identified a 4-factor model explaining 58% of the variance, representing organization, clarity, accuracy, and utility. Substantive validity was supported by correlations between note length and scores for Succinct (rho = -0.200, p = 0.029) and Organized (rho = -0.190, p = 0.037). Discriminant validity distinguished high- from low-quality summaries (p < 0.001). The PDSQI-9 demonstrates robust construct validity, supporting its use in clinical practice to evaluate LLM-generated summaries and facilitate safer integration of LLMs into healthcare workflows.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2501.08977

Country: North America > United States > Wisconsin > Dane County > Madison (0.14)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Health Care Providers & Services (1.00)
Health & Medicine > Health Care Technology > Medical Record (0.69)
Health & Medicine > Diagnostic Medicine (0.67)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

AI-driven platform identifies and remediates biases in data - i4.0 today

#artificialintelligenceNov-18-2020, 20:20:08 GMT

The Community Edition is one part of Synthesized's data platform. The complete platform uses AI to automate all stages of data provisioning; the process of making data available in an orderly and secure way. This level of automation enables organisations to generate synthesized datasets, allowing them to better test data for new products and tools, validate mathematical models, or train machine learning models. Synthesized completely removes the heavy and costly burden of finding, collecting, and preparing data. Gartner estimates that data scientists and test engineers currently waste up to 80% of their valuable time on such repetitive tasks.

ai-driven platform identify, platform identify and remediate bias, synthesized, (1 more...)

#artificialintelligence

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.08)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.65)

Add feedback

Equitable tech: AI-enabled platform to reduce bias in datasets released

#artificialintelligenceNov-12-2020, 03:50:13 GMT

On Wednesday, London-based Synthesized launched a platform to help organizations identify and rectify biases in their data. Synthesized touts the platform as the "first publicly available solution to accurately detect and remove biases in data." A "freemium" Community Edition of the platform designed to mitigate bias in data is now available. "The reputational risk of all organisations is under threat due to biased data, and we've seen this will no longer be tolerated at any level. It's a burning priority now and must be dealt with as a matter of urgency, both from a legal and ethical standpoint," said Nicolai Baldin, CEO and founder of Synthesized in a press release.

dataset, platform, synthesized, (6 more...)

#artificialintelligence

Genre: Press Release (0.43)

Technology:

Information Technology > Artificial Intelligence (0.90)
Information Technology > Data Science (0.54)

Add feedback

Tractable Monotone Temporal Planning

Cooper, Martin C. (University of Toulouse) | Maris, Frederic (University of Toulouse) | Regnier, Pierre (University of Toulouse)

AAAI ConferencesJun-8-2012

This paper describes a polynomially-solvable sub-problem of temporal planning. Polynomiality follows from two assumptions. Firstly, by supposing that each sub-goal fluent can be established by at most one action, we can quickly determine which actions are necessary in any plan. Secondly, the monotonicity of sub-goal fluents allows us to express planning as an instance of STP≠ (Simple Temporal Problem, difference constraints). Our class includes temporally-expressive problems, which we illustrate with an example of chemical process planning.

constraint, monotone, planning problem, (15 more...)

AAAI Conferences

Twenty-Second International Conference on Automated Planning and Scheduling

Country:

North America > United States > Oklahoma > Payne County > Cushing (0.04)
Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)

Industry: Materials > Chemicals (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.98)

Add feedback