AITopics | Weber, Melanie

Collaborating Authors

Weber, Melanie

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Curvature-based Clustering on Graphs

Tian, Yu, Lubberts, Zachary, Weber, Melanie

arXiv.org Artificial IntelligenceJul-19-2023

Unsupervised node clustering (or community detection) is a classical graph learning task. In this paper, we study algorithms, which exploit the geometry of the graph to identify densely connected substructures, which form clusters or communities. Our method implements discrete Ricci curvatures and their associated geometric flows, under which the edge weights of the graph evolve to reveal its community structure. We consider several discrete curvature notions and analyze the utility of the resulting algorithms. In contrast to prior literature, we study not only single-membership community detection, where each node belongs to exactly one community, but also mixed-membership community detection, where communities may overlap. For the latter, we argue that it is beneficial to perform community detection on the line graph, i.e., the graph's dual. We provide both theoretical and empirical evidence for the utility of our curvature-based clustering algorithms. In addition, we give several results on the relationship between the curvature of a graph and that of its dual, which enable the efficient implementation of our proposed mixed-membership community detection approach and which may be of independent interest for curvature-based network analysis.

curvature, data mining, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2307.10155

Country:

North America > United States (0.67)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area > Neurology (0.67)
Education (0.67)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.66)

Add feedback

Continuum Limits of Ollivier's Ricci Curvature on data clouds: pointwise consistency and global lower bounds

Trillos, Nicolas Garcia, Weber, Melanie

arXiv.org Artificial IntelligenceJul-5-2023

Let $\mathcal{M} \subseteq \mathbb{R}^d$ denote a low-dimensional manifold and let $\mathcal{X}= \{ x_1, \dots, x_n \}$ be a collection of points uniformly sampled from $\mathcal{M}$. We study the relationship between the curvature of a random geometric graph built from $\mathcal{X}$ and the curvature of the manifold $\mathcal{M}$ via continuum limits of Ollivier's discrete Ricci curvature. We prove pointwise, non-asymptotic consistency results and also show that if $\mathcal{M}$ has Ricci curvature bounded from below by a positive constant, then the random geometric graph will inherit this global structural property with high probability. We discuss applications of the global discrete curvature bounds to contraction properties of heat kernels on graphs, as well as implications for manifold learning from data clouds. In particular, we show that the consistency results allow for characterizing the intrinsic curvature of a manifold from extrinsic curvature.

artificial intelligence, curvature, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2307.02378

Country: North America > United States > Wisconsin (0.14)

Genre:

Research Report (0.50)
Instructional Material > Course Syllabus & Notes (0.45)

Industry: Education (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

LegalRelectra: Mixed-domain Language Modeling for Long-range Legal Text Comprehension

Hua, Wenyue, Zhang, Yuchen, Chen, Zhe, Li, Josie, Weber, Melanie

arXiv.org Artificial IntelligenceDec-15-2022

The application of Natural Language Processing (NLP) to specialized domains, such as the law, has recently received a surge of interest. As many legal services rely on processing and analyzing large collections of documents, automating such tasks with NLP tools emerges as a key challenge. Many popular language models, such as BERT or RoBERTa, are general-purpose models, which have limitations on processing specialized legal terminology and syntax. In addition, legal documents may contain specialized vocabulary from other domains, such as medical terminology in personal injury text. Here, we propose LegalRelectra, a legal-domain language model that is trained on mixed-domain legal and medical corpora. We show that our model improves over general-domain and single-domain medical and legal language models when processing mixed-domain (personal injury) text. Our training architecture implements the Electra framework, but utilizes Reformer instead of BERT for its generator and discriminator. We show that this improves the model's performance on processing long passages and results in better long-range text comprehension.

artificial intelligence, natural language, text processing, (19 more...)

arXiv.org Artificial Intelligence

2212.08204

Country: North America > United States (0.68)

Genre: Research Report (0.82)

Industry:

Law > Torts Law (0.57)
Law > Litigation (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.71)

Add feedback

Identifying biases in legal data: An algorithmic fairness perspective

Sargent, Jackson, Weber, Melanie

arXiv.org Machine LearningSep-21-2021

As artificial intelligence enters the legal space, it is essential to recognize biases in legal data and ensure that they are not replicated and reinforced with legal technology [7, 13, 18]. Furthermore, understanding biases in legal data and developing discrimination-free technology could help the legal space to become fairer and more widely accessible. We typically find two types of biases in legal data: First, representation biases, i.e., certain social groups are over-or underrepresented in a data set. Second, sentencing disparities, i.e., the outcome of legal proceedings for similar cases varies across social groups. Representation biases may reflect disparities in policing (arrest rates) or in offense rates.

artificial intelligence, legal data, us government, (18 more...)

arXiv.org Machine Learning

2109.09946

Country: North America > United States > Illinois > Cook County (0.14)

Genre: Research Report > New Finding (0.94)

Industry:

Law > Criminal Law (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Data Science > Data Mining (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.30)

Add feedback

The Oracle of DLphi

Alfke, Dominik, Baines, Weston, Blechschmidt, Jan, Sarmina, Mauricio J. del Razo, Drory, Amnon, Elbrächter, Dennis, Farchmin, Nando, Gambara, Matteo, Glas, Silke, Grohs, Philipp, Hinz, Peter, Kivaranovic, Danijel, Kümmerle, Christian, Kutyniok, Gitta, Lunz, Sebastian, Macdonald, Jan, Malthaner, Ryan, Naisat, Gregory, Neufeld, Ariel, Petersen, Philipp Christian, Reisenhofer, Rafael, Sheng, Jun-Da, Thesing, Laura, Trunschke, Philipp, von Lindheim, Johannes, Weber, David, Weber, Melanie

arXiv.org Machine LearningJan-27-2019

This paper takes aim at achieving nothing less than the impossible. To be more precise, we seek to predict labels of unknown data from entirely uncorrelated labelled training data. This will be accomplished by an application of an algorithm based on deep learning, as well as, by invoking one of the most fundamental concepts of set theory. Estimating the behaviour of a system in unknown situations is one of the central problems of humanity. Indeed, we are constantly trying to produce predictions for future events to be able to prepare ourselves.

deep learning, neural network, training data, (18 more...)

arXiv.org Machine Learning

1901.05744

Country:

North America > United States (0.28)
Europe > United Kingdom > England (0.14)

Genre: Research Report (0.64)

Industry: Health & Medicine > Therapeutic Area (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

Heuristic Framework for Multi-Scale Testing of the Multi-Manifold Hypothesis

Medina, F. Patricia, Ness, Linda, Weber, Melanie, Djima, Karamatou Yacoubou

arXiv.org Machine LearningJul-1-2018

When analyzing empirical data, we often find that global linear models overestimate the number of parameters required. In such cases, we may ask whether the data lies on or near a manifold or a set of manifolds (a so-called multi-manifold) of lower dimension than the ambient space. This question can be phrased as a (multi-) manifold hypothesis. The identification of such intrinsic multiscale features is a cornerstone of data analysis and representation and has given rise to a large body of work on manifold learning. In this work, we review key results on multi-scale data analysis and intrinsic dimension followed by the introduction of a heuristic, multiscale framework for testing the multi-manifold hypothesis. Our method implements a hypothesis test on a set of spline-interpolated manifolds constructed from variance-based intrinsic dimensions. The workflow is suitable for empirical data analysis as we demonstrate on two use cases.

artificial intelligence, intrinsic dimension, machine learning, (19 more...)

arXiv.org Machine Learning

1807.00349

Country: North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report > New Finding (0.68)

Industry: Energy (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback