AITopics | Linn County

Collaborating Authors

Linn County

Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers

Neural Information Processing SystemsFeb-17-2026, 04:23:39 GMT

Despite several works trying to reduce their computational cost, most of LLMs still adopt attention layers between all pairs of tokens in the sequence, thus incurring a quadratic cost. In this study, we present a novel approach that dynamically prunes contextual information while preserving the model's expressiveness, resulting in reduced memory and computational

arxiv preprint arxiv, large language model, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Lebanon (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Oregon > Linn County > Lebanon (0.04)
(7 more...)

Genre:

Research Report > Promising Solution (0.87)
Research Report > New Finding (0.87)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers

Neural Information Processing SystemsOct-9-2025, 07:49:57 GMT

arxiv preprint arxiv, large language model, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Lebanon (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Oregon > Linn County > Lebanon (0.04)
(7 more...)

Genre:

Research Report > Promising Solution (0.87)
Research Report > New Finding (0.87)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Collective Memory and Narrative Cohesion: A Computational Study of Palestinian Refugee Oral Histories in Lebanon

Awwad, Ghadeer, Dunagan, Lavinia, Gamba, David, Rayan, Tamara N.

arXiv.org Artificial IntelligenceJan-23-2025

This study uses the Palestinian Oral History Archive (POHA) to investigate how Palestinian refugee groups in Lebanon sustain a cohesive collective memory of the Nakba through shared narratives. Grounded in Halbwachs' theory of group memory, we employ statistical analysis of pairwise similarity of narratives, focusing on the influence of shared gender and location. We use textual representation and semantic embeddings of narratives to represent the interviews themselves. Our analysis demonstrates that shared origin is a powerful determinant of narrative similarity across thematic keywords, landmarks, and significant figures, as well as in semantic embeddings of the narratives. Meanwhile, shared residence fosters cohesion, with its impact significantly amplified when paired with shared origin. Additionally, women's narratives exhibit heightened thematic cohesion, particularly in recounting experiences of the British occupation, underscoring the gendered dimensions of memory formation. This research deepens the understanding of collective memory in diasporic settings, emphasizing the critical role of oral histories in safeguarding Palestinian identity and resisting erasure.

large language model, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

2501.13682

Country:

Asia > Middle East > Palestine (0.29)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
(18 more...)

Genre:

Research Report > New Finding (0.93)
Personal > Interview (0.67)

Industry:

Government > Regional Government (0.93)
Government > Immigration & Customs (0.93)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Information Management (0.67)
(2 more...)

Add feedback

ArabicNLU 2024: The First Arabic Natural Language Understanding Shared Task

Khalilia, Mohammed, Malaysha, Sanad, Suwaileh, Reem, Jarrar, Mustafa, Aljabari, Alaa, Elsayed, Tamer, Zitouni, Imed

arXiv.org Artificial IntelligenceJul-30-2024

This paper presents an overview of the Arabic Natural Language Understanding (ArabicNLU 2024) shared task, focusing on two subtasks: Word Sense Disambiguation (WSD) and Location Mention Disambiguation (LMD). The task aimed to evaluate the ability of automated systems to resolve word ambiguity and identify locations mentioned in Arabic text. We provided participants with novel datasets, including a sense-annotated corpus for WSD, called SALMA with approximately 34k annotated tokens, and the IDRISI-DA dataset with 3,893 annotations and 763 unique location mentions. These are challenging tasks. Out of the 38 registered teams, only three teams participated in the final evaluation phase, with the highest accuracy being 77.8% for WSD and the highest MRR@1 being 95.0% for LMD. The shared task not only facilitated the evaluation and comparison of different techniques, but also provided valuable insights and resources for the continued advancement of Arabic NLU technologies.

computational linguistic, dataset, proceedings, (14 more...)

arXiv.org Artificial Intelligence

2407.20663

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Middle East > Qatar (0.05)
Africa > Middle East > Egypt > Cairo Governorate > Cairo (0.05)
(17 more...)

Genre:

Research Report (0.64)
Overview (0.54)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.94)

Add feedback

A lexicon obtained and validated by a data-driven approach for organic residues valorization in emerging and developing countries

Rakotomalala, Christiane, Paillat, Jean-Marie, Feder, Frédéric, Avadí, Angel, Thuriès, Laurent, Vermeire, Marie-Liesse, Médoc, Jean-Michel, Wassenaar, Tom, Hottelart, Caroline, Kieffer, Lilou, Ndjie, Elisa, Picart, Mathieu, Tchamgoue, Jorel, Tulle, Alvin, Valade, Laurine, Boyer, Annie, Duchamp, Marie-Christine, Roche, Mathieu

arXiv.org Artificial IntelligenceJun-2-2024

The text mining method presented in this paper was used for annotation of terms related to biological transformation and valorization of organic residues in agriculture in low and middle-income country. Specialized lexicon was obtained through different steps: corpus and extraction of terms, annotation of extracted terms, selection of relevant terms.

montpellier, recyclage et risque, valorization, (11 more...)

arXiv.org Artificial Intelligence

2406.00682

Country:

Africa > Saint Helena, Ascension and Tristan da Cunha (0.29)
North America > Central America (0.14)
Asia > North Korea (0.14)
(132 more...)

Genre: Research Report (0.64)

Industry: Food & Agriculture > Agriculture (0.93)

Technology: Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

All eyes on Israel's response to Iranian drone and missile attacks

BBC NewsApr-14-2024, 11:04:23 GMT

It could listen to its neighbours in the region and exercise what is known as "strategic patience", holding off from responding in kind and instead continuing to target Iran's proxy allies in the region such as Hezbollah in Lebanon or military supply sites in Syria, as it has been doing for years.

iranian drone and missile attack, israel

BBC News

Country:

North America > United States > Oregon > Linn County > Lebanon (0.43)
Asia > Middle East > Syria (0.43)
Asia > Middle East > Lebanon (0.43)
(2 more...)

Industry: Government > Military (0.40)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.40)

Add feedback

Can't Touch This: Real-Time, Safe Motion Planning and Control for Manipulators Under Uncertainty

Michaux, Jonathan, Holmes, Patrick, Zhang, Bohao, Chen, Che, Wang, Baiyue, Sahgal, Shrey, Zhang, Tiancheng, Dey, Sidhartha, Kousik, Shreyas, Vasudevan, Ram

arXiv.org Artificial IntelligenceNov-1-2023

Ensuring safe, real-time motion planning in arbitrary environments requires a robotic manipulator to avoid collisions, obey joint limits, and account for uncertainties in the mass and inertia of objects and the robot itself. This paper proposes Autonomous Robust Manipulation via Optimization with Uncertainty-aware Reachability (ARMOUR), a provably-safe, receding-horizon trajectory planner and tracking controller framework for robotic manipulators to address these challenges. ARMOUR first constructs a robust controller that tracks desired trajectories with bounded error despite uncertain dynamics. ARMOUR then uses a novel recursive Newton-Euler method to compute all inputs required to track any trajectory within a continuum of desired trajectories. Finally, ARMOUR over-approximates the swept volume of the manipulator; this enables one to formulate an optimization problem that can be solved in real-time to synthesize provably-safe motions. This paper compares ARMOUR to state of the art methods on a set of challenging manipulation examples in simulation and demonstrates its ability to ensure safety on real hardware in the presence of model uncertainty without sacrificing performance. Project page: https://roahmlab.github.io/armour/.

polynomial zonotope, robot, trajectory, (15 more...)

arXiv.org Artificial Intelligence

2301.13308

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
North America > United States > Oregon > Linn County > Albany (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
(3 more...)

Genre: Research Report (0.70)

Industry: Automobiles & Trucks (0.45)

Technology:

Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)

Add feedback

Linking Symptom Inventories using Semantic Textual Similarity

Kennedy, Eamonn, Vadlamani, Shashank, Lindsey, Hannah M, Peterson, Kelly S, OConnor, Kristen Dams, Murray, Kenton, Agarwal, Ronak, Amiri, Houshang H, Andersen, Raeda K, Babikian, Talin, Baron, David A, Bigler, Erin D, Caeyenberghs, Karen, Delano-Wood, Lisa, Disner, Seth G, Dobryakova, Ekaterina, Eapen, Blessen C, Edelstein, Rachel M, Esopenko, Carrie, Genova, Helen M, Geuze, Elbert, Goodrich-Hunsaker, Naomi J, Grafman, Jordan, Haberg, Asta K, Hodges, Cooper B, Hoskinson, Kristen R, Hovenden, Elizabeth S, Irimia, Andrei, Jahanshad, Neda, Jha, Ruchira M, Keleher, Finian, Kenney, Kimbra, Koerte, Inga K, Liebel, Spencer W, Livny, Abigail, Lovstad, Marianne, Martindale, Sarah L, Max, Jeffrey E, Mayer, Andrew R, Meier, Timothy B, Menefee, Deleene S, Mohamed, Abdalla Z, Mondello, Stefania, Monti, Martin M, Morey, Rajendra A, Newcombe, Virginia, Newsome, Mary R, Olsen, Alexander, Pastorek, Nicholas J, Pugh, Mary Jo, Razi, Adeel, Resch, Jacob E, Rowland, Jared A, Russell, Kelly, Ryan, Nicholas P, Scheibel, Randall S, Schmidt, Adam T, Spitz, Gershon, Stephens, Jaclyn A, Tal, Assaf, Talbert, Leah D, Tartaglia, Maria Carmela, Taylor, Brian A, Thomopoulos, Sophia I, Troyanskaya, Maya, Valera, Eve M, van der Horn, Harm Jan, Van Horn, John D, Verma, Ragini, Wade, Benjamin SC, Walker, Willian SC, Ware, Ashley L, Werner, J Kent Jr, Yeates, Keith Owen, Zafonte, Ross D, Zeineh, Michael M, Zielinski, Brandon, Thompson, Paul M, Hillary, Frank G, Tate, David F, Wilde, Elisabeth A, Dennis, Emily L

arXiv.org Artificial IntelligenceSep-8-2023

An extensive library of symptom inventories has been developed over time to measure clinical symptoms, but this variety has led to several long standing issues. Most notably, results drawn from different settings and studies are not comparable, which limits reproducibility. Here, we present an artificial intelligence (AI) approach using semantic textual similarity (STS) to link symptoms and scores across previously incongruous symptom inventories. We tested the ability of four pre-trained STS models to screen thousands of symptom description pairs for related content - a challenging task typically requiring expert panels. Models were tasked to predict symptom severity across four different inventories for 6,607 participants drawn from 16 international data sources. The STS approach achieved 74.8% accuracy across five tasks, outperforming other models tested. This work suggests that incorporating contextual, semantic information can assist expert decision-making processes, yielding gains for both general and disease-specific clinical assessment.

inventory, symptom, university, (11 more...)

arXiv.org Artificial Intelligence

2309.04607

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.30)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)
North America > United States > Virginia > Albemarle County > Charlottesville (0.14)
(50 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Health Care Providers & Services (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers

Anagnostidis, Sotiris, Pavllo, Dario, Biggio, Luca, Noci, Lorenzo, Lucchi, Aurelien, Hofmann, Thomas

arXiv.org Artificial IntelligenceMay-28-2023

Autoregressive Transformers adopted in Large Language Models (LLMs) are hard to scale to long sequences. Despite several works trying to reduce their computational cost, most of LLMs still adopt attention layers between all pairs of tokens in the sequence, thus incurring a quadratic cost. In this study, we present a novel approach that dynamically prunes contextual information while preserving the model's expressiveness, resulting in reduced memory and computational requirements during inference. Our method employs a learnable mechanism that determines which uninformative tokens can be dropped from the context at any point across the generation process. By doing so, our approach not only addresses performance concerns but also enhances interpretability, providing valuable insight into the model's decision-making process. Our technique can be applied to existing pre-trained models through a straightforward fine-tuning process, and the pruning strength can be specified by a sparsity parameter. Notably, our empirical findings demonstrate that we can effectively prune up to 80\% of the context without significant performance degradation on downstream tasks, offering a valuable tool for mitigating inference costs. Our reference implementation achieves up to $2\times$ increase in inference throughput and even greater memory savings.

arxiv preprint arxiv, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2305.15805

Country:

Asia > Middle East > Lebanon (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Oregon > Linn County > Lebanon (0.04)
(7 more...)

Genre:

Research Report > New Finding (0.68)
Research Report > Promising Solution (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Large Language Models Can Be Strong Differentially Private Learners

Li, Xuechen, Tramèr, Florian, Liang, Percy, Hashimoto, Tatsunori

arXiv.org Artificial IntelligenceNov-10-2022

Differentially Private (DP) learning has seen limited success for building large deep learning models of text, and straightforward attempts at applying Differentially Private Stochastic Gradient Descent (DP-SGD) to NLP tasks have resulted in large performance drops and high computational overhead. We show that this performance drop can be mitigated with (1) the use of large pretrained language models; (2) non-standard hyperparameters that suit DP optimization; and (3) fine-tuning objectives which are aligned with the pretraining procedure. With the above, we obtain NLP models that outperform state-of-the-art DP-trained models under the same privacy budget and strong non-private baselines -- by directly fine-tuning pretrained models with DP optimization on moderately-sized corpora. To address the computational challenge of running DP-SGD with large Transformers, we propose a memory saving technique that allows clipping in DP-SGD to run without instantiating per-example gradients for any linear layer in the model. The technique enables privately training Transformers with almost the same memory cost as non-private training at a modest run-time overhead. Contrary to conventional wisdom that DP optimization fails at learning high-dimensional models (due to noise that scales with dimension) empirical results reveal that private learning with pretrained language models doesn't tend to suffer from dimension-dependent performance degradation. Code to reproduce results can be found at https://github.com/lxuechen/private-transformers.

arxiv preprint arxiv, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2110.05679

Country:

North America > United States > Oregon > Linn County > Albany (0.14)
North America > United States > District of Columbia > Washington (0.14)
Europe > Spain > Galicia > Madrid (0.04)
(6 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Information Technology > Security & Privacy (1.00)
Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.97)

Add feedback