herrera
Right-Wing Gun Enthusiasts and Extremists Are Working Overtime to Justify Alex Pretti's Killing
Right-Wing Gun Enthusiasts and Extremists Are Working Overtime to Justify Alex Pretti's Killing Donald Trump has appeared to undermine Second Amendment rights in statements about Alex Pretti's killing. Many in the firearms community are going along with it. In the hours after Border Patrol agents shot and killed Alex Pretti in Minneapolis, President Donald Trump and his administration appeared to directly undermine the rights granted to gun owners in the Second Amendment. Department of Homeland Security secretary Kristi Noem inaccurately said Pretti was a " domestic terrorist " who was "brandishing" his legally held gun. FBI director Kash Patel wrongly told Fox News it's illegal to bring a gun to a protest.
- South America > Venezuela (0.48)
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.27)
- North America > United States > California (0.05)
- (7 more...)
Apple Engineers Are Inspecting Bacon Packaging to Help Level Up US Manufacturers
Initial participants in the new Apple Manufacturing Academy tell WIRED that the tech giant's surprising frankness and hands-on support are already benefiting their bottom lines. An instructor at the Apple Manufacturing Academy in Detroit demonstrates how an iPhone and optical inspection software can be used to photograph and automatically identify an issue with a part. About 10 Apple employees spent some of their valuable hours over recent months on a project that might seem unusual for the tech giant: customizing an open source AI tool for ImageTek, a small manufacturer in Springfield, Vermont whose lines of business include printing millions of labels for food packaging. The Apple engineers developed a computer vision system to automatically identify color errors, and on one run it picked up bacon labels with a far-too-pinkish beige before they got shipped, according to Marji Smith, ImageTek's president. She says the timely catch helped ImageTek from losing a crucial customer.
- North America > United States > Michigan (0.40)
- North America > United States > Vermont (0.25)
- Asia > China (0.05)
- (5 more...)
- Information Technology (0.87)
- Government > Regional Government > North America Government > United States Government (0.47)
- Energy > Renewable > Geothermal (0.47)
Overview of BioASQ 2024: The twelfth BioASQ challenge on Large-Scale Biomedical Semantic Indexing and Question Answering
Nentidis, Anastasios, Katsimpras, Georgios, Krithara, Anastasia, Lima-López, Salvador, Farré-Maduell, Eulàlia, Krallinger, Martin, Loukachevitch, Natalia, Davydova, Vera, Tutubalina, Elena, Paliouras, Georgios
This is an overview of the twelfth edition of the BioASQ challenge in the context of the Conference and Labs of the Evaluation Forum (CLEF) 2024. BioASQ is a series of international challenges promoting advances in large-scale biomedical semantic indexing and question answering. This year, BioASQ consisted of new editions of the two established tasks b and Synergy, and two new tasks: a) MultiCardioNER on the adaptation of clinical entity detection to the cardiology domain in a multilingual setting, and b) BIONNE on nested NER in Russian and English. In this edition of BioASQ, 37 competing teams participated with more than 700 distinct submissions in total for the four different shared tasks of the challenge. Similarly to previous editions, most of the participating systems achieved competitive performance, suggesting the continuous advancement of the state-of-the-art in the field.
- Asia > Russia (0.04)
- Europe > Greece > Central Macedonia > Thessaloniki (0.04)
- Europe > Germany > Bavaria > Regensburg (0.04)
- (9 more...)
- Overview (0.68)
- Research Report (0.64)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.97)
- (3 more...)
StylOch at PAN: Gradient-Boosted Trees with Frequency-Based Stylometric Features
Ochab, Jeremi K., Matias, Mateusz, Boba, Tymoteusz, Walkowiak, Tomasz
This submission to the binary AI detection task is based on a modular stylometric pipeline, where: public spaCy models are used for text preprocessing (including tokenisation, named entity recognition, dependency parsing, part-of-speech tagging, and morphology annotation) and extracting several thousand features (frequencies of n-grams of the above linguistic annotations); light-gradient boosting machines are used as the classifier. We collect a large corpus of more than 500 000 machine-generated texts for the classifier's training. We explore several parameter options to increase the classifier's capacity and take advantage of that training set. Our approach follows the non-neural, computationally inexpensive but explainable approach found effective previously.
- Europe > Switzerland (0.05)
- Europe > Poland > Lesser Poland Province > Kraków (0.05)
- Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.05)
- (10 more...)
- Research Report (0.50)
- Overview (0.46)
Addressing Multilabel Imbalance with an Efficiency-Focused Approach Using Diffusion Model-Generated Synthetic Samples
Charte, Francisco, Dávila, Miguel Ángel, Pérez-Godoy, María Dolores, del Jesus, María José
Predictive models trained on imbalanced data tend to produce biased results. This problem is exacerbated when there is not just one output label, but a set of them. This is the case for multilabel learning (MLL) algorithms used to classify patterns, rank labels, or learn the distribution of outputs. Many solutions have been proposed in the literature. The one that can be applied universally, independent of the algorithm used to build the model, is data resampling. The generation of new instances associated with minority labels, so that empty areas of the feature space are filled, helps to improve the obtained models. The quality of these new instances depends on the algorithm used to generate them. In this paper, a diffusion model tailored to produce new instances for MLL data, called MLDM (\textit{MultiLabel Diffusion Model}), is proposed. Diffusion models have been mainly used to generate artificial images and videos. Our proposed MLDM is based on this type of models. The experiments conducted compare MLDM with several other MLL resampling algorithms. The results show that MLDM is competitive while it improves efficiency.
- North America > United States > Massachusetts > Suffolk County > Boston (0.04)
- North America > United States > District of Columbia > Washington (0.04)
- Europe > Spain > Andalusia > Jaén Province > Jaén (0.04)
- (2 more...)
The Paradox of Success in Evolutionary and Bioinspired Optimization: Revisiting Critical Issues, Key Studies, and Methodological Pathways
Molina, Daniel, Del Ser, Javier, Poyatos, Javier, Herrera, Francisco
Evolutionary and bioinspired computation are crucial for efficiently addressing complex optimization problems across diverse application domains. By mimicking processes observed in nature, like evolution itself, these algorithms offer innovative solutions beyond the reach of traditional optimization methods. They excel at finding near-optimal solutions in large, complex search spaces, making them invaluable in numerous fields. However, both areas are plagued by challenges at their core, including inadequate benchmarking, problem-specific overfitting, insufficient theoretical grounding, and superfluous proposals justified only by their biological metaphor. This overview recapitulates and analyzes in depth the criticisms concerning the lack of innovation and rigor in experimental studies within the field. To this end, we examine the judgmental positions of the existing literature in an informed attempt to guide the research community toward directions of solid contribution and advancement in these areas. We summarize guidelines for the design of evolutionary and bioinspired optimizers, the development of experimental comparisons, and the derivation of novel proposals that take a step further in the field. We provide a brief note on automating the process of creating these algorithms, which may help align metaheuristic optimization research with its primary objective (solving real-world problems), provided that our identified pathways are followed. Our conclusions underscore the need for a sustained push towards innovation and the enforcement of methodological rigor in prospective studies to fully realize the potential of these advanced computational techniques.
- North America > United States > New York (0.04)
- Oceania > New Zealand (0.04)
- Europe > Switzerland (0.04)
- Europe > Spain > Basque Country (0.04)
- Research Report > Promising Solution (1.00)
- Research Report > New Finding (1.00)
The NSA Warns That US Adversaries Free to Mine Private Data May Have an AI Edge
Electrical engineer Gilbert Herrera was appointed research director of the US National Security Agency in late 2021, just as an AI revolution was brewing inside the US tech industry. The NSA, sometimes jokingly said to stand for No Such Agency, has long hired top math and computer science talent. Its technical leaders have been early and avid users of advanced computing and AI. And yet when Herrera spoke with me by phone about the implications of the latest AI boom from NSA headquarters in Fort Meade, Maryland, it seemed that, like many others, the agency has been stunned by the recent success of the large language models behind ChatGPT and other hit AI products. The conversation has been lightly edited for clarity and length.
Efficient Hybrid Oversampling and Intelligent Undersampling for Imbalanced Big Data Classification
Vairetti, Carla, Assadi, José Luis, Maldonado, Sebastián
Imbalanced classification is a well-known challenge faced by many real-world applications. This issue occurs when the distribution of the target variable is skewed, leading to a prediction bias toward the majority class. With the arrival of the Big Data era, there is a pressing need for efficient solutions to solve this problem. In this work, we present a novel resampling method called SMOTENN that combines intelligent undersampling and oversampling using a MapReduce framework. Both procedures are performed on the same pass over the data, conferring efficiency to the technique. The SMOTENN method is complemented with an efficient implementation of the neighborhoods related to the minority samples. Our experimental results show the virtues of this approach, outperforming alternative resampling techniques for small- and medium-sized datasets while achieving positive results on large datasets with reduced running times.
- South America > Uruguay > Maldonado > Maldonado (0.04)
- South America > Chile > Valparaíso Region > Los Andes Province > Los Andes (0.04)
- Asia > Thailand > Chiang Mai > Chiang Mai (0.04)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.88)
- (2 more...)
mldr.resampling: Efficient Reference Implementations of Multilabel Resampling Algorithms
Rivera, Antonio J., Dávila, Miguel A., Elizondo, David, del Jesus, María J., Charte, Francisco
MultiLabel Learning (MLL) [1] is one of the most common machine learning tasks today. It is based on the idea that each data sample is associated with a certain subset of labels. The full set of labels can be large, in many cases even having more labels than input features. As a result, it is common for some labels to occur in only a few samples, while others occur much more frequently. The label imbalance [2] in MLL is almost always present, and it is a serious obstacle to training good classifiers. Class imbalance is a very well-known problem in traditional learning tasks such as binary and multiclass classification. Hundreds of articles [3, 4, 5], conference papers [6] and books [7] have been devoted to studying it and proposing possible solutions. The most popular are data resampling, cost-sensitive learning and mixtures of these approaches [8, 9]. However, imbalanced learning in the MLL field presents some specific aspects that make it more difficult to deal with this problem.
- North America > United States > Georgia > Fulton County > Atlanta (0.04)
- Europe > Spain > Castile and León > Salamanca Province > Salamanca (0.04)
- Europe > Spain > Andalusia > Jaén Province > Jaén (0.04)
- (4 more...)
Literature Review of the Recent Trends and Applications in various Fuzzy Rule based systems
Varshney, Ayush K., Torra, Vicenç
Fuzzy rule based systems (FRBSs) is a rule-based system which uses linguistic fuzzy variables as antecedents and consequent to represent human understandable knowledge. They have been applied to various applications and areas throughout the soft computing literature. However, FRBSs suffers from many drawbacks such as uncertainty representation, high number of rules, interpretability loss, high computational time for learning etc. To overcome these issues with FRBSs, there exists many extensions of FRBSs. This paper presents an overview and literature review of recent trends on various types and prominent areas of fuzzy systems (FRBSs) namely genetic fuzzy system (GFS), hierarchical fuzzy system (HFS), neuro fuzzy system (NFS), evolving fuzzy system (eFS), FRBSs for big data, FRBSs for imbalanced data, interpretability in FRBSs and FRBSs which use cluster centroids as fuzzy rules. The review is for years 2010-2021. This paper also highlights important contributions, publication statistics and current trends in the field. The paper also addresses several open research areas which need further attention from the FRBSs research community.
- Europe > Sweden > Västerbotten County > Umeå (0.04)
- Europe > Germany > North Rhine-Westphalia > Arnsberg Region > Dortmund (0.04)
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
- Africa > Middle East > Egypt (0.04)
- Overview (1.00)
- Research Report > New Finding (0.46)
- Energy (1.00)
- Information Technology > Security & Privacy (0.93)
- Health & Medicine > Therapeutic Area (0.68)