AITopics | efficientnet-b3

Collaborating Authors

efficientnet-b3

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Purrturbed but Stable: Human-Cat Invariant Representations Across CNNs, ViTs and Self-Supervised ViTs

Shah, Arya, Tripathi, Vaibhav

arXiv.org Artificial IntelligenceNov-5-2025

Cats and humans differ in ocular anatomy. Most notably, Felis Catus (domestic cats) have vertically elongated pupils linked to ambush predation; yet, how such specializations manifest in downstream visual representations remains incompletely understood. We present a unified, frozen-encoder benchmark that quantifies feline-human cross-species representational alignment in the wild, across convolutional networks, supervised Vision Transformers, windowed transformers, and self-supervised ViTs (DINO), using layer-wise Centered Kernel Alignment (linear and RBF) and Representational Similarity Analysis, with additional distributional and stability tests reported in the paper. Across models, DINO ViT-B/16 attains the most substantial alignment (mean CKA-RBF $\approx0.814$, mean CKA-linear $\approx0.745$, mean RSA $\approx0.698$), peaking at early blocks, indicating that token-level self-supervision induces early-stage features that bridge species-specific statistics. Supervised ViTs are competitive on CKA yet show weaker geometric correspondence than DINO (e.g., ViT-B/16 RSA $\approx0.53$ at block8; ViT-L/16 $\approx0.47$ at block14), revealing depth-dependent divergences between similarity and representational geometry. CNNs remain strong baselines but below plain ViTs on alignment, and windowed transformers underperform plain ViTs, implicating architectural inductive biases in cross-species alignment. Results indicate that self-supervision coupled with ViT inductive biases yields representational geometries that more closely align feline and human visual systems than widely used CNNs and windowed Transformers, providing testable neuroscientific hypotheses about where and how cross-species visual computations converge. We release our code and dataset for reference and reproducibility.

artificial intelligence, deep learning, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2511.02404

Country: Asia > India (0.28)

Genre: Research Report (0.82)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Deep Learning-Based Transfer Learning for Classification of Cassava Disease

Junior, Ademir G. Costa, da Silva, Fábio S., Rios, Ricardo

arXiv.org Artificial IntelligenceFeb-26-2025

This paper presents a performance comparison among four Convolutional Neural Network architectures (EfficientNet-B3, InceptionV3, ResNet50, and VGG16) for classifying cassava disease images. The images were sourced from an imbalanced dataset from a competition. Appropriate metrics were employed to address class imbalance. The results indicate that EfficientNet-B3 achieved on this task accuracy of 87.7%, precision of 87.8%, revocation of 87.8% and F1-Score of 87.7%. These findings suggest that EfficientNet-B3 could be a valuable tool to support Digital Agriculture.

classificac ao, conjunto, doenc, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.5753/eniac.2024.244378

2502.19351

Country:

South America > Brazil > Amazonas > Manaus (0.04)
North America > United States (0.04)
Africa > Uganda > Central Region > Kampala (0.04)

Genre: Research Report (0.70)

Industry: Food & Agriculture > Agriculture (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Brain Tumor Radiogenomic Classification

Mohamed, Amr, Rabea, Mahmoud, Sameh, Aya, Kamal, Ehab

arXiv.org Artificial IntelligenceJan-11-2024

Related Work methylation[2] is an important biomarker for glioblastoma[9], the most common and Brain tumor detection from MRI scan has seen aggressive form of brain cancer in adults.By great advancements over the past few years, introducing a Radiogenomic based imaging Abdusalomov et al. [10] proposed a deep method, the process of detecting the presence of learning approach using pre-trained YOLOv7 brain tumor shall be less invasive which will for object detection, Bi-directional Feature eventually improve the survival and prospects of Pyramid Network (BiFPN) for feature patients with brain cancer. Also, knowing the extraction, and Channel and Spatial Attention methylation status helps guide treatment module (CBAM) for improved attention decisions, as tumors with methylation are more mechanisms. Their model achieved a responsive to certain therapies.

architecture, dataset, efficientnet-b3, (12 more...)

arXiv.org Artificial Intelligence

2401.09471

Country:

Europe > Switzerland > Basel-City > Basel (0.04)
Africa > Middle East > Egypt > Cairo Governorate > Cairo (0.04)

Genre: Research Report > New Finding (0.95)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Therapeutic Area > Oncology > Brain Cancer (0.92)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.99)

Add feedback

Kernel Inversed Pyramidal Resizing Network for Efficient Pavement Distress Recognition

Qin, Rong, Huangfu, Luwen, Hood, Devon, Ma, James, Huang, Sheng

arXiv.org Artificial IntelligenceDec-4-2022

Pavement Distress Recognition (PDR) is an important step in pavement inspection and can be powered by image-based automation to expedite the process and reduce labor costs. Pavement images are often in high-resolution with a low ratio of distressed to non-distressed areas. Advanced approaches leverage these properties via dividing images into patches and explore discriminative features in the scale space. However, these approaches usually suffer from information loss during image resizing and low efficiency due to complex learning frameworks. In this paper, we propose a novel and efficient method for PDR. A light network named the Kernel Inversed Pyramidal Resizing Network (KIPRN) is introduced for image resizing, and can be flexibly plugged into the image classification network as a pre-network to exploit resolution and scale information. In KIPRN, pyramidal convolution and kernel inversed convolution are specifically designed to mine discriminative information across different feature granularities and scales. The mined information is passed along to the resized images to yield an informative image pyramid to assist the image classification network for PDR. We applied our method to three well-known Convolutional Neural Networks (CNNs), and conducted an evaluation on a large-scale pavement image dataset named CQU-BPDD. Extensive results demonstrate that KIPRN can generally improve the pavement distress recognition of these CNN models and show that the simple combination of KIPRN and EfficientNet-B3 significantly outperforms the state-of-the-art patch-based method in both performance and efficiency.

artificial intelligence, machine learning, pavement image, (14 more...)

arXiv.org Artificial Intelligence

2212.0179

Country:

North America > United States > California > San Diego County > San Diego (0.06)
Asia > China > Chongqing Province > Chongqing (0.05)
North America > United States > Colorado > El Paso County > Colorado Springs (0.04)

Genre: Research Report > New Finding (0.67)

Industry: Materials > Construction Materials (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Plot2API: Recommending Graphic API from Plot via Semantic Parsing Guided Neural Network

Wang, Zeyu, Huang, Sheng, Liu, Zhongxin, Yan, Meng, Xia, Xin, Wang, Bei, Yang, Dan

arXiv.org Artificial IntelligenceApr-2-2021

Plot-based Graphic API recommendation (Plot2API) is an unstudied but meaningful issue, which has several important applications in the context of software engineering and data visualization, such as the plotting guidance of the beginner, graphic API correlation analysis, and code conversion for plotting. Plot2API is a very challenging task, since each plot is often associated with multiple APIs and the appearances of the graphics drawn by the same API can be extremely varied due to the different settings of the parameters. Additionally, the samples of different APIs also suffer from extremely imbalanced. Considering the lack of technologies in Plot2API, we present a novel deep multi-task learning approach named Semantic Parsing Guided Neural Network (SPGNN) which translates the Plot2API issue as a multi-label image classification and an image semantic parsing tasks for the solution. In SPGNN, the recently advanced Convolutional Neural Network (CNN) named EfficientNet is employed as the backbone network for API recommendation. Meanwhile, a semantic parsing module is complemented to exploit the semantic relevant visual information in feature learning and eliminate the appearance-relevant visual information which may confuse the visual-information-based API recommendation. Moreover, the recent data augmentation technique named random erasing is also applied for alleviating the imbalance of API categories. We collect plots with the graphic APIs used to drawn them from Stack Overflow, and release three new Plot2API datasets corresponding to the graphic APIs of R and Python programming languages for evaluating the effectiveness of Plot2API techniques. Extensive experimental results not only demonstrate the superiority of our method over the recent deep learning baselines but also show the practicability of our method in the recommendation of graphic APIs.

api, dataset, recommendation, (14 more...)

arXiv.org Artificial Intelligence

2104.01032

Country:

Asia > China > Chongqing Province > Chongqing (0.04)
Oceania > Australia (0.04)
Asia > China > Zhejiang Province > Hangzhou (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Filters

Collaborating Authors

efficientnet-b3

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Purrturbed but Stable: Human-Cat Invariant Representations Across CNNs, ViTs and Self-Supervised ViTs

7712b1075f5e0eae297702845714098f-Supplemental-Conference.pdf

Deep Learning-Based Transfer Learning for Classification of Cassava Disease

Brain Tumor Radiogenomic Classification

Kernel Inversed Pyramidal Resizing Network for Efficient Pavement Distress Recognition

Plot2API: Recommending Graphic API from Plot via Semantic Parsing Guided Neural Network