Goto

Collaborating Authors

 feature pyramid network



We would first like to thank the reviewers for especially detailed and high quality reviews

Neural Information Processing Systems

We would first like to thank the reviewers for especially detailed and high quality reviews. That said, this may have had the opposite effect of being confusing. Simple blurs are another example of a semigroup action on the whole signal. Resource Efficient Image Classification" (Huang et al., 2018) are also very interesting and have now been added to our Indeed, in Dilated Residual Networks (Y u et al., 2017), there is no bandlimiting at all (we Moreover, maybe the use of dilated convolutions in the end "does Furthermore, in related works there are no "standard ImageNet, but in Feature Pyramid Networks, the authors look at COCO. We do intend to extend the current work to other domains and to other kinds of semigroup action.


NS-FPN: Improving Infrared Small Target Detection and Segmentation from Noise Suppression Perspective

Yuan, Maoxun, Meng, Duanni, Xi, Ziteng, Zhao, Tianyi, Zhao, Shiji, Dai, Yimian, Wei, Xingxing

arXiv.org Artificial Intelligence

Infrared small target detection and segmentation (IRSTDS) is a critical yet challenging task in defense and civilian applications, owing to the dim, shapeless appearance of targets and severe background clutter. Recent CNN-based methods have achieved promising target perception results, but they only focus on enhancing feature representation to offset the impact of noise, which results in the increased false alarms problem. In this paper, through analyzing the problem from the frequency domain, we pioneer in improving performance from noise suppression perspective and propose a novel noise-suppression feature pyramid network (NS-FPN), which integrates a low-frequency guided feature purification (LFP) module and a spiral-aware feature sampling (SFS) module into the original FPN structure. The LFP module suppresses the noise features by purifying high-frequency components to achieve feature enhancement devoid of noise interference, while the SFS module further adopts spiral sampling to fuse target-relevant features in feature fusion process. Our NS-FPN is designed to be lightweight yet effective and can be easily plugged into existing IRSTDS frameworks. Extensive experiments on the public IRSTDS datasets demonstrate that our method significantly reduces false alarms and achieves superior performance on IRSTDS tasks.


Deep Learning-Based Fatigue Cracks Detection in Bridge Girders using Feature Pyramid Networks

Zhang, Jiawei, Li, Jun, Ly, Reachsak, Liu, Yunyi, Shu, Jiangpeng

arXiv.org Artificial Intelligence

For structural health monitoring, continuous and automatic crack detection has been a challenging problem. This study is conducted to propose a framework of automatic crack segmentation from high-resolution images containing crack information about steel box girders of bridges. Considering the multi-scale feature of cracks, convolutional neural network architecture of Feature Pyramid Networks (FPN) for crack detection is proposed. As for input, 120 raw images are processed via two approaches (shrinking the size of images and splitting images into sub-images). Then, models with the proposed structure of FPN for crack detection are developed. The result shows all developed models can automatically detect the cracks at the raw images. By shrinking the images, the computation efficiency is improved without decreasing accuracy. Because of the separable characteristic of crack, models using the splitting method provide more accurate crack segmentations than models using the resizing method. Therefore, for high-resolution images, the FPN structure coupled with the splitting method is an promising solution for the crack segmentation and detection.


[Re] CLRNet: Cross Layer Refinement Network for Lane Detection

N, Viswesh, Jadhav, Kaushal, Amalanshu, Avi, Mondal, Bratin, Waran, Sabaris, Sadhwani, Om, Kumar, Apoorv, Chakravarty, Debashish

arXiv.org Artificial Intelligence

The following work is a reproducibility report for CLRNet: Cross Layer Refinement Network for Lane Detection. The basic code was made available by the author. The paper proposes a novel Cross Layer Refinement Network to utilize both high and low level features for lane detection. The authors assert that the proposed technique sets the new state-of-the-art on three lane-detection benchmarks


An image segmentation algorithm based on multi-scale feature pyramid network

Xiao, Yu, Yang, Xin, Huang, Sijuan, Guo, Lihua

arXiv.org Artificial Intelligence

Medical image segmentation is particularly critical as a prerequisite for relevant quantitative analysis in the treatment of clinical diseases. For example, in clinical cervical cancer radiotherapy, after acquiring subabdominal MRI images, a fast and accurate image segmentation of organs and tumors in MRI images can optimize the clinical radiotherapy process, whereas traditional approaches use manual annotation by specialist doctors, which is time consuming and laborious, therefore, automatic organ segmentation of subabdominal MRI images is a valuable research topic. In the field of automatic segmentation in medical image, U Net, proposed by Ronneberger et al. [1] in 2015, still has an irreplaceable influence today. Many transformers of U Net network are proposed, and various plug and play components use it as a backbone network [3 10]. Image semantic segmentation differs from image classification.


Lightweight wood panel defect detection method incorporating attention mechanism and feature fusion network

Cao, Yongxin, Liu, Fanghua, Jiang, Lai, Bao, Cheng, Miao, You, Chen, Yang

arXiv.org Artificial Intelligence

In recent years, deep learning has made significant progress in wood panel defect detection. However, there are still challenges such as low detection , slow detection speed, and difficulties in deploying embedded devices on wood panel surfaces. To overcome these issues, we propose a lightweight wood panel defect detection method called YOLOv5-LW, which incorporates attention mechanisms and a feature fusion network.Firstly, to enhance the detection capability of acceptable defects, we introduce the Multi-scale Bi-directional Feature Pyramid Network (MBiFPN) as a feature fusion network. The MBiFPN reduces feature loss, enriches local and detailed features, and improves the model's detection capability for acceptable defects.Secondly, to achieve a lightweight design, we reconstruct the ShuffleNetv2 network model as the backbone network. This reconstruction reduces the number of parameters and computational requirements while maintaining performance. We also introduce the Stem Block and Spatial Pyramid Pooling Fast (SPPF) models to compensate for any accuracy loss resulting from the lightweight design, ensuring the model's detection capabilities remain intact while being computationally efficient.Thirdly, we enhance the backbone network by incorporating Efficient Channel Attention (ECA), which improves the network's focus on key information relevant to defect detection. By attending to essential features, the model becomes more proficient in accurately identifying and localizing defects.We validate the proposed method using a self-developed wood panel defect dataset.The experimental results demonstrate the effectiveness of the improved YOLOv5-LW method. Compared to the original model, our approach achieves a 92.8\% accuracy rate, reduces the number of parameters by 27.78\%, compresses computational volume by 41.25\%, improves detection inference speed by 10.16\%


The Topology-Overlap Trade-Off in Retinal Arteriole-Venule Segmentation

Muller, Angel Victor Juanco, Mota, Joao F. C., Goatman, Keith A., Hoogendoorn, Corne

arXiv.org Machine Learning

Retinal fundus images can be an invaluable diagnosis tool for screening epidemic diseases like hypertension or diabetes. And they become especially useful when the arterioles and venules they depict are clearly identified and annotated. However, manual annotation of these vessels is extremely time demanding and taxing, which calls for automatic segmentation. Although convolutional neural networks can achieve high overlap between predictions and expert annotations, they often fail to produce topologically correct predictions of tubular structures. This situation is exacerbated by the bifurcation versus crossing ambiguity which causes classification mistakes. This paper shows that including a topology preserving term in the loss function improves the continuity of the segmented vessels, although at the expense of artery-vein misclassification and overall lower overlap metrics. However, we show that by including an orientation score guided convolutional module, based on the anisotropic single sided cake wavelet, we reduce such misclassification and further increase the topology correctness of the results. We evaluate our model on public datasets with conveniently chosen metrics to assess both overlap and topology correctness, showing that our model is able to produce results on par with state-of-the-art from the point of view of overlap, while increasing topological accuracy.


Maximizing Object Detection Accuracy with FPN: A Comprehensive Overview

#artificialintelligence

FPN (Feature Pyramid Network) is a type of convolutional neural network architecture for object detection tasks. It is designed to improve the performance of object detection models by making use of both high-level and low-level features from the input image. The basic idea behind FPN is to build a pyramid of features, where each level in the pyramid represents a different scale or resolution of the input image. The top of the pyramid represents the high-level, semantically rich features, while the bottom of the pyramid represents the low-level, fine-grained features. By combining features from different levels in the pyramid, the model is able to make use of both the semantically rich high-level features and the fine-grained low-level features to improve the accuracy of object detection.


Panoptic Segmentation Explained

#artificialintelligence

We all know a single image may convey a message more effectively than a lot of written words. But what consists of an image? When it comes to image segmentation, a common answer might be "things" and "stuff". The concept of things and stuff is used when describing image segmentation methods such as instance and semantic segmentation. Instance segmentation is the identification of countable objects, while semantic segmentation is the identification of regions of texture.