AITopics | convolutional neural network architecture

Neural Information Processing Systems http://nips.cc/

convolutional neural network architecture, matching natural language sentence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Add feedback

Convolutional Neural Network Architectures for Matching Natural Language Sentences

Neural Information Processing SystemsSep-30-2025, 10:09:16 GMT

Semantic matching is of central importance to many natural language tasks \cite{bordes2014semantic,RetrievalQA}. A successful matching algorithm needs to adequately model the internal structures of language objects and the interaction between them. As a step toward this goal, we propose convolutional neural network models for matching two sentences, by adapting the convolutional strategy in vision and speech. The proposed models not only nicely represent the hierarchical structures of sentences with their layer-by-layer composition and pooling, but also capture the rich matching patterns at different levels. Our models are rather generic, requiring no prior knowledge on language, and can hence be applied to matching tasks of different nature and in different languages. The empirical study on a variety of matching tasks demonstrates the efficacy of the proposed model on a variety of matching tasks and its superiority to competitor models.

convolutional neural network architecture, matching natural language sentence, name change, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Convolutional Neural Network Architectures for Matching Natural Language Sentences

Baotian Hu, Zhengdong Lu, Hang Li, Qingcai Chen

Neural Information Processing SystemsFeb-9-2025, 18:14:53 GMT

Semantic matching is of central importance to many natural language tasks [2, 28]. A successful matching algorithm needs to adequately model the internal structures of language objects and the interaction between them. As a step toward this goal, we propose convolutional neural network models for matching two sentences, by adapting the convolutional strategy in vision and speech. The proposed models not only nicely represent the hierarchical structures of sentences with their layerby-layer composition and pooling, but also capture the rich matching patterns at different levels. Our models are rather generic, requiring no prior knowledge on language, and can hence be applied to matching tasks of different nature and in different languages. The empirical study on a variety of matching tasks demonstrates the efficacy of the proposed model on a variety of matching tasks and its superiority to competitor models.

artificial intelligence, machine learning, representation, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Washington > King County > Seattle (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
(3 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Convolutional Neural Network Architectures for Matching Natural Language Sentences Zhengdong Lu Hang Li Qingcai Chen

Neural Information Processing SystemsMar-13-2024, 12:17:33 GMT

Semantic matching is of central importance to many natural language tasks [2, 28]. A successful matching algorithm needs to adequately model the internal structures of language objects and the interaction between them. As a step toward this goal, we propose convolutional neural network models for matching two sentences, by adapting the convolutional strategy in vision and speech. The proposed models not only nicely represent the hierarchical structures of sentences with their layerby-layer composition and pooling, but also capture the rich matching patterns at different levels. Our models are rather generic, requiring no prior knowledge on language, and can hence be applied to matching tasks of different nature and in different languages. The empirical study on a variety of matching tasks demonstrates the efficacy of the proposed model on a variety of matching tasks and its superiority to competitor models.

architecture, proceedings, representation, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Washington > King County > Seattle (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
(3 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Optimizing Convolutional Neural Network Architecture

Balderas, Luis, Lastra, Miguel, Benítez, José M.

arXiv.org Artificial IntelligenceDec-17-2023

Convolutional Neural Networks (CNN) are widely used to face challenging tasks like speech recognition, natural language processing or computer vision. As CNN architectures get larger and more complex, their computational requirements increase, incurring significant energetic costs and challenging their deployment on resource-restricted devices. In this paper, we propose Optimizing Convolutional Neural Network Architecture (OCNNA), a novel CNN optimization and construction method based on pruning and knowledge distillation designed to establish the importance of convolutional layers. The proposal has been evaluated though a thorough empirical study including the best known datasets (CIFAR-10, CIFAR-100 and Imagenet) and CNN architectures (VGG-16, ResNet-50, DenseNet-40 and MobileNet), setting Accuracy Drop and Remaining Parameters Ratio as objective metrics to compare the performance of OCNNA against the other state-of-art approaches. Our method has been compared with more than 20 convolutional neural network simplification algorithms obtaining outstanding results. As a result, OCNNA is a competitive CNN constructing method which could ease the deployment of neural networks into IoT or resource-limited devices.

dataset, neural network, ocnna, (11 more...)

arXiv.org Artificial Intelligence

2401.01361

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Europe > Spain > Andalusia > Granada Province > Granada (0.04)
North America > United States > New York (0.04)
North America > United States > Massachusetts (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

PCONet: A Convolutional Neural Network Architecture to Detect Polycystic Ovary Syndrome (PCOS) from Ovarian Ultrasound Images

Hosain, A. K. M. Salman, Mehedi, Md Humaion Kabir, Kabir, Irteza Enan

arXiv.org Artificial IntelligenceOct-1-2022

Polycystic Ovary Syndrome (PCOS) is an endrocrinological dysfunction prevalent among women of reproductive age. PCOS is a combination of syndromes caused by an excess of androgens - a group of sex hormones - in women. Syndromes including acne, alopecia, hirsutism, hyperandrogenaemia, oligo-ovulation, etc. are caused by PCOS. It is also a major cause of female infertility. An estimated 15% of reproductive-aged women are affected by PCOS globally. The necessity of detecting PCOS early due to the severity of its deleterious effects cannot be overstated. In this paper, we have developed PCONet - a Convolutional Neural Network (CNN) - to detect polycistic ovary from ovarian ultrasound images. We have also fine tuned InceptionV3 - a pretrained convolutional neural network of 45 layers - by utilizing the transfer learning method to classify polcystic ovarian ultrasound images. We have compared these two models on various quantitative performance evaluation parameters and demonstrated that PCONet is the superior one among these two with an accuracy of 98.12%, whereas the fine tuned InceptionV3 showcased an accuracy of 96.56% on test images.

artificial intelligence, machine learning, pconet, (15 more...)

arXiv.org Artificial Intelligence

2210.00407

Country:

Asia > Bangladesh > Dhaka Division > Dhaka District > Dhaka (0.04)
North America > United States (0.04)
Europe > Netherlands > South Holland > Rotterdam (0.04)
(2 more...)

Genre: Research Report (0.82)

Industry:

Health & Medicine > Therapeutic Area > Obstetrics/Gynecology (0.72)
Health & Medicine > Therapeutic Area > Endocrinology (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Putting a strain on semiconductors for next-gen chips

#artificialintelligenceJul-16-2021, 21:05:27 GMT

Skoltech researchers and their colleagues from the U.S. and Singapore have created a neural network that can help tweak semiconductor crystals in a controlled fashion to achieve superior properties for electronics. This enables a new direction of development of next-generation chips and solar cells by exploiting a controllable deformation that may change the properties of a material on the fly. The paper was published in the journal npj Computational Materials. Materials at the nanoscale can withstand major deformation. In what's called the strained state, they can exhibit remarkable optical, thermal, electronic, and other properties due to a change in interatomic distances.

semiconductor, skoltech, tsymbalov, (10 more...)

#artificialintelligence

Country:

Asia > Singapore (0.25)
North America > United States > Massachusetts (0.07)
Europe > Russia (0.07)
Asia > Russia (0.07)

Genre: Research Report (0.70)

Industry: Energy (0.38)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.78)

Add feedback

Neural Network Structure Design based on N-Gauss Activation Function

Lu, Xiangri, Ma, Hongbin, Zhang, Jingcheng

arXiv.org Artificial IntelligenceJun-1-2021

Recent work has shown that the activation function of the convolutional neural network can meet the Lipschitz condition, then the corresponding convolutional neural network structure can be constructed according to the scale of the data set, and the data set can be trained more deeply, more accurately and more effectively. In this article, we have accepted the experimental results and introduced the core block N-Gauss, N-Gauss, and Swish (Conv1, Conv2, FC1) neural network structure design to train MNIST, CIFAR10, and CIFAR100 respectively. Experiments show that N-Gauss gives full play to the main role of nonlinear modeling of activation functions, so that deep convolutional neural networks have hierarchical nonlinear mapping learning capabilities. At the same time, the training ability of N-Gauss on simple one-dimensional channel small data sets is equivalent to the performance of ReLU and Swish.

activation function, neural network, neural network structure design, (11 more...)

arXiv.org Artificial Intelligence

2106.07562

Country:

Asia > China > Beijing > Beijing (0.05)
North America > United States > New York (0.04)
Asia > Japan (0.04)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Artificial Intelligence for X-Ray and CT-Scan in Modern Healthcare

#artificialintelligenceDec-1-2020, 04:20:39 GMT

The applications of Artificial Intelligence (AI) for X-ray and CT-Scan image analysis using Convolutional neural network architectures, Generative adversarial networks, transfer learning, and data augmentation techniques are discussed. Currently, AI algorithms embedded on a mobile x-ray and CT-Scan devices for automated diagnosis, measurements, case prioritization, and quality control are most popular research area. More than 60,000 research articles have been published related to the use of deep learning in healthcare and related applications. Established architectures, such as ResNet-50 or DenseNet-161 (with 50 and 161 representing the number of layers within the respective neural network) are easy to use. Integration of the AI modules with the drug systems and the experts are the key issues of implementing AI systems in healthcare.

architecture, network architecture, neural network, (14 more...)

#artificialintelligence

Industry: Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Convolutional Neural Network Architectures for Matching Natural Language Sentences

Hu, Baotian, Lu, Zhengdong, Li, Hang, Chen, Qingcai

Neural Information Processing SystemsFeb-14-2020, 09:13:25 GMT

Semantic matching is of central importance to many natural language tasks \cite{bordes2014semantic,RetrievalQA}. A successful matching algorithm needs to adequately model the internal structures of language objects and the interaction between them. As a step toward this goal, we propose convolutional neural network models for matching two sentences, by adapting the convolutional strategy in vision and speech. The proposed models not only nicely represent the hierarchical structures of sentences with their layer-by-layer composition and pooling, but also capture the rich matching patterns at different levels. Our models are rather generic, requiring no prior knowledge on language, and can hence be applied to matching tasks of different nature and in different languages.

convolutional neural network architecture, matching natural language sentence

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Add feedback

Filters

Collaborating Authors

convolutional neural network architecture

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Convolutional Neural Network Architectures for Matching Natural Language Sentences

Convolutional Neural Network Architectures for Matching Natural Language Sentences

Convolutional Neural Network Architectures for Matching Natural Language Sentences

Convolutional Neural Network Architectures for Matching Natural Language Sentences Zhengdong Lu Hang Li Qingcai Chen

Optimizing Convolutional Neural Network Architecture

PCONet: A Convolutional Neural Network Architecture to Detect Polycystic Ovary Syndrome (PCOS) from Ovarian Ultrasound Images

Putting a strain on semiconductors for next-gen chips

Neural Network Structure Design based on N-Gauss Activation Function

Artificial Intelligence for X-Ray and CT-Scan in Modern Healthcare

Convolutional Neural Network Architectures for Matching Natural Language Sentences