AITopics

2509.21358

Country: North America > United States > Missouri > Jackson County > Kansas City (0.15)

Genre: Research Report > New Finding (0.66)

Industry:

Health & Medicine > Therapeutic Area > Ophthalmology/Optometry (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.93)
Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (0.67)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Mazzawi, Hanna, Gonzalvo, Xavi, Wunder, Michael

Deep Fusion: Efficient Network Training via Pre-trained Initializations

arXiv.org Artificial Intelligence, Jun-20-2023

In recent years, deep learning has made remarkable progress in a wide range of domains, with a particularly notable impact on natural language processing tasks. One of the challenges associated with training deep neural networks is the need for large amounts of computational resources and time. In this paper, we present Deep Fusion, an efficient approach to network training that leverages pre-trained initializations of smaller networks. We show that Deep Fusion accelerates the training process, reduces computational requirements, and leads to improved generalization performance on a variety of NLP tasks and T5 model sizes. Our experiments demonstrate that Deep Fusion is a practical and effective approach to reduce the training time and resource consumption while maintaining, or even surpassing, the performance of traditional training methods.

artificial intelligence, machine learning, natural language

2306.11903

Country:

North America > United States > New York (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Saihood, Ahmed, Karshenas, Hossein, Nilchi, AhmadReza Naghsh

Deep fusion of gray level co-occurrence matrices for lung nodule classification

arXiv.org Artificial Intelligence, Nov-5-2022

Lung cancer is a severe threat to human health, and millions of people die because of late diagnosis; it is therefore vital to detect the disease as early as possible. Computed tomography (CT) chest scan analysis is regarded as one of the efficient solutions for detecting and classifying lung nodules. Achieving high accuracy when analyzing CT scan images of the lung is one of the crucial challenges in detecting and classifying lung cancer. A new long short-term memory (LSTM) based deep fusion structure is introduced, in which texture features computed from lung nodules through new volumetric grey-level co-occurrence matrix (GLCM) computations are used to classify the nodules as benign, malignant, or ambiguous. An improved Otsu segmentation method combined with the water strider optimization algorithm (WSA) is proposed to detect the lung nodules. Otsu-WSA thresholding can overcome the restrictions present in previous thresholding methods. Extended experiments are run to assess this fusion structure by considering 2D-GLCM computation-based 2D-slice fusion, and an approximation of the 3D-GLCM with a volumetric 2.5D-GLCM computation-based LSTM fusion structure. The proposed methods are trained and assessed on the LIDC-IDRI dataset, where accuracy, sensitivity, and specificity of 94.4%, 91.6%, and 95.8%, respectively, are obtained for 2D-GLCM fusion, and 97.33%, 96%, and 98%, respectively, for 2.5D-GLCM fusion. The corresponding values for 3D-GLCM fusion are 98.7%, 98%, and 99%. The results and analysis indicate that the WSA-Otsu method requires less execution time and yields a more accurate thresholding process. The 3D-GLCM based LSTM is found to outperform its counterparts.
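For readers unfamiliar with the texture descriptor named in this abstract, the following is a minimal sketch of a plain 2D grey-level co-occurrence matrix and one texture feature derived from it (contrast). It is not the authors' volumetric 2.5D or 3D computation, and the function names `glcm`, `normalize`, and `contrast`, the pixel offset, and the toy 4x4 image are all assumptions chosen for illustration.

```python
from collections import Counter


def glcm(image, dx=1, dy=0, symmetric=True):
    """Count how often grey level i occurs next to grey level j.

    `image` is a list of rows of integer grey levels. The neighbour of
    pixel (r, c) is taken at (r + dy, c + dx). With symmetric=True each
    pair is counted in both directions.
    """
    counts = Counter()
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                i, j = image[r][c], image[r2][c2]
                counts[(i, j)] += 1
                if symmetric:
                    counts[(j, i)] += 1
    return counts


def normalize(counts):
    """Turn raw pair counts into joint probabilities that sum to 1."""
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()}


def contrast(p):
    """Contrast feature: sum of p(i, j) * (i - j)^2 over all pairs."""
    return sum(prob * (i - j) ** 2 for (i, j), prob in p.items())


# Toy 4x4 image with four grey levels (illustrative data only).
image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 2, 2, 2],
    [2, 2, 3, 3],
]

counts = glcm(image)          # horizontal neighbour, symmetric
p = normalize(counts)
texture_contrast = contrast(p)
```

For this toy image the symmetric horizontal matrix holds 24 pair counts in total and the contrast works out to 14/24. In a pipeline like the one described, a vector of such features per slice or per volume would be the input to the classifier.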

artificial intelligence, classification, machine learning

doi: 10.1371/journal.pone.0274516

2205.05123

Country:

Europe > Switzerland > Basel-City > Basel (0.04)
North America > United States > Washington > Whatcom County > Bellingham (0.04)
Asia > Pakistan (0.04)

Genre: Research Report > Experimental Study (0.46)

Industry:

Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (1.00)
Health & Medicine > Therapeutic Area > Oncology > Lung Cancer (0.90)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Mandal, Supriyo, Maiti, Abyayananda

FusionDeepMF: A Dual Embedding based Deep Fusion Model for Recommendation

arXiv.org Artificial Intelligence, Oct-11-2022

Traditional Collaborative Filtering (CF) based methods are applied to understand the personal preferences of users/customers for items or products from the rating matrix. The rating matrix is usually sparse, so some improved variants of the CF method use the increasing amount of side information to handle the sparsity problem. Most of the available recommendation-related work applies either only a linear kernel or only a non-linear kernel to learn user-item latent feature embeddings from data. Neither kernel alone is sufficient to learn complex user-item features from the side information of users. Recently, some researchers have focused on hybrid models that learn some features with non-linear kernels and other features with linear kernels. However, it is very difficult to determine which features can be learned accurately with linear kernels and which with non-linear kernels. To overcome this problem, we propose a novel deep fusion model named FusionDeepMF. The novel aspects of this model are i) learning the user-item rating matrix and side information through linear and non-linear kernels simultaneously, and ii) applying a tuning parameter that determines the trade-off between the dual embeddings generated from the linear and non-linear kernels. Extensive experiments on online review datasets establish that FusionDeepMF performs remarkably well compared to other baseline approaches. Empirical evidence also shows that FusionDeepMF achieves better performance than the linear kernels of Matrix Factorization (MF) and the non-linear kernels of Multi-layer Perceptron (MLP).
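The trade-off idea in this abstract can be illustrated with a small sketch. The abstract does not give the exact fusion formula, so the convex combination below, the parameter name `alpha`, the helper names, and all weights and vectors are assumptions for illustration only, not the FusionDeepMF model itself.

```python
def mf_score(user_vec, item_vec):
    """Linear path: dot product of user and item embeddings (MF style)."""
    return sum(u * v for u, v in zip(user_vec, item_vec))


def mlp_score(user_vec, item_vec, hidden_weights, output_weights):
    """Non-linear path: one ReLU hidden layer over the concatenated
    embeddings, followed by a linear output unit (MLP style)."""
    x = list(user_vec) + list(item_vec)
    hidden = [
        max(0.0, sum(w * xi for w, xi in zip(row, x)))
        for row in hidden_weights
    ]
    return sum(w * h for w, h in zip(output_weights, hidden))


def fused_score(linear, nonlinear, alpha):
    """Assumed fusion rule: alpha weights the linear path and
    (1 - alpha) weights the non-linear path."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return alpha * linear + (1.0 - alpha) * nonlinear


# Hypothetical embeddings and weights, chosen so the arithmetic is easy.
user = [1.0, 2.0]
item = [0.5, 1.0]
hidden_weights = [
    [1.0, 0.0, 0.0, 0.0],   # picks out the first user feature
    [0.0, 0.0, 0.0, -1.0],  # negative pre-activation, zeroed by ReLU
]
output_weights = [2.0, 3.0]

linear = mf_score(user, item)                                    # 2.5
nonlinear = mlp_score(user, item, hidden_weights, output_weights)  # 2.0
balanced = fused_score(linear, nonlinear, alpha=0.5)             # 2.25
```

Setting `alpha` to 1.0 or 0.0 recovers the purely linear or purely non-linear score, which is one simple way to read "a tuning parameter determining the trade-off between the dual embeddings".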

artificial intelligence, machine learning, reliability score

2210.05338

Country:

South America > Argentina > Patagonia > Río Negro Province > Viedma (0.04)
Europe > Germany > Schleswig-Holstein > Kiel (0.04)
Asia > India > Bihar (0.04)

Genre: Research Report (1.00)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.68)

#artificialintelligence, Oct-8-2019, 02:07:15 GMT

Apple's Deep Fusion photography comes to iPhone 11 in iOS 13.2 beta (updated)

You now have a chance to try Apple's machine learning-based Deep Fusion photography if you're willing to live on the bleeding edge. Apple is releasing an iOS 13.2 developer beta (a public beta is likely to follow soon) that makes Deep Fusion available to iPhone 11 and iPhone 11 Pro owners. The technique uses machine learning to create highly detailed, sharper and more natural-looking photos on the primary and telephoto lenses by combining the results of multiple shots. Deep Fusion takes an underexposed photo for sharpness, and blends that with three neutral pictures and a long high-exposure image on a per-pixel level to achieve a highly customized result. The machine learning system examines the context of the picture to understand where a pixel sits on the frequency spectrum.

apple, ios 13, iphone 11

Industry: Media > Photography (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications > Mobile (0.94)

#artificialintelligence, Oct-3-2019, 21:47:14 GMT

Apple's Deep Fusion hands-on: AI sharpens photos like HDR fixes colors

Digital photographers coined the term "pixel peepers" years ago to denote -- mostly with scorn -- people who focused on flaws in the individual dots that create photos rather than the entirety of the images. Zooming in to 100%, it was said, is nothing but a recipe for perpetual disappointment; instead, judge each camera by the overall quality of the photo it takes, and don't get too mired in the details. Until now, Apple's approach to digital photography has been defined by its commitment to improving the quality of the big picture without further compromising pixel-level quality. I say "further" because there's no getting around the fact that tiny phone camera sensors are physically incapable of matching the pixel-level results of full-frame DSLR camera sensors in a fair fight. Bigger sensors can capture more light and almost invariably more actual pixels than the iPhone's 12-megapixel cameras.

apple, deep fusion, fusion

Industry: Media > Photography (0.58)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Mobile (0.42)

#artificialintelligence, Sep-16-2019, 10:20:03 GMT

Apple's New iPhone 11 Pro Has the First Artificially Intelligent Camera

Apple's new high-end iPhone will make any traditional camera manufacturer tremble. The iPhone 11 Pro, unveiled at a special Apple event on Tuesday, not only has three cameras in the back--each having its own functions--but also for the first time utilizes artificial intelligence to take a photo. Yes, the next time you feel proud of snapping a perfect pic, it may have actually been the little robot living inside your phone. Here's how it works: On the iPhone 11 Pro, every time you are about to take a picture, the cameras will quickly take eight images of the object before you press the shutter. When you actually take a photo, the phone will compare your image against the eight previously taken ones and merge the best pixels of each image into one final product.

apple, artificially intelligent camera, iphone 11

Country: North America > United States > California > Santa Clara County > Cupertino (0.07)

Technology:

Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence (1.00)

#artificialintelligence, Sep-10-2019, 21:22:23 GMT

Deep Fusion is the iPhone's take on AI photography

In announcing the iPhone 11 Pro, Phil Schiller tipped us off to a new feature that'll come to the flagship smartphones in the next year. Deep Fusion is a system that Schiller describes as "computational photography mad science," and it is likely to be the company's answer, more or less, to Google's Night Sight. As Schiller explained, when you're about to take an image with the new iPhone 11 Pro, the camera will snap 8 images before you press the shutter. When you do, it'll then take one long exposure, and then stitch a new image together "pixel-by-pixel" to create one with lots of detail and very little noise. It's not specifically designed for shooting in the dark, but it's clear that Apple is parking its tanks on Google's lawn. Night Sight has been one of the strengths of the last few Pixel phones, using machine learning to create well-lit images in dark environments.

ai photography, artificial intelligence, deep fusion

Industry: Media > Photography (0.98)

Technology:

Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence (1.00)

arXiv.org Artificial Intelligence, Jul-27-2018

A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition

Toshniwal, Shubham, Kannan, Anjuli, Chiu, Chung-Cheng, Wu, Yonghui, Sainath, Tara N, Livescu, Karen

Attention-based recurrent neural encoder-decoder models present an elegant solution to the automatic speech recognition problem. This approach folds the acoustic model, pronunciation model, and language model into a single network and requires only a parallel corpus of speech and text for training. However, unlike in conventional approaches that combine separate acoustic and language models, it is not clear how to use additional (unpaired) text. While there has been previous work on methods addressing this problem, a thorough comparison among methods is still lacking. In this paper, we compare a suite of past methods and some of our own proposed methods for using unpaired text data to improve encoder-decoder models. For evaluation, we use the medium-sized Switchboard data set and the large-scale Google voice search and dictation data sets. Our results confirm the benefits of using unpaired text across a range of methods and data sets. Surprisingly, for first-pass decoding, the rather simple approach of shallow fusion performs best across data sets. However, for Google data sets we find that cold fusion has a lower oracle error rate and outperforms other approaches after second-pass rescoring on the Google voice search data set.
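Shallow fusion, the approach this abstract reports as strongest for first-pass decoding, is commonly described as a log-linear interpolation of the recognizer's score and a separately trained language model's score at decoding time. The sketch below shows a single decoding step under that description. The function names, the weight `lm_weight`, and the probability values are made up for illustration and are not taken from the paper.

```python
import math


def shallow_fusion_scores(asr_log_probs, lm_log_probs, lm_weight):
    """Score each candidate token as
    log p_asr(token) + lm_weight * log p_lm(token)."""
    return {
        token: asr_log_probs[token] + lm_weight * lm_log_probs[token]
        for token in asr_log_probs
    }


def best_token(scores):
    """Return the candidate with the highest combined score."""
    return max(scores, key=scores.get)


# Illustrative next-token distributions for three homophone candidates.
asr = {
    "two": math.log(0.45),
    "too": math.log(0.50),
    "to": math.log(0.05),
}
lm = {
    "two": math.log(0.70),
    "too": math.log(0.10),
    "to": math.log(0.20),
}

# With weight 0 the language model is ignored and the acoustic favourite wins.
without_lm = best_token(shallow_fusion_scores(asr, lm, lm_weight=0.0))

# With a modest weight the language model's preference changes the choice.
with_lm = best_token(shallow_fusion_scores(asr, lm, lm_weight=0.3))
```

Here the recognizer alone prefers "too", while adding the language model term with weight 0.3 selects "two". In a full decoder the same combined score would be applied to every beam hypothesis at every step, and the weight would be tuned on held-out data.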

artificial intelligence, machine learning, natural language

1807.10857

Country: North America > United States > Illinois > Cook County > Chicago (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)