AITopics | Vazquez, Eduard

Vazquez, Eduard

TextAug: Test time Text Augmentation for Multimodal Person Re-identification

Fawakherji, Mulham, Vazquez, Eduard, Giampa, Pasquale, Bhattarai, Binod

arXiv.org Artificial Intelligence, Dec-3-2023

Multimodal person re-identification is gaining popularity in the research community due to its effectiveness compared to its unimodal counterparts. However, the bottleneck for multimodal deep learning is the need for a large volume of multimodal training examples. Data augmentation techniques such as cropping, flipping, and rotation are often employed in the image domain to improve the generalization of deep learning models. Augmenting modalities other than images, such as text, is challenging and requires significant computational resources and external data sources. In this study, we investigate the effectiveness of two computer vision data augmentation techniques, cutout and cutmix, for text augmentation in multimodal person re-identification. Our approach merges these two augmentation strategies into one strategy called CutMixOut, which involves randomly removing words or sub-phrases from a sentence (Cutout) and blending parts of two or more sentences to create diverse examples (CutMix), with a certain probability assigned to each operation. This augmentation was implemented at inference time without any prior training. Our results demonstrate that the proposed technique is simple and effective in improving performance on multiple multimodal person re-identification benchmarks.
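
The two text operations described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the contiguous-span cutout, the head-plus-tail blend, the default probabilities, and the `pool` of other captions are all assumptions made for illustration.

```python
import random


def cutout(sentence, max_frac=0.3, rng=random):
    """Remove one random contiguous span of words (assumed form of Cutout)."""
    words = sentence.split()
    if len(words) < 2:
        return sentence  # nothing can be removed without emptying the text
    span = rng.randint(1, max(1, int(len(words) * max_frac)))
    start = rng.randint(0, len(words) - span)
    return " ".join(words[:start] + words[start + span:])


def cutmix(sentence_a, sentence_b, rng=random):
    """Blend two sentences: a head of the first plus a tail of the second."""
    a, b = sentence_a.split(), sentence_b.split()
    if not a or not b:
        return sentence_a or sentence_b
    cut_a = rng.randint(1, len(a))      # keep at least one word of a
    cut_b = rng.randint(0, len(b) - 1)  # keep at least one word of b
    return " ".join(a[:cut_a] + b[cut_b:])


def cutmixout(sentence, pool, p_cutout=0.5, p_cutmix=0.5, rng=random):
    """Apply each operation independently with its own probability.

    `pool` is a hypothetical list of other captions to blend with.
    """
    out = sentence
    if pool and rng.random() < p_cutmix:
        out = cutmix(out, rng.choice(pool), rng)
    if rng.random() < p_cutout:
        out = cutout(out, rng=rng)
    return out


if __name__ == "__main__":
    rng = random.Random(0)
    query = "a man wearing a red jacket and blue jeans"
    others = ["a woman carrying a black bag", "a man in a white shirt"]
    for _ in range(3):
        print(cutmixout(query, others, rng=rng))
```

Because the operations act only on the query text, a sketch like this can be run at inference time to produce several variants of a description without retraining any model; how those variants are then combined is not specified here.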

artificial intelligence, machine learning, person re-identification, (18 more...)

2312.01605

Country:

North America > United States (0.14)
Europe > Portugal (0.14)
Europe > Italy (0.14)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

ConvNeXtv2 Fusion with Mask R-CNN for Automatic Region Based Coronary Artery Stenosis Detection for Disease Diagnosis

Pokhrel, Sandesh, Bhandari, Sanjay, Vazquez, Eduard, Shrestha, Yash Raj, Bhattarai, Binod

arXiv.org Artificial Intelligence, Oct-7-2023

Coronary artery diseases (CADs), although preventable, are one of the leading causes of mortality worldwide. Due to the onerous nature of diagnosis, tackling CADs has proved challenging. This study addresses the automation of the resource-intensive and time-consuming process of manually detecting stenotic lesions in coronary arteries in X-ray coronary angiography images. To overcome this challenge, we employ a specialized ConvNeXt-V2 backbone based Mask R-CNN model pre-trained for instance segmentation tasks. Our empirical findings affirm that the proposed model exhibits commendable performance in identifying stenotic lesions. Notably, our approach achieves an F1 score of 0.5353 in this demanding task, underscoring its effectiveness in streamlining this intensive process.
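
For readers unfamiliar with the metric, the F1 score is the harmonic mean of precision and recall. The sketch below shows the standard computation from detection counts; the function name and the example counts are hypothetical and are not taken from the paper, which does not state its matching criteria here.

```python
def f1_score(tp, fp, fn):
    """F1 from true positives, false positives, and false negatives."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    print(f1_score(tp=50, fp=40, fn=45))
```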

artificial intelligence, machine learning, segmentation, (16 more...)

2310.04749

Country: Europe > Switzerland > Zürich > Zürich (0.14)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Diagnostic Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.71)

CholecTriplet2022: Show me a tool and tell me the triplet -- an endoscopic vision challenge for surgical action triplet detection

Nwoye, Chinedu Innocent, Yu, Tong, Sharma, Saurav, Murali, Aditya, Alapatt, Deepak, Vardazaryan, Armine, Yuan, Kun, Hajek, Jonas, Reiter, Wolfgang, Yamlahi, Amine, Smidt, Finn-Henri, Zou, Xiaoyang, Zheng, Guoyan, Oliveira, Bruno, Torres, Helena R., Kondo, Satoshi, Kasai, Satoshi, Holm, Felix, Özsoy, Ege, Gui, Shuangchun, Li, Han, Raviteja, Sista, Sathish, Rachana, Poudel, Pranav, Bhattarai, Binod, Wang, Ziheng, Rui, Guo, Schellenberg, Melanie, Vilaça, João L., Czempiel, Tobias, Wang, Zhenkun, Sheet, Debdoot, Thapa, Shrawan Kumar, Berniker, Max, Godau, Patrick, Morais, Pedro, Regmi, Sudarshan, Tran, Thuy Nuong, Fonseca, Jaime, Nölke, Jan-Hinrich, Lima, Estevão, Vazquez, Eduard, Maier-Hein, Lena, Navab, Nassir, Mascagni, Pietro, Seeliger, Barbara, Gonzalez, Cristians, Mutter, Didier, Padoy, Nicolas

arXiv.org Artificial Intelligence, Jul-14-2023

Formalizing surgical activities as triplets of the used instruments, actions performed, and target anatomies is becoming a gold standard approach for surgical activity modeling. The benefit is that this formalization helps to obtain a more detailed understanding of tool-tissue interaction, which can be used to develop better Artificial Intelligence assistance for image-guided surgery. Earlier efforts and the CholecTriplet challenge introduced in 2021 have put together techniques aimed at recognizing these triplets from surgical footage. Also estimating the spatial locations of the triplets would offer more precise intraoperative context-aware decision support for computer-assisted intervention. This paper presents the CholecTriplet2022 challenge, which extends surgical action triplet modeling from recognition to detection. It includes weakly-supervised bounding box localization of every visible surgical instrument (or tool), as the key actors, and the modeling of each tool-activity in the form of a triplet. The paper describes a baseline method and 10 new deep learning algorithms presented at the challenge to solve the task. It also provides thorough methodological comparisons of the methods; an in-depth analysis of the obtained results across multiple metrics and across visual and procedural challenges; their significance; and useful insights for future research directions and applications in surgery.

artificial intelligence, instrument, machine learning, (19 more...)

doi: 10.1016/j.media.2023.102888

2302.06294

Country:

Asia (1.00)
Europe > Germany (0.46)

Genre: Research Report > New Finding (0.47)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Surgery (1.00)
Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Placental Vessel Segmentation and Registration in Fetoscopy: Literature Review and MICCAI FetReg2021 Challenge Findings

Bano, Sophia, Casella, Alessandro, Vasconcelos, Francisco, Qayyum, Abdul, Benzinou, Abdesslam, Mazher, Moona, Meriaudeau, Fabrice, Lena, Chiara, Cintorrino, Ilaria Anita, De Paolis, Gaia Romana, Biagioli, Jessica, Grechishnikova, Daria, Jiao, Jing, Bai, Bizhe, Qiao, Yanyan, Bhattarai, Binod, Gaire, Rebati Raman, Subedi, Ronast, Vazquez, Eduard, Płotka, Szymon, Lisowska, Aneta, Sitek, Arkadiusz, Attilakos, George, Wimalasundera, Ruwan, David, Anna L, Paladini, Dario, Deprest, Jan, De Momi, Elena, Mattos, Leonardo S, Moccia, Sara, Stoyanov, Danail

arXiv.org Artificial Intelligence, Feb-26-2023

Fetoscopy laser photocoagulation is a widely adopted procedure for treating Twin-to-Twin Transfusion Syndrome (TTTS). The procedure involves photocoagulating pathological anastomoses to regulate blood exchange among twins. The procedure is particularly challenging due to the limited field of view, poor manoeuvrability of the fetoscope, poor visibility, and variability in illumination. These challenges may lead to increased surgery time and incomplete ablation. Computer-assisted intervention (CAI) can provide surgeons with decision support and context awareness by identifying key structures in the scene and expanding the fetoscopic field of view through video mosaicking. Research in this domain has been hampered by the lack of high-quality data to design, develop and test CAI algorithms. Through the Fetoscopic Placental Vessel Segmentation and Registration (FetReg2021) challenge, which was organized as part of the MICCAI2021 Endoscopic Vision challenge, we released the first large-scale multicentre TTTS dataset for the development of generalized and robust semantic segmentation and video mosaicking algorithms. For this challenge, we released a dataset of 2060 images, pixel-annotated for vessels, tool, fetus and background classes, from 18 in-vivo TTTS fetoscopy procedures and 18 short video clips. Seven teams participated in this challenge and their model performance was assessed on an unseen test dataset of 658 pixel-annotated images from 6 fetoscopic procedures and 6 short clips. The challenge provided an opportunity for creating generalized solutions for fetoscopic scene understanding and mosaicking. In this paper, we present the findings of the FetReg2021 challenge alongside a detailed literature review for CAI in TTTS fetoscopy. Through this challenge, its analysis and the release of multicentre fetoscopic data, we provide a benchmark for future research in this field.

artificial intelligence, machine learning, survey article, (21 more...)

2206.12512

Country:

Europe (1.00)
North America > Canada > Ontario > Toronto (0.14)
North America > United States > Massachusetts (0.14)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Health & Medicine > Surgery (1.00)
Health & Medicine > Health Care Providers & Services (0.93)
Health & Medicine > Diagnostic Medicine > Imaging (0.46)
Health & Medicine > Therapeutic Area > Ophthalmology/Optometry (0.34)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.68)
