
A Comparative Study on Code Generation with Transformers

Das, Namrata, Panta, Rakshya, Karki, Neelam, Manandhar, Ruchi, Kshatri, Dinesh Baniya

arXiv.org Artificial Intelligence

In an era of widespread influence of Natural Language Processing (NLP), there have been multiple research efforts to supplant traditional manual coding techniques with automated systems capable of generating solutions autonomously. With rapid research into code generation and a predominant focus on large language models, there is a need to compare and evaluate the performance of transformer architectures of varying complexity. This paper presents a comparative study of code generation with Transformers: models based on the Transformer architecture and NLP methodologies are used to automatically generate C++ source code for different varieties of problems. The study evaluates the robustness of transformer-based models with respect to their architectural complexity and their capability to handle diverse problem sets, from basic arithmetic to complex computations.


Categorising Products in an Online Marketplace: An Ensemble Approach

Drumm, Kieron

arXiv.org Artificial Intelligence

In recent years, product categorisation has been a common issue for e-commerce companies, which have utilised machine learning to categorise their products automatically. In this study, we propose an ensemble approach that uses a combination of different models to separately predict each product's category, subcategory, and colour before ultimately combining the resultant predictions for each product. With this approach, we show that an average F1-score of 0.82 can be achieved using a combination of XGBoost and k-nearest neighbours to predict said features.
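The per-attribute ensemble described above can be sketched roughly as follows. This is a minimal illustration on invented synthetic data, using scikit-learn's GradientBoostingClassifier as a stand-in for XGBoost; the feature vectors, labels, and model choices here are assumptions, not the paper's actual pipeline:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.neighbors import KNeighborsClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 8))            # stand-in product feature vectors
y_category = (X[:, 0] > 0).astype(int)   # synthetic per-attribute labels
y_colour = (X[:, 1] > 0).astype(int)

# One model per attribute, trained independently:
category_model = GradientBoostingClassifier().fit(X, y_category)  # XGBoost stand-in
colour_model = KNeighborsClassifier(n_neighbors=5).fit(X, y_colour)

# Combine the separate predictions into one record per product
predictions = list(zip(category_model.predict(X), colour_model.predict(X)))
print(predictions[0])
```

In a real deployment each attribute model would be evaluated on held-out data and the combined tuples written back to the product catalogue; the point of the sketch is only the structure of predicting each attribute separately and merging afterwards.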


Batch Norm Explained Visually -- How it works, and why neural networks need it

#artificialintelligence

Batch Norm is an essential part of the toolkit of the modern deep learning practitioner. Soon after it was introduced in the Batch Normalization paper, it was recognized as being transformational in creating deeper neural networks that could be trained faster. Batch Norm is a neural network layer that is now commonly used in many architectures. It often gets added as part of a Linear or Convolutional block and helps to stabilize the network during training. In this article, we will explore what Batch Norm is, why we need it and how it works.
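What a Batch Norm layer computes at training time can be shown in a few lines of NumPy: normalise each feature over the batch dimension, then scale and shift. In a real network, gamma and beta are learnable parameters; here they are fixed for illustration:

```python
import numpy as np

def batch_norm(x, gamma=1.0, beta=0.0, eps=1e-5):
    """Normalise each feature over the batch dimension, then scale and shift."""
    mean = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mean) / np.sqrt(var + eps)  # zero mean, unit variance per feature
    return gamma * x_hat + beta

rng = np.random.default_rng(0)
activations = rng.normal(loc=3.0, scale=5.0, size=(64, 10))  # a batch of layer outputs
normed = batch_norm(activations)
print(normed.mean(), normed.var())  # close to 0 and 1
```

At inference time a real Batch Norm layer uses running estimates of the mean and variance accumulated during training rather than the current batch statistics, which this sketch omits.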


How to Visualize Neural Network Architectures in Python

#artificialintelligence

Often while working with Artificial Neural Networks or variations such as Convolutional Neural Networks or Recurrent Neural Networks, we want to visualize and create a diagrammatic representation of our compiled model. There are a few packages readily available in Python that can create a visual representation of our neural network models. The first three packages can be used even before a model is trained (the model only needs to be defined and compiled); however, TensorBoard requires the user to train the model on actual data before the architecture can be visualized. We don't need to install TensorBoard and Keras Model Plot separately; they come with the installation of TensorFlow and Keras.


"ML-Everything"? Balancing Quantity and Quality in Machine Learning Methods for Science

#artificialintelligence

Recent research in machine learning (ML) has led to significant progress in various fields, including scientific applications. However, limitations remain that need to be addressed to ensure the validity of new models, the quality of testing and validation procedures, and the actual applicability of the developed models to real-world problems. These limitations include evaluations that are unfair, subjective, or unbalanced (not necessarily intentionally so); the use of datasets that don't properly reflect real-world use cases (for example, datasets that are "too easy"); incorrect ways of splitting data into training, validation, and test subsets; and so on. In this article I discuss these points using examples from the domain of biology, which is being revolutionized by ML methodologies. Along the way I also briefly touch on the interpretability of ML models, which today is very limited but very important, because it could help clarify many of the limitations discussed in the first part of the article.
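One concrete way an incorrect split inflates results is when near-duplicate samples leak across a naive random split. The following is a hedged sketch on synthetic data (the duplication structure and 1-nearest-neighbour model are invented for illustration), contrasting a plain random split with a group-aware split that keeps all copies of a sample on one side:

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import train_test_split, GroupShuffleSplit

rng = np.random.default_rng(0)
# 100 underlying samples, each duplicated 5 times with tiny noise
base = rng.normal(size=(100, 5))
labels = rng.integers(0, 2, size=100)
X = np.repeat(base, 5, axis=0) + rng.normal(scale=0.01, size=(500, 5))
y = np.repeat(labels, 5)
groups = np.repeat(np.arange(100), 5)

# Naive random split: near-duplicates end up on both sides
Xtr, Xte, ytr, yte = train_test_split(X, y, random_state=0)
leaky = KNeighborsClassifier(1).fit(Xtr, ytr).score(Xte, yte)

# Group-aware split: every copy of a sample stays on one side
tr, te = next(GroupShuffleSplit(random_state=0).split(X, y, groups))
honest = KNeighborsClassifier(1).fit(X[tr], y[tr]).score(X[te], y[te])
print(leaky, honest)
```

The leaky score comes out near-perfect, because each test point's nearest neighbour is one of its own duplicates in the training set, while the group-aware score hovers around chance, since the labels here carry no real signal. In biology this pattern shows up, for example, when highly similar protein sequences are split at random between train and test.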


How to Write a Scientific Paper from a Data Science Project

#artificialintelligence

All the sections of the Introduction should be balanced: reserve roughly the same number of paragraphs for each of them. Up to now, you have written a draft of the abstract and the Introduction and Related Work sections. You are ready to give your paper a structure. I strongly encourage you to go back to the Introduction and split it into paragraphs; then you could add one section for each paragraph. Remember that while writing the paper, you can add, delete, or modify any section you have already written.


What I Learned from the Best and the Worst Machine Learning Team Leads

#artificialintelligence

While some of us were lucky enough to work only with great team leads, most of us have had both great and terrible experiences. And although terrible leadership can make team members' lives miserable, bitter experiences also forge great team leads out of team members, helping them learn which behaviours to avoid. Technical management of software engineering projects is well established, with multiple tools and techniques, such as Agile, at a team lead's disposal. Machine learning projects, by contrast, where accurately predicting timelines, task outcomes, and task feasibility is challenging, are hard to fit into these paradigms. Navigating projects with high uncertainty at every step requires skills and knowledge that machine learning team leads need to gain through experience.


The 6 Benefits of Interpretable Machine Learning

#artificialintelligence

We seem to be in the golden era of AI. Every week there is a new service that can do anything from creating short stories to original images. These innovations are powered by machine learning. We use powerful computers and vast amounts of data to train these models. The problem is, this process leaves us with a poor understanding of how they actually work.


Convolutional Neural Network for Breast Cancer Classification

#artificialintelligence

Breast cancer is the second most common cancer in women and men worldwide. In 2012, it represented about 12 percent of all new cancer cases and 25 percent of all cancers in women. Breast cancer starts when cells in the breast begin to grow out of control. These cells usually form a tumor that can often be seen on an x-ray or felt as a lump. The tumor is malignant (cancer) if the cells can grow into (invade) surrounding tissues or spread (metastasize) to distant areas of the body.


Is a Small Dataset Risky?. Some reflections and tests on the use…

#artificialintelligence

Recently I wrote an article about the risks of using the train_test_split() function provided by the scikit-learn Python package. That article raised a lot of comments, some positive and others expressing concerns. My thesis was: be careful when you use the train_test_split() function, because different seeds may produce very different models. The main concern raised was that train_test_split() does not behave strangely at all; the problem was that I used a small dataset to demonstrate my thesis. In this article, I investigate how the performance of a Linear Regression model varies with the dataset size.
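The seed-sensitivity at the heart of the argument can be sketched in a few lines. This is a minimal illustration on invented data (the dataset size, noise level, and number of seeds are assumptions, not the article's actual setup): fit a Linear Regression on a deliberately small dataset and watch how the test-set R² swings with the split seed:

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import r2_score

rng = np.random.default_rng(0)
X = rng.normal(size=(30, 1))                         # deliberately small dataset
y = 2.0 * X[:, 0] + rng.normal(scale=2.0, size=30)   # noisy linear relation

scores = []
for seed in range(20):
    Xtr, Xte, ytr, yte = train_test_split(X, y, test_size=0.3, random_state=seed)
    model = LinearRegression().fit(Xtr, ytr)
    scores.append(r2_score(yte, model.predict(Xte)))

print(min(scores), max(scores))  # the spread shows how much the seed matters
```

With only 9 test points per split, the R² spread across seeds is substantial; repeating the loop with a larger dataset shrinks it, which is exactly the dataset-size effect the article sets out to measure.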