AITopics | sthanikamsanthosh

Collaborating Authors

sthanikamsanthosh

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Policy Gradient(Reinforce)using Tensorflow2

#artificialintelligenceFeb-10-2022, 18:15:18 GMT

In this article, we will be discussing what is Policy gradients and how to implement policy gradients using tensorflow2. There are three main points in the policy gradient algorithm. By considering the above three principles, we can implement the policy gradient using TensorFlow. We are dividing our source code into two parts. Policy gradient takes the current state as input and outputs probabilities for all actions.

lunarlander environment, sthanikamsanthosh, tensorflow2, (6 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.57)

Add feedback

Dueling Double Deep Q Learning with Tensorflow

#artificialintelligenceJan-31-2022, 13:25:15 GMT

In this article, we will be going through what is Dueling Double Deep Q Learning and how to implement it in Tenroflow. Dueling Double Deep Q learning is the combination of Dueling Deep Q Learning and Double Deep Q Learning. Let's try to understand what is Dueling Deep Q learning and Double Deep Q Learning. One of the drawbacks of the DQN algorithm is that it overestimates the true rewards; the Q-values think the agent is going to obtain a higher return than what it will obtain in reality. This overestimation is due to the presence of Max of Q value for the next state in the Q learning update equation.

agent, learning, neural network, (16 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback