AITopics | blackjack

d5ab8dc7ef67ca92e41d730982c5c602-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-10-2026, 14:14:12 GMT

agent, explanation, forward simulation, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.31)

Add feedback

Sharpa's ping-pong playing, blackjack dealing humanoid is working overtime at CES 2026

EngadgetJan-9-2026, 15:00:00 GMT

Sharpa's ping-pong playing, blackjack dealing humanoid is working overtime at CES 2026 The company's super dexterous robotic hand, SharpaWave, allows it to pull individual playing cards from a deck. There were no idle hands at Sharpa's CES booth. The company's humanoid may have been the busiest bot at show, autonomously playing ping-pong, dealing blackjack games and taking selfies with passersby. The hand has 22 active degrees of freedom, according to the company, allowing for precise and intricate finger movements. It mirrored my gestures as I wiggled my hand in front of its camera, getting everything mostly right, which was honestly pretty cool.

humanoid, sharpa, term and privacy policy, (9 more...)

Engadget

Technology: Information Technology > Artificial Intelligence > Robots > Manipulation (0.38)

Add feedback

PARL: Prompt-based Agents for Reinforcement Learning

Resendiz, Yarik Menchaca, Klinger, Roman

arXiv.org Artificial IntelligenceOct-27-2025

Large language models (LLMs) have demonstrated high performance on tasks expressed in natural language, particularly in zero- or few-shot settings. These are typically framed as supervised (e.g., classification) or unsupervised (e.g., clustering) problems. However, limited work evaluates LLMs as agents in reinforcement learning (RL) tasks (e.g., playing games), where learning occurs through interaction with an environment and a reward system. While prior work focused on representing tasks that rely on a language representation, we study structured, non-linguistic reasoning - such as interpreting positions in a grid world. We therefore introduce PARL (Prompt-based Agent for Reinforcement Learning), a method that uses LLMs as RL agents through prompting, without any fine-tuning. PARL encodes actions, states, and rewards in the prompt, enabling the model to learn through trial-and-error interaction. We evaluate PARL on three standard RL tasks that do not entirely rely on natural language. We show that it can match or outperform traditional RL agents in simple environments by leveraging pretrained knowledge. However, we identify performance limitations in tasks that require complex mathematical operations or decoding states and actions.

large language model, machine learning, reinforcement learning, (15 more...)

arXiv.org Artificial Intelligence

2510.21306

Country:

Europe (1.00)
Asia (0.68)
North America > United States > Minnesota (0.28)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

d5ab8dc7ef67ca92e41d730982c5c602-AuthorFeedback.pdf

Neural Information Processing SystemsAug-16-2025, 15:30:35 GMT

agent, explanation, forward simulation, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.31)

Add feedback

The House Always Wins: A Framework for Evaluating Strategic Deception in LLMs

Chopra, Tanush, Li, Michael

arXiv.org Artificial IntelligenceJul-1-2024

We propose a framework for evaluating strategic deception in large language models (LLMs). In this framework, an LLM acts as a game master in two scenarios: one with random game mechanics and another where it can choose between random or deliberate actions. As an example, we use blackjack because the action space nor strategies involve deception. We benchmark Llama3-70B, GPT-4-Turbo, and Mixtral in blackjack, comparing outcomes against expected distributions in fair play to determine if LLMs develop strategies favoring the "house." Our findings reveal that the LLMs exhibit significant deviations from fair play when given implicit randomness instructions, suggesting a tendency towards strategic manipulation in ambiguous scenarios. However, when presented with an explicit choice, the LLMs largely adhere to fair play, indicating that the framing of instructions plays a crucial role in eliciting or mitigating potentially deceptive behaviors in AI systems.

deception, scenario, strategic deception, (16 more...)

arXiv.org Artificial Intelligence

2407.00948

Genre: Research Report > New Finding (0.67)

Industry: Leisure & Entertainment > Games (0.92)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.61)

Add feedback

Variations on the Reinforcement Learning performance of Blackjack

Buramdoyal, Avish, Gebbie, Tim

arXiv.org Artificial IntelligenceAug-9-2023

Blackjack or "21" is a popular card-based game of chance and skill. The objective of the game is to win by obtaining a hand total higher than the dealer's without exceeding 21. The ideal blackjack strategy will maximize financial return in the long run while avoiding gambler's ruin. The stochastic environment and inherent reward structure of blackjack presents an appealing problem to better understand reinforcement learning agents in the presence of environment variations. Here we consider a q-learning solution for optimal play and investigate the rate of learning convergence of the algorithm as a function of deck size. A blackjack simulator allowing for universal blackjack rules is also implemented to demonstrate the extent to which a card counter perfectly using the basic strategy and hi-lo system can bring the house to bankruptcy and how environment variations impact this outcome. The novelty of our work is to place this conceptual understanding of the impact of deck size in the context of learning agent convergence.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

2308.07329

Country:

North America > United States > Nevada > Clark County > Las Vegas (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
Africa > South Africa > Western Cape > Cape Town (0.04)
(3 more...)

Genre: Research Report (0.50)

Industry: Leisure & Entertainment > Games (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Python Reinforcement Learning using OpenAI Gymnasium – Full Course

#artificialintelligenceMar-22-2023, 05:10:06 GMT

Learn the basics of reinforcement learning and how to implement it using Gymnasium (previously called OpenAI Gym). Gymnasium is an open source Python library originally created by OpenAI that provides a collection of pre-built environments for reinforcement learning agents. It provides a standard API to communicate between learning algorithms and environments, as well as a standard set of environments compliant with that API. Reinforcement learning is an area of machine learning concerned with how intelligent agents ought to take actions in an environment in order to maximize the notion of cumulative reward.

blackjack, python reinforcement learning, reinforcement learning, (4 more...)

#artificialintelligence

Genre: Instructional Material (0.41)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.95)

Add feedback

Win at Blackjack with Reinforcement Learning

#artificialintelligenceFeb-2-2023, 05:20:24 GMT

As a popular casino card game, many have studied Blackjack closely in order to devise strategies for improving their likelihood of winning. In this project, we will use Reinforcement Learning to find the best playing strategy for Blackjack. We will use Monte Carlo Reinforcement learning algorithms to do it; you will see how Reinforcement Learning can determine the optimum Blackjack strategy in just a few minutes. You will quickly grasp important concepts of Reinforcement learning and apply open AI's gym, the go-to framework for Reinforcement learning. To see all of the detailed explanations for the mentioned concepts and analyze/experiment with the code for this blog. You can also take a lot of FREE courses and projects about data science or any other technology topics from Cognitive Class.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

#artificialintelligence

Industry: Leisure & Entertainment > Games (0.89)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Blackjack: A game model for applying AI to cybersecurity

#artificialintelligenceApr-1-2021, 23:15:04 GMT

Cyber-attacks continue to threaten organizations large and small. The impacts of a data breach or ransomware attack may have significant and material impacts on both customers and shareholders. To help combat cyber threats, some organizations have started exploring how big data and artificial intelligence (AI) may help to reduce cybersecurity risk. Machine learning algorithms are now common in cybersecurity. We find machine learning offered in more commercial products, from those that are fully integrated into products and require no knowledge of machine learning to those that require rolling up your sleeves to put together the algorithms and perform statistical analysis. Machine learning for cybersecurity has most frequently been applied to detecting patterns that represent attacks. This includes algorithms that evaluate audit log data, that spot anomalies for network intrusion detection systems, and that identify and block malware on computer systems. In some applications, machine learning is used to train models of normal activity on networks in hope of later detecting anomalous events that may represent a cyber-attack.

algorithm, cybersecurity, security manager, (17 more...)

#artificialintelligence

Country:

North America > United States > Maryland > Montgomery County > Gaithersburg (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military > Cyberwarfare (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Blackjack: A game model for applying AI to cybersecurity

#artificialintelligenceApr-1-2021, 18:01:43 GMT

Cyber-attacks continue to threaten organizations large and small. The impacts of a data breach or ransomware attack may have significant and material impacts on both customers and shareholders. To help combat cyber threats, some organizations have started exploring how big data and artificial intelligence (AI) may help to reduce cybersecurity risk. Machine learning algorithms are now common in cybersecurity. We find machine learning offered in more commercial products, from those that are fully integrated into products and require no knowledge of machine learning to those that require rolling up your sleeves to put together the algorithms and perform statistical analysis. Machine learning for cybersecurity has most frequently been applied to detecting patterns that represent attacks. This includes algorithms that evaluate audit log data, that spot anomalies for network intrusion detection systems, and that identify and block malware on computer systems. In some applications, machine learning is used to train models of normal activity on networks in hope of later detecting anomalous events that may represent a cyber-attack.

algorithm, cybersecurity, security manager, (17 more...)

#artificialintelligence

Country: