Goto

Collaborating Authors

 mahmood


Massive overhaul of England and Wales policing announced

BBC News

The home secretary has announced a blueprint for reforming what she called the broken policing model in England and Wales. Shabana Mahmood confirmed the shake-up will create a new National Police Service (NPS) to fight the most complex cross-border crime and could also see the number of local forces in England and Wales cut by around two-thirds. She told the House of Commons she also intends to make better use of technology - including the largest-ever rollout of facial recognition. This government's reforms will ensure we have the right policing in the right place, Mahmood said. I set out reforms that are long overdue and define a new model for policing in this country, with local policing that protects our communities and national policing that protects us all.


Police admit overstating Maccabi fan ban evidence

BBC News

West Midlands Police has admitted it overstated the evidence used to make the decision to ban Israeli fans from a match in Birmingham. Craig Guildford, its former chief constable, retired earlier this month after damning criticism of the ban on Maccabi Tel Aviv fans from the Europa League match against Aston Villa, last November. In newly released documents, the force also said we did not engage early enough with the local Jewish community, and indicated there was now a ban on AI use after its evidence included a match that did not take place. Furthermore, it said its operations would have lasted four days, involved multiple forces, and cost more than £5m, if 2,500 away fans had attended. The documents were released ahead of a public meeting on Tuesday, at which Police and Crime Commissioner for the West Midlands, Simon Foster, will discuss at his accountability and governance board, the decision to ban the Maccabi fans.


Mahmood has no confidence in police chief after Israeli fan ban

BBC News

Home Secretary Shabana Mahmood says she has lost confidence in West Midlands Police's chief constable after Israeli football fans were banned from a match against Aston Villa. Mahmood told MPs a damning review from the policing watchdog over the intelligence that led to Maccabi Tel Aviv fans being banned showed a failure of leadership. The force has apologised saying it did not deliberately distort evidence that was used by Birmingham's Safety Advisory Group for the 6 November game . Chief Constable Craig Guildford remains in post, but faces a meeting on 27 January to be questioned by Police and Crime Commissioner Simon Foster who has the authority to sack him. Mahmood told the Commons on Wednesday she intended to restore the power for home secretaries to dismiss chief constables who fail their communities.


UK to deport 60 delivery riders after illegal work crackdown

BBC News

The government says it is to deport 60 takeaway-delivery riders found to be working illegally in the UK. The Home Office says the group are among 171 riders arrested over seven days in November in a national enforcement blitz in villages, towns and cities across the country. It comes as Home Secretary Shabana Mahmood has been targeting people working unlawfully in the gig economy. Border Security Minister Alex Norris has also met representatives from food-delivery firms to encourage them to do more to tackle the issue - such as using facial recognition checks to prevent riders sharing their identities with people who do not have permission to take up work in the UK. Norris said November's action ought to send a clear message: if you are working illegally in this country, you will be arrested and removed.


UK's sweeping asylum law changes: How will they impact refugees?

Al Jazeera

UK's sweeping asylum law changes: How will they impact refugees? Shabana Mahmood, the United Kingdom's home secretary, has said the country's asylum system is "not working" and is placing "intense strain on communities" ahead of proposals for major government reforms that would end refugees' automatic right to settle permanently in the UK. Speaking to the BBC on Sunday, Mahmood said undocumented migration is "tearing the country apart". First, they would end the automatic path to settled status for refugees after five years. And second, they would remove state benefits from those who have the right to work and can support themselves.


Migrants will need A-level standard English to work in UK

BBC News

Some migrants coming to the UK will need to speak English to an A-level standard under tougher new rules set to be introduced by the government. Applicants will be tested in person on their speaking, listening, reading and writing at Home Office-approved providers, with their results checked as part of the visa process. The changes, which come into force from 8 January 2026, form part of wider plans to cut levels of immigration to the UK outlined in a white paper in May. Home Secretary Shabana Mahmood said: If you come to this country, you must learn our language and play your part. Those applying for skilled worker, scale-up and high potential individual (HPI) visas will be required to reach B2 level - a step up from the current B1 standard which is equivalent to GCSE.


Busting the Paper Ballot: Voting Meets Adversarial Machine Learning

Mahmood, Kaleel, Manicke, Caleb, Rathbun, Ethan, Verma, Aayushi, Ahmad, Sohaib, Stamatakis, Nicholas, Michel, Laurent, Fuller, Benjamin

arXiv.org Artificial Intelligence

We show the security risk associated with using machine learning classifiers in United States election tabulators. The central classification task in election tabulation is deciding whether a mark does or does not appear on a bubble associated to an alternative in a contest on the ballot. Barretto et al. (E-Vote-ID 2021) reported that convolutional neural networks are a viable option in this field, as they outperform simple feature-based classifiers. Our contributions to election security can be divided into four parts. To demonstrate and analyze the hypothetical vulnerability of machine learning models on election tabulators, we first introduce four new ballot datasets. Second, we train and test a variety of different models on our new datasets. These models include support vector machines, convolutional neural networks (a basic CNN, VGG and ResNet), and vision transformers (Twins and CaiT). Third, using our new datasets and trained models, we demonstrate that traditional white box attacks are ineffective in the voting domain due to gradient masking. Our analyses further reveal that gradient masking is a product of numerical instability. We use a modified difference of logits ratio loss to overcome this issue (Croce and Hein, ICML 2020). Fourth, in the physical world, we conduct attacks with the adversarial examples generated using our new methods. In traditional adversarial machine learning, a high (50% or greater) attack success rate is ideal. However, for certain elections, even a 5% attack success rate can flip the outcome of a race. We show such an impact is possible in the physical domain. We thoroughly discuss attack realism, and the challenges and practicality associated with printing and scanning ballot adversarial examples.


Synchronous vs Asynchronous Reinforcement Learning in a Real World Robot

Parsaee, Ali, Shahriar, Fahim, He, Chuxin, Tan, Ruiqing

arXiv.org Artificial Intelligence

In recent times, reinforcement learning (RL) with physical robots has attracted the attention of a wide range of researchers. However, state-of-the-art RL algorithms do not consider that physical environments do not wait for the RL agent to make decisions or updates. RL agents learn by periodically conducting computationally expensive gradient updates. When decision-making and gradient update tasks are carried out sequentially by the RL agent in a physical robot, it significantly increases the agent's response time. In a rapidly changing environment, this increased response time may be detrimental to the performance of the learning agent. Asynchronous RL methods, which separate the computation of decision-making and gradient updates, are a potential solution to this problem. However, only a few comparisons between asynchronous and synchronous RL have been made with physical robots. For this reason, the exact performance benefits of using asynchronous RL methods over synchronous RL methods are still unclear. In this study, we provide a performance comparison between asynchronous and synchronous RL using a physical robotic arm called Franka Emika Panda. Our experiments show that the agents learn faster and attain significantly more returns using asynchronous RL. Our experiments also demonstrate that the learning agent with a faster response time performs better than the agent with a slower response time, even if the agent with a slower response time performs a higher number of gradient updates.


Streaming Deep Reinforcement Learning Finally Works

Elsayed, Mohamed, Vasan, Gautham, Mahmood, A. Rupam

arXiv.org Artificial Intelligence

Natural intelligence processes experience as a continuous stream, sensing, acting, and learning moment-by-moment in real time. Streaming learning, the modus operandi of classic reinforcement learning (RL) algorithms like Q-learning and TD, mimics natural learning by using the most recent sample without storing it. This approach is also ideal for resource-constrained, communication-limited, and privacy-sensitive applications. However, in deep RL, learners almost always use batch updates and replay buffers, making them computationally expensive and incompatible with streaming learning. Although the prevalence of batch deep RL is often attributed to its sample efficiency, a more critical reason for the absence of streaming deep RL is its frequent instability and failure to learn, which we refer to as stream barrier. This paper introduces the stream-x algorithms, the first class of deep RL algorithms to overcome stream barrier for both prediction and control and match sample efficiency of batch RL. Through experiments in Mujoco Gym, DM Control Suite, and Atari Games, we demonstrate stream barrier in existing algorithms and successful stable learning with our stream-x algorithms: stream Q, stream AC, and stream TD, achieving the best model-free performance in DM Control Dog environments. A set of common techniques underlies the stream-x algorithms, enabling their success with a single set of hyperparameters and allowing for easy extension to other algorithms, thereby reviving streaming RL.


Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning

Vasan, Gautham, Wang, Yan, Shahriar, Fahim, Bergstra, James, Jagersand, Martin, Mahmood, A. Rupam

arXiv.org Artificial Intelligence

Many real-world robot learning problems, such as pick-and-place or arriving at a destination, can be seen as a problem of reaching a goal state as soon as possible. These problems, when formulated as episodic reinforcement learning tasks, can easily be specified to align well with our intended goal: 1 reward every time step with termination upon reaching the goal state, called minimum-time tasks. Despite this simplicity, such formulations are often overlooked in favor of dense rewards due to their perceived difficulty and lack of informativeness. Our studies contrast the two reward paradigms, revealing that the minimum-time task specification not only facilitates learning higher-quality policies but can also surpass dense-reward-based policies on their own performance metrics. Crucially, we also identify the goal-hit rate of the initial policy as a robust early indicator for learning success in such sparse feedback settings. Finally, using four distinct real-robotic platforms, we show that it is possible to learn pixel-based policies from scratch within two to three hours using constant negative rewards. Our video demo can be found here.