whitelist
Testing Language Model Agents Safely in the Wild
Naihin, Silen, Atkinson, David, Green, Marc, Hamadi, Merwane, Swift, Craig, Schonholtz, Douglas, Kalai, Adam Tauman, Bau, David
A prerequisite for safe autonomy-in-the-wild is safe testing-in-the-wild. Yet real-world autonomous tests face several unique safety challenges, both due to the possibility of causing harm during a test, as well as the risk of encountering new unsafe agent behavior through interactions with real-world and potentially malicious actors. We propose a framework for conducting safe autonomous agent tests on the open internet: agent actions are audited by a context-sensitive monitor that enforces a stringent safety boundary to stop an unsafe test, with suspect behavior ranked and logged to be examined by humans. We design a basic safety monitor (AgentMonitor) that is flexible enough to monitor existing LLM agents, and, using an adversarial simulated agent, we measure its ability to identify and stop unsafe situations. Then we apply the AgentMonitor on a battery of real-world tests of AutoGPT, and we identify several limitations and challenges that will face the creation of safe in-the-wild tests as autonomous agents grow more capable.
- North America > Canada > Ontario > Toronto (0.04)
- Asia > Middle East > Jordan (0.04)
A Hybrid Approach for Smart Alert Generation
Zhao, Yao, Zhang, Sophine, Yao, Zhiyuan
Anomaly detection is an important task in network management. However, deploying intelligent alert systems in real-world large-scale networking systems is challenging when we take into account (i) scalability, (ii) data heterogeneity, and (iii) generalizability and maintainability. In this paper, we propose a hybrid model for an alert system that combines statistical models with a whitelist mechanism to tackle these challenges and reduce false positive alerts. The statistical models take advantage of a large database to detect anomalies in time-series data, while the whitelist filters out persistently alerted nodes to further reduce false positives. Our model is validated using qualitative data from customer support cases. Future work includes more feature engineering and input data, as well as including human feedback in the model development process.
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > California > San Francisco County > San Francisco (0.04)
- North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
- (2 more...)
NFTs are 'Stayin' Alive' as new minting trends, AI and music-based projects thrive
In 2020, newly launched NFTs projects experienced costly gas wars that are gas priority auctions where buyers battle to secure their spot on the next block, potentially losing Ether (ETH) to failed transactions. In 2021, digital scarcity and utility drove the NFT hype in the constantly surging markets, and toward the end of the year, how much attention any collection received seemed to be at the mercy of influencers' opinions. It's frustrating especially when tons of good projects with actual value and hardworking teams that don't just disappear on their community are still so undervalued. Influencer culture invading the NFT space has had a horrible impact for the community. Transitions have slowly emerged and driven new entrants in with new sets of values that not only impact how projects are minted, but what is minted. In 2022, it seems the NFT ecosystem will emphasize "strong communities," and exclusive collector utility.
An N-gram based approach to auto-extracting topics from research articles
Zhu, Linkai, Huang, Maoyi, Chen, Maomao, Wang, Wennan
A lot of manual work goes into identifying a topic for an article. With a large volume of articles, the manual process can be exhausting. Our approach aims to address this issue by automatically extracting topics from the text of large Numbers of articles. This approach takes into account the efficiency of the process. Based on existing N-gram analysis, our research examines how often certain words appear in documents in order to support automatic topic extraction. In order to improve efficiency, we apply custom filtering standards to our research. Additionally, delete as many noncritical or irrelevant phrases as possible. In this way, we can ensure we are selecting unique keyphrases for each article, which capture its core idea. For our research, we chose to center on the autonomous vehicle domain, since the research is relevant to our daily lives. We have to convert the PDF versions of most of the research papers into editable types of files such as TXT. This is because most of the research papers are only in PDF format. To test our proposed idea of automating, numerous articles on robotics have been selected. Next, we evaluate our approach by comparing the result with that obtained manually.
- Asia > Macao (0.14)
- North America > Canada > Ontario > National Capital Region > Ottawa (0.14)
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.05)
- (3 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.69)
- Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.50)
- Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.47)
AI-powered tool aims to help reduce bias and racially charged language on websites
Website accessibility tech provider UserWay has released an AI-powered tool designed to help organizations ensure their websites are free from discriminatory, biased, and racially charged language. The tool, Content Moderator, flags content for review, and nothing is deleted or removed without approval from site administrators, according to UserWay. UserWay's customers are using its AI-powered accessibility widget, an advanced AI-based compliance-as-a-service (CaaS) technology that ensures brands provide an accessible digital experience that meets strict governmental and ADA regulations, the company said. "Focusing on digital racism and bias is long past due, and our team is eager to contribute to the conversation positively," UserWay founder and CEO Allon Mason said in a statement. In June, Google announced that it would be reevaluating what it considers acceptable language, Mason noted. So far, Google has changed terms including "blacklist" to "blocked list," "whitelist" to "allowed list," and "master-slave" to "primary/secondary," among others, he said.
- North America > United States (0.05)
- Asia > Middle East > Israel > Jerusalem District > Jerusalem (0.05)
Method and Dataset Mining in Scientific Papers
Yao, Rujing, Hou, Linlin, Ye, Yingchun, Wu, Ou, Zhang, Ji, Wu, Jian
Literature analysis facilitates researchers better understanding the development of science and technology. The conventional literature analysis focuses on the topics, authors, abstracts, keywords, references, etc., and rarely pays attention to the content of papers. In the field of machine learning, the involved methods (M) and datasets (D) are key information in papers. The extraction and mining of M and D are useful for discipline analysis and algorithm recommendation. In this paper, we propose a novel entity recognition model, called MDER, and constructe datasets from the papers of the PAKDD conferences (2009-2019). Some preliminary experiments are conducted to assess the extraction performance and the mining results are visualized.
- Asia > China > Tianjin Province > Tianjin (0.05)
- North America > United States (0.04)
- Asia > China > Zhejiang Province > Hangzhou (0.04)
Mindsync.ai - ML/DS competitions to help solve business tasks and a marketplace for AI
First and foremost if you have not signed up for our whitelist yet, click on the "Join Whitelist" button on the top of the page. After that, you should fill out the "Join the Whitelist" form. Once you submit the form, you will receive a confirmation e-mail. Please follow the instructions in the e-mail. Mindsync tokens will be available for purchase with Ether (ETH) from your ETH wallet.
Why security teams should treat machine learning like a junior employee
Machine learning is critical to the future of cybersecurity and helping security teams overcome the challenges of modern cybersecurity attacks. Indeed, its ability to'outthink' humans can boost return on investment (ROI), drastically improve productivity and minimise resource expenditure. However, machine learning is also not just a'set and forget' solution. In fact, companies need to treat machine like an intern on their first day. Security teams should not assume a machine learning programme can hit the ground running – there needs to be an onboarding process where you check in on the models frequently and spend time getting them started in the right direction.
- Information Technology > Security & Privacy (0.94)
- Commercial Services & Supplies > Security & Alarm Services (0.87)
- Government > Military > Cyberwarfare (0.76)
Patented Technology Behind Thought Network Changes Data Processing As We Know It
For thousands of years, humans have recognized patterns, gathered and analyzed data. Identifying patterns and using this information has been key to our evolution, as well as playing an important role in our pastime activities. Whether it is making a difference between animals that want to kill us and those who don't, categorizing plants based on their edibility, seeing patterns in the stars, or creating algorithms that know exactly which cat video we want to see next on Youtube. Although data and pattern recognition have been around forever, the way we use, store and spread this information has changed drastically. While at the beginning we had to rely on word-of-mouth and cave drawings to spread knowledge, nowadays we can store and spread trillions of gigabytes of information without much effort.