The Road Ahead for Speech Recognition Technology

#artificialintelligence

Speech recognition technology has had its place in the enterprise tech stack for years, but the onset of COVID-19 has proven its worth even further. Our recent annual Trends and Predictions for Voice Technology in 2021 report found that 2020 saw a marked increase in voice technology adoption among enterprises, with 68% of respondents reporting their company has a voice technology strategy, an increase of 18% since last year. There are a number of reasons for this: it can increase efficiencies across organizations, give them better access to data from conversations, and even abide by our contact-free wishes during the pandemic. Given that the number of organizations adopting speech technology is set to increase as its capabilities grow, providers need to focus their attention on the barriers to adoption and ensure that user concerns are addressed. Only then will the technology's true value be recognized.


Conjectures, Tests and Proofs: An Overview of Theory Exploration

arXiv.org Artificial Intelligence

A key component of mathematical reasoning is the ability to formulate interesting conjectures about a problem domain at hand. In this paper, we give a brief overview of a theory exploration system called QuickSpec, which is able to automatically discover interesting conjectures about a given set of functions. QuickSpec works by interleaving term generation with random testing to form candidate conjectures. This is made tractable by starting from small sizes and ensuring that only terms that are irreducible with respect to already discovered conjectures are considered. QuickSpec has been successfully applied to generate lemmas for automated inductive theorem proving as well as to generate specifications of functional programs. We give an overview of typical use-cases of QuickSpec, as well as demonstrating how to easily connect it to a theorem prover of the user's choice.
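The loop described in the abstract, generating terms and then using random testing to propose equations, can be illustrated with a toy sketch. This is not QuickSpec (a Haskell tool) and does not use its API. The signature (`rev`, `app`, `nil`), the tuple encoding of terms, and all function names are assumptions made for illustration, and the pruning of terms that are reducible by already discovered conjectures is left out for brevity.

```python
import random

# Hypothetical signature for illustration: functions over Python lists.
UNARY = {"rev": lambda xs: xs[::-1]}
BINARY = {"app": lambda xs, ys: xs + ys}
VARIABLES = ["xs", "ys"]
CONSTANTS = {"nil": []}


def terms_of_size(size, cache):
    """Return all terms with exactly `size` symbols, encoded as nested tuples."""
    if size in cache:
        return cache[size]
    if size == 1:
        result = [("var", v) for v in VARIABLES] + [("const", c) for c in CONSTANTS]
    else:
        result = []
        for name in UNARY:
            for arg in terms_of_size(size - 1, cache):
                result.append((name, arg))
        for name in BINARY:
            for left_size in range(1, size - 1):
                right_size = size - 1 - left_size
                for left in terms_of_size(left_size, cache):
                    for right in terms_of_size(right_size, cache):
                        result.append((name, left, right))
    cache[size] = result
    return result


def evaluate(term, env):
    """Evaluate a term under an assignment of lists to variables."""
    head = term[0]
    if head == "var":
        return env[term[1]]
    if head == "const":
        return CONSTANTS[term[1]]
    if head in UNARY:
        return UNARY[head](evaluate(term[1], env))
    return BINARY[head](evaluate(term[1], env), evaluate(term[2], env))


def show(term):
    """Render a term as a readable string."""
    if term[0] in ("var", "const"):
        return term[1]
    return f"{term[0]}({', '.join(show(t) for t in term[1:])})"


def candidate_conjectures(max_size=3, num_tests=50, seed=0):
    """Group terms that agree on every random test; each group yields equations."""
    rng = random.Random(seed)
    envs = [
        {v: [rng.randint(0, 9) for _ in range(rng.randint(0, 4))] for v in VARIABLES}
        for _ in range(num_tests)
    ]
    cache = {}
    classes = {}  # test-result fingerprint -> terms, smallest first
    for size in range(1, max_size + 1):  # start from small sizes
        for term in terms_of_size(size, cache):
            fingerprint = tuple(tuple(evaluate(term, env)) for env in envs)
            classes.setdefault(fingerprint, []).append(term)
    equations = []
    for members in classes.values():
        representative = members[0]  # smallest term in the class
        for other in members[1:]:
            equations.append((show(other), show(representative)))
    return equations


if __name__ == "__main__":
    for lhs, rhs in candidate_conjectures():
        print(f"{lhs} = {rhs}")
```

The equations are only conjectures: they survived random testing but are unproven, which is why the abstract pairs the approach with a theorem prover.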


Top 100 Artificial Intelligence Startups to Lookout for in 2021

#artificialintelligence

Sooner or later, digitization will take over all repetitive tasks. Today, with the help of big data, advanced technologies like automation, artificial intelligence, IoT, and machine learning are drawing on unimaginable amounts and types of information. They are streamlining tedious, repetitive, and difficult tasks, which tend to slow down production and increase the cost of operation. Owing to the evolution of technology, artificial intelligence startups are mushrooming like never before. These companies are driving the world into a new phase of digitization with a mixture of disruptive statistical methods, computational intelligence, soft computing, and traditional symbolic AI. Artificial intelligence combines two concepts, namely science and engineering. With the infusion of disruptive trends and human intelligence, intelligent machines and intelligent computing programs are emerging. Slowly, the flare of innovation moved beyond IT and entered diverse industries including healthcare, education, finance, marketing, business, and telecommunication. Organizations realized that by digitizing repetitive tasks, an enterprise can cut the cost of paperwork and labor, which in turn eliminates human error and boosts efficiency. Automating processes involves employing artificial intelligence solutions that can support digitization and deliver data-driven insights. Artificial intelligence startups have emerged as ready-made solution providers that support every company's individual needs. AI startups in 2021 use big data to build sophisticated AI models and deliver new solutions that could better serve customers. Analytics Insight has listed the top 100 artificial intelligence startups that are driving next-generation development in technology. It democratizes the way investments are done by bringing sophisticated elite trading technology to laymen.
Accrad is a health tech company that helps radiologists reduce their workload with the precision of artificial intelligence. Radiologists work under varying circumstances and deadlines and may find diagnosis from x-rays difficult. Accrad has therefore come up with a solution to help with accurate and fast image diagnosis. The company has made x-ray processing simpler and more convincing. Its signature product, CheXRad, a deep learning algorithm that identifies locations in the chest radiograph, can predict 15 different diseases including Covid-19. Affable.ai is a data-driven influencer marketing platform where customers can find relevant and authentic influencers and manage marketing operations. By using cutting-edge computer vision algorithms on social media posts, the company delivers actionable insights about micro-influencers and their audience. Similar to how Google has refined its search to promote relevant ads to users, Affable.ai has built one-click marketing at a smaller scale.


Representation Learning for Efficient and Effective Similarity Search and Recommendation

arXiv.org Artificial Intelligence

How data is represented and operationalized is critical for building computational solutions that are both effective and efficient. A common approach is to represent data objects as binary vectors, denoted hash codes, which require little storage and enable efficient similarity search through direct indexing into a hash table or through similarity computations in an appropriate space. Due to the limited expressibility of hash codes, compared to real-valued representations, a core open challenge is how to generate hash codes that well capture semantic content or latent properties using a small number of bits, while ensuring that the hash codes are distributed in a way that does not reduce their search efficiency. State of the art methods use representation learning for generating such hash codes, focusing on neural autoencoder architectures where semantics are encoded into the hash codes by learning to reconstruct the original inputs of the hash codes. This thesis addresses the above challenge and makes a number of contributions to representation learning that (i) improve effectiveness of hash codes through more expressive representations and a more effective similarity measure than the current state of the art, namely the Hamming distance, and (ii) improve efficiency of hash codes by learning representations that are especially suited to the choice of search method. The contributions are empirically validated on several tasks related to similarity search and recommendation.
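The two search modes mentioned in the abstract, Hamming-distance comparison and direct indexing into a hash table, can be sketched as follows. This is a minimal illustration and not the thesis's method: hash codes are assumed to be stored as Python integers, and the function names, the item names, and the 4-bit codes in the usage example are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def hamming_distance(a, b):
    """Count the bit positions in which two hash codes differ."""
    return bin(a ^ b).count("1")


def build_table(items):
    """Index item names by their hash code for direct lookup."""
    table = defaultdict(list)
    for name, code in items.items():
        table[code].append(name)
    return table


def lookup_within_radius(table, query, num_bits, radius):
    """Probe every code within `radius` bit flips of the query code."""
    found = []
    for r in range(radius + 1):
        for positions in combinations(range(num_bits), r):
            probe = query
            for position in positions:
                probe ^= 1 << position  # flip one bit of the query
            found.extend(table.get(probe, []))
    return found


if __name__ == "__main__":
    # Hypothetical 4-bit codes for three items.
    items = {"a": 0b1010, "b": 0b1011, "c": 0b0101}
    table = build_table(items)
    print(hamming_distance(items["a"], items["c"]))
    print(lookup_within_radius(table, 0b1010, num_bits=4, radius=1))
```

The number of probes grows combinatorially with the radius, which is one reason the distribution of codes across the table matters for search efficiency.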


Will bots take over the supply chain? Revisiting Agent-based supply chain automation

arXiv.org Artificial Intelligence

Agent-based systems have the capability to fuse information from many distributed sources and create better plans faster. This feature makes agent-based systems naturally suitable to address the challenges in Supply Chain Management (SCM). Although agent-based supply chain systems have been proposed since the early 2000s, their industrial uptake has been lagging. The reasons quoted include the immaturity of the technology, a lack of interoperability with supply chain information systems, and a lack of trust in Artificial Intelligence (AI). In this paper, we revisit the agent-based supply chain and review the state of the art. We find that agent-based technology has matured, and that other supporting technologies penetrating supply chains are filling in gaps, leaving the concept applicable to a wider range of functions. For example, the ubiquity of IoT technology helps agents "sense" the state of affairs in a supply chain and opens up new possibilities for automation. Digital ledgers help securely transfer data between third parties, making agent-based information sharing possible without the need to integrate Enterprise Resource Planning (ERP) systems. Learning functionality enables agents to move beyond automation and towards autonomy. We note this convergence effect through conceptualising an agent-based supply chain framework, reviewing its components, and highlighting research challenges that need to be addressed in moving forward.


The Impact of Algorithmic Risk Assessments on Human Predictions and its Analysis via Crowdsourcing Studies

arXiv.org Artificial Intelligence

As algorithmic risk assessment instruments (RAIs) are increasingly adopted to assist decision makers, their predictive performance and potential to promote inequity have come under scrutiny. However, while most studies examine these tools in isolation, researchers have come to recognize that assessing their impact requires understanding the behavior of their human interactants. In this paper, building off of several recent crowdsourcing works focused on criminal justice, we conduct a vignette study in which laypersons are tasked with predicting future re-arrests. Our key findings are as follows: (1) Participants often predict that an offender will be rearrested even when they deem the likelihood of re-arrest to be well below 50%; (2) Participants do not anchor on the RAI's predictions; (3) The time spent on the survey varies widely across participants and most cases are assessed in less than 10 seconds; (4) Judicial decisions, unlike participants' predictions, depend in part on factors that are orthogonal to the likelihood of re-arrest. These results highlight the influence of several crucial but often overlooked design decisions and concerns around generalizability when constructing crowdsourcing studies to analyze the impacts of RAIs.


An Exploratory Study on Utilising the Web of Linked Data for Product Data Mining

arXiv.org Artificial Intelligence

The Linked Open Data practice has led to a significant growth of structured data on the Web in the last decade. Such structured data describe real-world entities in a machine-readable way, and have created an unprecedented opportunity for research in the field of Natural Language Processing. However, there is a lack of studies on how such data can be used, for what kind of tasks, and to what extent they can be useful for these tasks. This work focuses on the e-commerce domain to explore methods of utilising such structured data to create language resources that may be used for product classification and linking. We process billions of structured data points in the form of RDF n-quads to create multi-million-word product-related corpora that are later used in three different ways for creating language resources: training word embedding models, continued pre-training of BERT-like language models, and training Machine Translation models that are used as a proxy to generate product-related keywords. Our evaluation on an extensive set of benchmarks shows word embeddings to be the most reliable and consistent method to improve the accuracy on both tasks (with up to 6.9 percentage points in macro-average F1 on some datasets). The other two methods, however, are not as useful. Our analysis shows that this could be due to a number of reasons, including the biased domain representation in the structured data and lack of vocabulary coverage. We share our datasets and discuss how our lessons learned could be taken forward to inform future research in this direction.
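The reported gains are in macro-average F1, which averages per-class F1 scores with equal weight regardless of class frequency. A minimal sketch of that metric follows. The function name and the toy labels are hypothetical, and taking the label set as the union of true and predicted labels is an assumed convention, not something the abstract specifies.

```python
def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all labels seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denominator = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never appears.
        scores.append(2 * tp / denominator if denominator else 0.0)
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Toy labels: class "a" scores 2/3, class "b" scores 4/5.
    y_true = ["a", "a", "b", "b"]
    y_pred = ["a", "b", "b", "b"]
    print(round(100 * macro_f1(y_true, y_pred), 1))
```

A difference of 6.9 percentage points on this scale means, for example, moving from 73.3 to 80.2, not a 6.9% relative change.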


Hot papers on arXiv from the past month: August 2021

AIHub

Reproduced under a CC BY 4.0 license. Here are the most tweeted papers that were uploaded onto arXiv during August 2021. Results are powered by Arxiv Sanity Preserver.

How to avoid machine learning pitfalls: a guide for academic researchers
Michael A. Lones
Submitted to arXiv on: 5 August 2021
Abstract: This document gives a concise outline of some of the common mistakes that occur when using machine learning techniques, and what can be done to avoid them. It is intended primarily as a guide for research students, and focuses on issues that are of particular concern within academic research, such as the need to do rigorous comparisons and reach valid conclusions.


A Visual Guide to Low-Resource NLP

#artificialintelligence

Deep neural networks are becoming omnipresent in natural language processing (NLP) applications. However, they require large amounts of labeled training data, which is often only available for English. This is a big challenge for many languages and domains where labeled data is limited. In recent years, a variety of methods have been proposed to tackle this situation. This article gives an overview of approaches that help you train NLP models in resource-lean scenarios.


Study on artificial intelligence: The state of the art and future prospects

#artificialintelligence

Worldwide, the technological and industrial revolution is being accelerated by the widespread application of new-generation information and communication technologies, such as AI, IoT (the Internet of Things), and blockchain technology. Artificial intelligence has attracted much attention from government, industry, and academia. In this study, popular articles published in recent years that relate to artificial intelligence are selected and explored. This study aims to provide a review of artificial intelligence based on industry information integration. It presents an overview of the scope of artificial intelligence using background, drivers, technologies, and applications, as well as logical opinions regarding the development of artificial intelligence.