AITopics | mturk

Collaborating Authors

mturk

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Quality Assured: Rethinking Annotation Strategies in Imaging AI

Rädsch, Tim, Reinke, Annika, Weru, Vivienn, Tizabi, Minu D., Heller, Nicholas, Isensee, Fabian, Kopp-Schneider, Annette, Maier-Hein, Lena

arXiv.org Artificial IntelligenceJul-26-2024

This paper does not describe a novel method. Instead, it studies an essential foundation for reliable benchmarking and ultimately real-world application of AI-based image analysis: generating high-quality reference annotations. Previous research has focused on crowdsourcing as a means of outsourcing annotations. However, little attention has so far been given to annotation companies, specifically regarding their internal quality assurance (QA) processes. Therefore, our aim is to evaluate the influence of QA employed by annotation companies on annotation quality and devise methodologies for maximizing data annotation efficacy. Based on a total of 57,648 instance segmented images obtained from a total of 924 annotators and 34 QA workers from four annotation companies and Amazon Mechanical Turk (MTurk), we derived the following insights: (1) Annotation companies perform better both in terms of quantity and quality compared to the widely used platform MTurk. (2) Annotation companies' internal QA only provides marginal improvements, if any. However, improving labeling instructions instead of investing in QA can substantially boost annotation performance. (3) The benefit of internal QA depends on specific image characteristics. Our work could enable researchers to derive substantially more value from a fixed annotation budget and change the way annotation companies conduct internal QA.

annotation, annotation company, instruction, (14 more...)

arXiv.org Artificial Intelligence

2407.17596

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Minnesota (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(8 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Information Technology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (0.93)
Health & Medicine > Therapeutic Area (0.93)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications > Social Media > Crowdsourcing (0.89)

Add feedback

ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks

Gilardi, Fabrizio, Alizadeh, Meysam, Kubli, Maël

arXiv.org Artificial IntelligenceJul-19-2023

Published in the Proceedings of the National Academy of Sciences https://www.pnas.org/doi/10.1073/pnas.2305016120 Many NLP applications require manual text annotations for a variety of tasks, notably to train classifiers or evaluate the performance of unsupervised models. Depending on the size and degree of complexity, the tasks may be conducted by crowd-workers on platforms such as MTurk as well as trained annotators, such as research assistants. Using four samples of tweets and news articles (n = 6,183), we show that ChatGPT outperforms crowd-workers for several annotation tasks, including relevance, stance, topics, and frame detection. Across the four datasets, the zero-shot accuracy of ChatGPT exceeds that of crowd-workers by about 25 percentage points on average, while ChatGPT's intercoder agreement exceeds that of both crowd-workers and trained annotators for all tasks. Moreover, the per-annotation cost of ChatGPT is less than $0.003--about thirty times cheaper than MTurk. These results demonstrate the potential of large language models to drastically increase the efficiency of text classification. 1 Introduction Many NLP applications require high-quality labeled data, notably to train classifiers or evaluate the performance of unsupervised models. For example, researchers often aim to filter noisy social media data for relevance, assign texts to different topics or conceptual categories, or measure their sentiment or stance.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1073/pnas.2305016120

2303.15056

Country:

North America > United States (1.00)
Europe > Switzerland > Zürich > Zürich (0.15)

Genre: Research Report > New Finding (0.66)

Industry:

Health & Medicine (0.94)
Government > Regional Government > North America Government > United States Government (0.49)
Law > Statutes (0.46)
Information Technology > Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

How AI and crowdsourcing help social scientists sample diverse populations

#artificialintelligenceNov-25-2022, 05:10:05 GMT

Check out the on-demand sessions from the Low-Code/No-Code Summit to learn how to successfully innovate and achieve efficiency by upskilling and scaling citizen developers. In 2010, three psychologists from the University of British Columbia published a paper with an intriguing title: The WEIRDest people in the world? Paradoxically, the paper was about Americans. The three scientists had devoted their research careers to cross-cultural variability of human psychology and traveled the seven seas to study small-scale tribal societies. In the paper, they voiced a growing concern about how heavily the humanities -- psychology, economics, sociology, political science and others -- were relying on samples of Americans.

diversity, mturk, participant, (11 more...)

#artificialintelligence

Country:

North America > Canada > British Columbia (0.24)
North America > United States (0.14)
South America > Venezuela (0.04)
(8 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.95)
Information Technology > Communications > Social Media > Crowdsourcing (0.53)

Add feedback

Amazon Mechanical Turk - Wikipedia

#artificialintelligenceJul-19-2022, 05:20:49 GMT

Amazon Mechanical Turk (MTurk) is a crowdsourcing website for businesses (known as Requesters) to hire remotely located "crowdworkers" to perform discrete on-demand tasks that computers are currently unable to do. It is operated under Amazon Web Services, and is owned by Amazon.[1] Employers post jobs known as Human Intelligence Tasks (HITs), such as identifying specific content in an image or video, writing product descriptions, or answering questions, among others. Workers, colloquially known as Turkers or crowdworkers, browse among existing jobs and complete them in exchange for a rate set by the employer. To place jobs, the requesting programs use an open application programming interface (API), or the more limited MTurk Requester site.[2] As of April 2019, Requesters could register from only 49 approved countries.[3]

amazon, mechanical turk, requester, (14 more...)

#artificialintelligence

Country:

Asia > India (0.05)
North America > United States > Texas (0.04)
Europe (0.04)

Genre: Research Report > New Finding (0.47)

Industry:

Law (1.00)
Banking & Finance (0.95)
Information Technology > Services (0.67)

Technology:

Information Technology > Communications > Social Media > Crowdsourcing (1.00)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.71)

Add feedback

Micro-employment - trend created by large-scale automation

#artificialintelligenceOct-30-2021, 06:06:24 GMT

In recent years, a new trend has emerged on the labor market – micro-employment, making money on small jobs, for which you only need a laptop and access to the Internet. Platforms for customers and performers advertise their service as a convenient and easy way to make money. But often micro-employment is a lack of choice, low earnings, monotonous and ambiguous tasks. When shoppers in London's Hackney area shop at the new Amazon Fresh store, they no longer pay the checkout operator, but simply walk out with their wares. Amazon describes it as an effortless consumer experience. The rise in automated stores during the pandemic is just the tip of the iceberg.

large-scale automation, micro-employment, platform, (8 more...)

#artificialintelligence

Industry: Information Technology > Services (0.31)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Networks (0.36)

Add feedback

Big tech's push for automation hides the grim reality of 'microwork' Phil Jones

The GuardianOct-27-2021, 10:00:28 GMT

When customers in the London borough of Hackney shop in the new Amazon Fresh store, they no longer pay a checkout operator but simply walk out with their goods. Amazon describes "just walk out shopping" as an effortless consumer experience. The rise of automated stores during the pandemic is just the tip of the iceberg. Floor-cleaning robots have been introduced in hospitals, supermarkets and schools. Fast-food restaurants are employing burger-grilling robots and chatbots.

contractor, microwork, phil jones, (14 more...)

The Guardian

Country: Europe > United Kingdom > England > Greater London > London > Hackney (0.25)

Industry:

Health & Medicine (0.90)
Consumer Products & Services > Restaurants (0.90)

Technology: Information Technology > Artificial Intelligence > Robots (0.59)

Add feedback

AI: Ghost workers demand to be seen and heard

#artificialintelligenceMar-28-2021, 00:09:59 GMT

Artificial intelligence and machine learning exist on the back of a lot of hard work from humans. Alongside the scientists, there are thousands of low-paid workers whose job it is to classify and label data - the lifeblood of such systems. But increasingly there are questions about whether these so-called ghost workers are being exploited. As we train the machines to become more human, are we actually making the humans work more like machines? And what role do these workers play in shaping the AI systems that are increasingly controlling every aspect of our lives?

ghost worker demand, platform, so-called ghost worker, (9 more...)

#artificialintelligence

Country:

North America > United States > West Virginia (0.06)
North America > Canada > Quebec > Montreal (0.05)
Europe (0.05)
Africa (0.05)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Social Media (0.53)

Add feedback

5 Best Data Collection Companies for Machine Learning Projects

#artificialintelligenceMar-29-2020, 06:12:12 GMT

Data is the bedrock of all machine learning systems. As such, working with the right data collection company is critical in order to solve a supervised machine learning problem. If you don't have a particular goal or project in mind, there is a wealth of open data available on the web to practice with. However, if you're looking to tackle a specific problem, chances are you'll need to collect data yourself or work with a company that can collect data for you. There are many data collection companies that provide crowdsourcing services to help individuals and corporations gather data at scale.

artificial intelligence, data collection company, machine learning, (15 more...)

#artificialintelligence

Country: Europe > Germany (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications > Social Media > Crowdsourcing (0.55)

Add feedback

When AI needs a human assistant

#artificialintelligenceJun-17-2019, 11:51:17 GMT

For years, Amazon's Mechanical Turk (mTurk) has been a kind of open secret in the tech world, a place where fledgling algorithms can hire human labor on the cheap. If you need a hundred people to trace the boundaries of an object or fill out a survey, it's the single best place to make it happen. But while the project itself is well-known, it's always slightly embarrassing when a company turns up there. In 2017, Expensify was spotted asking mTurk workers to enter data from receipts, leading the company to rush out a statement insisting that the mTurk project had nothing to do with Expensify's main app. In part, it was a privacy issue, but mostly it was embarrassing: Expensify was built on a simple piece of technology -- the ability to extract data from a photo of a receipt -- and the mTurk tasks made it look like that technology was a sham. What if it was human beings extracting that data all along?

algorithm, artificial intelligence, social media, (8 more...)

#artificialintelligence

Industry:

Health & Medicine (0.69)
Information Technology > Security & Privacy (0.58)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Social Media > Crowdsourcing (0.69)

Add feedback

Comparison-Based Framework for Psychophysics: Lab versus Crowdsourcing

Haghiri, Siavash, Wichmann, Felix, von Luxburg, Ulrike

arXiv.org Machine LearningMay-17-2019

Traditionally, psychophysical experiments are conducted by repeated measurements on a few well-trained participants under well-controlled conditions, often resulting in, if done properly, high quality data. In recent years, however, crowdsourcing platforms are becoming increasingly popular means of data collection, measuring many participants at the potential cost of obtaining data of worse quality. In this paper we study whether the use of comparison-based (ordinal) data, combined with machine learning algorithms, can boost the reliability of crowdsourcing studies for psychophysics, such that they can achieve performance close to a lab experiment. To this end, we compare three setups: simulations, a psychophysics lab experiment, and the same experiment on Amazon Mechanical Turk. All these experiments are conducted in a comparison-based setting where participants have to answer triplet questions of the form "is object x closer to y or to z?". We then use machine learning to solve the triplet prediction problem: given a subset of triplet questions with corresponding answers, we predict the answer to the remaining questions. Considering the limitations and noise on MTurk, we find that the accuracy of triplet prediction is surprisingly close---but not equal---to our lab study.

accuracy, experiment, triplet, (15 more...)

arXiv.org Machine Learning

1905.07234

Country:

Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.14)
North America > United States > New York (0.04)
North America > United States > New Jersey (0.04)
(3 more...)

Genre: Research Report (0.82)

Industry: Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Communications > Social Media > Crowdsourcing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback