Skip to main content

The Dynamics of Micro-Task Crowdsourcing

On 20 May 2015 Dr Gianluca Demartini of the Information School will present a paper on 'The Dynamics of Micro-Task Crowdsourcing' at the 24th World Wide Web Conference in Florence, Italy.

Micro-task crowdsourcing is a modern technique that allows outsourcing of simple data collection tasks to a crowd of individuals online. Tasks such as image annotation, document summarisation, or audio transcription are easy for humans to complete but very challenging for computers. micro-task crowdsourcing is commonly used to build information systems that combine the scalability of computers over large amounts of data with the quality of human intelligence.

Over the last 10 years different micro-task crowdsourcing platforms have been created. These platforms are marketplaces where crowd workers complete tasks (usually called Human Intelligence Tasks or HITs) in exchange of small monetary rewards and where requesters post their data and tasks to quickly obtain large scale annotations.

As part of research carried out with Difallah, Catasta, Ipeirotis and Cudré-Mauroux, Gianluca analysed logs between 2009 and 2014 from the most popular crowdsourcing platform: Amazon Mechanical Turk and observed the evolution over time of its usage. This research will be presented at the World Wide Web Conference, and the full paper is available here.

The main findings from the research are:

Published tasks:
- The most frequent HIT reward value on Amazon Mechanical Turk has increased over time, and reaches $0.05 in 2014.
- HITs about audio transcription have been gaining momentum over last years and are now the most popular tasks on Amazon Mechanical Turk.
- Content Access HITs (like “Visit this website” or “click this link”) popularity on Amazon Mechanical Turk has decreased over time.
- Surveys are the most popular type of HITs for US-based workers on Amazon Mechanical Turk.

Workers and Requesters:
- HITs on Amazon Mechanical Turk that are exclusively asking for workers based in India have strongly decreased over time
- While most HITs on Amazon Mechanical Turk do not require country-specific workers, most of such HITs require US-based workers
- New requesters constantly join Amazon Mechanical Turk, making the number of active requesters and available reward increase over time: Over the last 2 years, an average of 1000 new requesters per month joined Amazon Mechanical Turk
- There is a weekly seasonality effect in the amount of rewards assigned to workers and in the HITs available on Amazon Mechanical Turk

Market size and dynamics:
- On Amazon Mechanical Turk 10K new HITs arrive and 7.5K HITs get completed every hour (on average)
- New HITs attract new workers to the Amazon Mechanical Turk website
- New workers arriving to the Amazon Mechanical Turk platform complete both fresh and old HITs
- Workers on Amazon Mechanical Turk prefer to work on fresh, recently posted HITs
- New work has almost 10x higher attractiveness for workers as compared to remaining work on Amazon Mechanical Turk

Work size and speed:
- Very large (300K HITs) batches recently appeared on Amazon Mechanical Turk
- Throughput of HIT batches on Amazon Mechanical Turk can best be predicted based on the number of HITs in the batch and its freshness
- Large HIT batches can achieve high throughput (thousands of HITs per minute) on Amazon Mechanical Turk


Above: Cumulative HITs (log) per country plotted by time


Above: Micro Reward per year


Comments

Popular posts from this blog

Raspberry Pi Weather Project now live

A project to create a raspberry pi weather station is currently live in the Information School.  The Sheffield Pi weather station has been created by Romilly Close, undergraduate Aerospace Engineering student at the University of Sheffield.  The project was funded by the Sheffield Undergraduate Research Experience (SURE) scheme and is being supervised by Dr Jo Bates, Paula Goodale and Fred Sonnenwald from the Information School. Information about the Sheffield Pi station and how to create your own can be found on the project website .  You can also see live data from the Sheffield Pi station on Plot.ly , and further information can also be found on the Met Office Weather Observations Website .    This work compliments the School’s existing project entitled ‘The Secret Life of a Weather Datum’ which explores socio-cultural influences on weather data.  This project is funded under the AHRC’s Digital Transformations Big Data call.  It ...

Our Chemoinformatics Group wins Jason Farradane Award

The Information School's Chemoinformatics Research Group has been awarded the 2012 UKeiG Jason Farradane Award , in recognition of its outstanding 40 year contribution to the information field. The prize is awarded to the three current members of the group,  Professor Val Gillet , Dr John Holliday and Professor Peter Willett . The judges recognised the Group's status as one of the world's leading centres of chemoinformatics research, a major contributor to the field of information science, and an exemplar in raising the profile of the information profession. The School has a long association with the Farradane prize. Its second recipient was long time member of staff Professor Mike Lynch in 1980.

Professor Mike Thelwall gives inaugural lecture

Professor of Data Science Mike Thelwall recently gave his inaugural lecture at the University of Sheffield, entitled  How helpful are AI and bibliometrics for assessing the quality of academic research? The lecture, delivered in the University's Diamond building, was introduced by Head of the Information School Professor Briony Birdi. It covered Mike's research into whether Artificial Intelligence can inform - or replace - expert peer review in the journal article publication process and what this could look like, as well as to what extent bibliometrics and citation statistics can play a role in assessing the quality of a piece of research. Mike also discussed whether tools like ChatGPT can accurately detect research quality. The inaugural lecture was well attended by colleagues from around the University.