An Iterative Dual Pathway Structure for Speech-to-Text Transcription

Liem, Beatrice (Harvard University) | Zhang, Haoqi (Harvard University) | Chen, Yiling (Harvard University)

Aug-8-2011–AAAI Conferences

In this paper, we develop a new human computation algorithm for speech-to-text transcription that can potentially achieve the high accuracy of professional transcription using only microtasks deployed via an online task market or a game. The algorithm partitions audio clips into short 10-second segments for independent processing and joins adjacent outputs to produce the full transcription. Each segment is sent through an iterative dual pathway structure that allows participants in either path to iteratively refine the transcriptions of others in their path while being rewarded based on transcriptions in the other path, eliminating the need to check transcripts in a separate process. Initial experiments with local subjects show that produced transcripts are on average 96.6% accurate.
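The partition-and-join step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names `partition` and `join_segments` are hypothetical, and because the abstract does not say how adjacent outputs are joined, plain in-order concatenation is assumed here. The iterative dual pathway refinement and the reward scheme are not modeled.

```python
import math
from typing import List, Tuple

SEGMENT_SECONDS = 10.0  # segment length stated in the abstract


def partition(duration: float, segment: float = SEGMENT_SECONDS) -> List[Tuple[float, float]]:
    """Return (start, end) times, in seconds, covering [0, duration].

    Every segment has length `segment` except possibly the last, shorter one.
    """
    if duration <= 0 or segment <= 0:
        return []
    count = math.ceil(duration / segment)
    # Compute each boundary from its index to avoid accumulating float error.
    return [(i * segment, min((i + 1) * segment, duration)) for i in range(count)]


def join_segments(texts: List[str]) -> str:
    """Assumed joining rule: concatenate per-segment transcripts in order.

    Empty or whitespace-only segment transcripts are skipped.
    """
    return " ".join(t.strip() for t in texts if t.strip())
```

For a 25-second clip, `partition(25.0)` gives three segments, the last one 5 seconds long; each segment would then be transcribed independently before `join_segments` assembles the full text.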

artificial intelligence, social media, transcript

Country:
- North America > United States
  - New York > New York County
    - New York City (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
- Europe > United Kingdom
  - England > East Sussex > Brighton (0.04)

Genre:
- Research Report > New Finding (0.46)

Technology:
- Information Technology
  - Communications > Social Media (0.68)
  - Artificial Intelligence > Speech
    - Speech Recognition (1.00)