Feasibility of Post-Editing Speech Transcriptions with a Mismatched Crowd

Radadia, Purushotam; Karande, Shirish

Computer Science > Artificial Intelligence

arXiv:1609.02043 (cs)

[Submitted on 7 Sep 2016]

Title:Feasibility of Post-Editing Speech Transcriptions with a Mismatched Crowd

Authors:Purushotam Radadia, Shirish Karande

View PDF

Abstract:Manual correction of speech transcription can involve a selection from plausible transcriptions. Recent work has shown the feasibility of employing a mismatched crowd for speech transcription. However, it is yet to be established whether a mismatched worker has sufficiently fine-granular speech perception to choose among the phonetically proximate options that are likely to be generated from the trellis of an ASRU. Hence, we consider five languages, Arabic, German, Hindi, Russian and Spanish. For each we generate synthetic, phonetically proximate, options which emulate post-editing scenarios of varying difficulty. We consistently observe non-trivial crowd ability to choose among fine-granular options.

Comments:	HCOMP 2016 Works-in-Progress
Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:1609.02043 [cs.AI]
	(or arXiv:1609.02043v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1609.02043

Submission history

From: Shirish Karande [view email]
[v1] Wed, 7 Sep 2016 16:05:20 UTC (253 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2016-09

Change to browse by:

cs
cs.CL

References & Citations

DBLP - CS Bibliography

listing | bibtex

Purushotam G. Radadia
Shirish Subhash Karande

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Feasibility of Post-Editing Speech Transcriptions with a Mismatched Crowd

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Feasibility of Post-Editing Speech Transcriptions with a Mismatched Crowd

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators