back

Automatically Subtitling the C3

How speech processing helps the CCC subtitle project, and vice-versa.

If you suspend your transcription on amara.org, please add a timestamp below to indicate how far you progressed! This will help others to resume your work!

Please do not press “publish” on amara.org to save your progress, use “save draft” instead. Only press “publish” when you're done with quality control.

Video duration: 00:30:55
Language: English
Abstract: Transcribing a talk comes relatively easy to fast typists, whereas turning a transcript into time-aligned subtitles for a video requires a much larger human effort. In contrast, speech recognition performance (especially for open-source-based solutions), is still poor on open-domain topics, but speech technology is able to align a given text to the corresponding speech with high accuracy. Let's join forces to generate superior subtitling with little effort, and to improve future open-source-based speech recognizers, at the same time!

We present the ongoing work of an student project in informatics at Universität Hamburg in which we combine the strengths of human transcription performance and automatic alignment of these transcriptions to produce high quality video subtitles.

We believe that our work can help the C3 community in generating video subtitles with less manual effort, and we hope to provide subtitles for all 31C3 talks (as long as you provide the transcriptions).

However, we're not just a service provider to the C3. There is a shortage of training material for free and open-source speech recognizers and the acoustic models they employ. Thus, we plan to prepare an aligned audio corpus of C3 talks which will help to advance open-source speech recognition.

Be a part of this by helping us with your transcriptions -- we'll repay with subtitlings and better open-source speech recognition in the future!

Talk ID: 6554
Event:: 31c3
Day: 3
Room: Saal G
Start: 10 p.m.
Duration: 00:30:00
Track: Science
Type of: lecture
Speaker: timobaumann; Arne Köhn
Talk Slug & media link: 31c3_-_6554_-_en_-_saal_g_-_201412292200_-_automatically_subtitling_the_c3_-_timobaumann_-_arne_kohn

: Source Repository
: Project Homepage

English

0.0%

100.0%

Work on this video on Amara!

This talk has a pad for the transcript, please use this instead of the amara editor: Etherpad
Shortcut to the video files: CDN Video Link or YouTube Video Link use this with otranscribe.com

Automatically Subtitling the C3

How speech processing helps the CCC subtitle project, and vice-versa.

Work on this video on Amara!

Please check: