BRS85 - A CLARIN Transcription Portal for Interview Data

blogs

A CLARIN Transcription Portal for Interview Data

2020-03-06

Christoph Draxler, Henk van den Heuvel, Arjan van Hessen, Silvia Calamai, Louise Corti, Stefania Scagliola Abstract In this paper we present a first version of a transcription portal for audio files...

Cross disciplinary overtures with interview data

2018-11-29

Integrating digital practices and tools in the scholarly workflow Summary As increasingly sophisticated new technologies come on stream, there is one type of data that begs to be explored by the...

A Transcription Portal for Oral History Research and Beyond

2018-11-28

Background and Introduction Over the past 2 years a number of researchers from various backgrounds have been working on the exploitation of digital techniques and tools for working with oral history...

Speech Recognition and Scholarly Research: Usability and Sustainability

2018-10-05

Roeland Ordelman and Arjan van Hessen Netherlands Institute for Sound and Vision and University of Twente, The Netherlands Objectives In spite of significant efforts and progress, automatic speech...

Evaluation of the OH-portal

2018-10-04

During the successful and enjoyable workshop in Arezzo (May 2017), it became clear that, if done properly, automatic transcription of interviews could be useful to get a quick overview of what was...

In response to your inquiry Automatic e-mail answer suggestion in a Dutch Contact Centre

2015-10-09

M.R. Boedeltje and A.J. van Hessen Telecats and Telecats/University of Twente Abstract In the past years, the number of service requests through e-mail has shown an explosive growth. Equal to most...

Utterance generation for transaction dialogues

2015-07-19

Joris Hulstijn and Arjan van Hessen University of Twente The Netherlands { joris | hessen }@cs.utwente.nl 1. Transaction Dialogues Obligations Transactions (ticket reservation, distant selling)...

Croatian Memories: an interview collection with personal narratives on war and trauma

2015-07-17

Arjan van Hessen1, Franciska de Jong1, 2, and Stef Scagliola2 1 Human Media Interaction Group, Universiteit Twente, The Netherlands {a.j.vanhessen, f.m.g.dejong}@utwente.nl 2 Erasmus Studio, Erasmus...

Christoph Draxler, Henk van den Heuvel, Arjan van Hessen, Silvia Calamai, Louise Corti, Stefania Scagliola

Abstract

In this paper we present a first version of a transcription portal for audio files based on automatic speech recognition (ASR) in various languages. The portal is implemented in the CLARIN resources research network and intended for use by non-technical scholars. We explain the background and interdisciplinary nature of interview data, the perks and quirks of using ASR for transcribing the audio in a research context, the dos and don’ts for optimal use of the portal, and future developments foreseen. The portal is promoted in a range of workshops, but there are a number of challenges that have to be met. These challenges concern privacy issues, ASR quality, and cost, amongst others.

Keywords: automatic speech recognition, interviews, digital humanities, social sciences, research infrastructure

icoonpdf full paper

Laatste aanpassing website: zondag 27 juli 2025, 14:23:54.