Lyrics Transcription for Humans: A Readability-Aware Benchmark

Ondřej Cífka; Hendrik Schreiber; Luke Miner; Fabian-Robert Stöter

doi:10.5281/zenodo.14877443

Published November 10, 2024 | Version v1

Writing down lyrics for human consumption involves not only accurately capturing word sequences, but also incorporating punctuation and formatting for clarity and to convey contextual information. This includes song structure, emotional emphasis, and contrast between lead and background vocals. While automatic lyrics transcription (ALT) systems have advanced beyond producing unstructured strings of words and are able to draw on wider context, ALT benchmarks have not kept pace and continue to focus exclusively on words. To address this gap, we introduce Jam-ALT, a comprehensive lyrics transcription benchmark. The benchmark features a complete revision of the JamendoLyrics dataset, in adherence to industry standards for lyrics transcription and formatting, along with evaluation metrics designed to capture and assess the lyric-specific nuances, laying the foundation for improving the readability of lyrics. We apply the benchmark to recent transcription systems and present additional error analysis, as well as an experimental comparison with a classical music dataset.

Resource type

Conference paper

Publisher

ISMIR

Imprint

Proceedings of the 25th International Society for Music Information Retrieval Conference, 737-744. San Francisco, California, USA and Online.

Conference

International Society for Music Information Retrieval Conference (ISMIR 2024), San Francisco, California, USA and Online, November 10-14, 2024

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: February 16, 2025
Modified: February 16, 2025
