Topic on User talk:Hjfocs

Jump to navigation Jump to search

Issues with Last.fm extractor

3
Summary by Hjfocs

[soweego 2] MusicBrainz (Q14005) URLs validation: delete percent-encoded IDs & put back decoded ones; use pluses instead of whitespaces for Last.fm ID (P3192) values

Lockal (talkcontribs)

Hi, could you reevaluate recently imported LastFM ids, please? It was broken and now we have 34 Johns.

Lockal (talkcontribs)

Another note: now as "%" is allowed in extraction pattern, could you automatically convert all %20 (" ") to "%2B" ("+") in Last.fm ids (and only for Last.fm ids)? Both spaces and plus encoding works for last.fm (even double encoding monstrosity like Kevin%2520Macleod works), but pluses are canonical there.

Hjfocs (talkcontribs)

I'm very grateful for your regular feedback, that's really precious. Here are the actions taken:

  • all the bad IDs resulting from the SPARQL query you pointed out are now deleted;
  • all percent-encoded IDs are now replaced with decoded ones;
  • all Last.fm ID (P3192) IDs now have pluses instead of whitespaces.