Page MenuHomePhabricator

CommonsMetadata should remove simple HTML wrapping
Closed, ResolvedPublic

Description

Where possible, metadata values should be plain text, not HTML (Wikidata does not support HTML values). There is not much we can do with arbitrary HTML, but a few simple values should be recognized, e.g. <p>text</p>, <ul><li>text</li></ul> etc. (Also, maybe a list with multiple elements could be turned into a multivalued field?)


Version: unspecified
Severity: minor

Details

Reference
bz57848

Event Timeline

bzimport raised the priority of this task from to Needs Triage.Nov 22 2014, 2:37 AM
bzimport added a project: CommonsMetadata.
bzimport set Reference to bz57848.

Change 120948 had a related patch set uploaded by Gergő Tisza:
Clean parsed HTML

https://gerrit.wikimedia.org/r/120948

<p> is cleaned now; <ul> seems too complex to be worth it.

Gilles triaged this task as Unbreak Now! priority.Dec 4 2014, 10:11 AM
Gilles moved this task from Untriaged to Done on the Multimedia board.
Gilles lowered the priority of this task from Unbreak Now! to Needs Triage.Dec 4 2014, 11:23 AM