Search for exact title with different encoding fails #6059
Labels
Lead: @cdrini
Issues overseen by Drini (Staff: Team Lead & Solr, Library Explorer, i18n) [managed]
Module: Solr
Issues related to the configuration or use of the Solr subsystem. [managed]
Priority: 2
Important, as time permits. [managed]
Type: Bug
Something isn't working. [managed]
If the title of a work contains characters with diacritics, searching for it with a string that is exactly the same title in a different Unicode representation (for example, a precomposed versus a decomposed form of the accented character) fails.
Evidence / Screenshot (if possible)
This is very hard to demo, because it involves strings which appear identical, and whose encoding can be (and often is) changed by browsers, editors, etc. Therefore, I have created a Colab notebook that demonstrates the error: https://colab.research.google.com/drive/1NiKOD0Md_nR7bPbHXGW4lUnyCbc0sbJ5
Since @cdrini is already aware of the issue, I hope this is sufficient.
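For readers who cannot open the notebook, here is a minimal sketch of the kind of mismatch described above. It is not taken from the notebook: the title is a made-up example, `same_title` is a hypothetical helper, and it assumes the two representations differ only in Unicode normalization form (precomposed vs. decomposed). The escapes are written out explicitly so that editors and browsers cannot silently change them.

```python
import unicodedata

# Hypothetical example title, written with explicit escapes.
composed = "Les Mis\u00e9rables"      # "é" as a single code point (U+00E9)
decomposed = "Les Mise\u0301rables"   # "e" followed by a combining acute accent (U+0301)

# Both strings render identically, but they are different code point sequences.
print(composed == decomposed)          # False
print(len(composed), len(decomposed))  # 14 15


def same_title(a: str, b: str) -> bool:
    """Compare two titles after normalizing both to NFC (hypothetical helper)."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


print(same_title(composed, decomposed))  # True
```

This only illustrates why an exact match can fail. Whether the fix belongs at index time, at query time, or in both places is a separate question for the Solr configuration.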
Stakeholders
@cdrini