Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Tags are case-sensitive, and it's very hard to store and query them case-insensitively

See original GitHub issue

Tags are provided by django-taggit. It has a setting, TAGGIT_CASE_INSENSITIVE, which is supposed to make storing and retrieving tags ignore its case. This setting has multiple issues:

It doesn’t do anything about tags currently in the database.
- Setting this option to True while you have tags like economics and Economics won’t make the uppercase version go away. When you try to save a page that has either version of economics you’ll get hit with a MultipleObjectsReturned error.
- It’s also very hard to query all the objects that reference a tag and make them all point to lowercase versions… so while a management command is possible, I couldn’t figure out how to write one.
It takes the first version of the given tag.
- The case insensitive flag does not coerce all tags into lowercase. Instead, it compares all new tags by a case-insensitive match against the tag name.
- If you store eCoNomiCs the first time, every time you try to write economics it will be coerced into the horrible, disfigured version. Content editors can’t change this in Wagtail once it’s been done. This would be less bad if all tags were at least rendered in lowercase on the page editor.

This effectively means that tags can not be used in the way that tags are usually intended to be used. From my standpoint, django-taggit is a completely broken library. These issues have been raised multiple times upstream, but with no fix.

To demonstrate why this is so bad: imagine you’re running an online shop with tens of thousands of t-shirts. You have the tags black and Black which each matching 50% of black t-shirts. You cannot simply have a filter called “black” which shows all the black shirts. You have to run some post-processing after querying the tags to try to “merge” the tags. Yet, this is what tags are typically used for. I cannot even think of a case where case-sensitive tags are relevant (maybe for some strange reason “guy” ie a human male, is distinct from “Guy” ie the given name - this seems much more like the edge case than the rule).

I know this was a lot of information, and it may be hard to see my point unless you’ve experienced exactly what I’m talking about, but there is seriously no straightforward way to create a sidebar filter like this where economics and Economics are viewed as the same tag:

screenshot from 2018-09-22 17 50 05

The closest I’ve gotten is by writing some code like this:

discipline_tags = Tag.objects.filter(
    course_materials_disciplinetag_items__isnull=False
).annotate(
    num_results=Count('course_materials_disciplinetag_items'),
    lower_name=Lower('name')
).order_by('-num_results', lower_name).distinct('lower_name')

But this code actually errors out, because Django doesn’t support using annotate and then using the result in distinct. Also, this would only solve half my problem: it would filter out the duplicate tags, but it wouldn’t show the correct results count, and I couldn’t use it to get the combined results for rendering the filtered page.

Issue Analytics

State:
Created 5 years ago
Reactions:2
Comments:6 (4 by maintainers)

Top GitHub Comments

1reaction

nmorduchcommented, Oct 4, 2018

Opened issue #4798 for @harrislapiroff to add wagtail-autocomplete to core as discussed in the core team meeting yesterday. Whether or not that also closes this issue is a decision I will leave to someone else.

1reaction

nmorduchcommented, Oct 3, 2018

If we consider the route of replacement, it could be worth considering replacing tags with m2ms and wagtail-autocomplete which would require some work for integration into core but has autocompletion, on-the-fly creation, and the flexibility of being able to make your own tag model

cc @harrislapiroff @emilyhorsman

Top Results From Across the Web

Case sensitive and insensitive like in SQLite - Stack Overflow

You can use the UPPER keyword on your case insensitive field then upper-case your like statement. e.g.. SELECT * FROM mytable WHERE caseSensitiveField...

labels should be case insensitive | Jira Server and Data Center

Yes it is. This is my main gripe with the whole labeling process. There usually ends up being at least two of every...

Swift Core Data case insensitive request - Apple Developer

As a work around, I just added a new field used just for querying. This field is the lower case version of the...

Is SQL Case-Sensitive? - LearnSQL.com

Differences in case sensitivity can lead to problems in executing your queries. For this reason, you need to understand the error messages you ......

Design Studio URLs are case-sensitive. And that's OK.

Yes, https://some.eXaMpLe.com is implicitly case-insensitive (because it doesn't have a /path , ?query , nor #hash ). But case-insensitivity ...