Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

ENH: SortField shorthand

See original GitHub issue

Yesterday, a colleague asked me how to dictate the sort of bars in a chart. I developed this example to show him how.

download

alt.Chart(df, title="Median household income of U.S. counties").mark_bar().encode(
    x=alt.X(
        "name:N",
        axis=alt.Axis(labels=False, title="", ticks=False),
        # Here's where you can resort the order of the columns on the x-axis
        sort=alt.SortField(
            # This SortField class requires at least three inputs,
            # which does seem like overkill. I'd like to see a simpler
            # way to pull this off.
            field='b19013001',  # First the field you want to sort on 
            op='sum',  # Then the operation to run on that field. In this case, we just total the value.
            order="descending"  # Finally, the order to sort.
        )
    ),
    y=alt.Y(
        "b19013001:Q",
        axis=alt.Axis(title="", format="$s", ticks=False)
    )
).properties(width=620)

It works great but, IMHO, the SortField requirement with three inputs, including a “fake” op that in this case does not appear to be necessary, is asking a lot of beginners. And I’d like to think something more convenient could also benefit experts.

I know nothing about the internals of this feature, but I’m curious if the sort channel could somehow benefit from a shorthand, much like the x and y channels.

In my imagination, something like this:

sort=alt.SortField(field="b19013001", op="sum", ordering="descending")

Could be submitted like this, with the field and operation handled much like the other shorthand features, and the descending order of the sort handled with the same style as the order_by method of the popular Django framework:

sort="-sum(b19013001)"

I’m guessing you can easily imagine the other permutations in this kind of scheme. Additionally in cases where the dataframe is not grouped during encoding, it seems to me that providing the op argument should be, 🥁, optional. That would mean that if a field was to be used as the sort in ascending order with no aggregation, the shorthand submission could be as simple as:

sort="b19013001"

What do you think? If something like this already exists and I’m simply ignorant of it I will accept writing the documentation as my punishment.

Issue Analytics

State:
Created 5 years ago
Reactions:1
Comments:12 (8 by maintainers)

Top GitHub Comments

1reaction

jakevdpcommented, May 26, 2018

I wouldn’t say they’re off-limits… I’d just say we need to think carefully about where to draw the line on what parts of Altair exactly mirror the Vega-Lite API and what parts diverge.

Just for background: the way the shorthand expressions work is:

Subclass all encoding channel classes
Add an attribute that is invalid according to its schema
specialize the to_dict() method so that it detects the presence of this attribute, removes it, and interprets its contents into a form that is valid according to the schema (in this case, populating the field, type, aggregate, and timeUnit attributes).

This customized code depends on the details of the schema, and so when the schema is updated the details of these modifications have to be updated as well. For example, the Vega-Lite version 1 and Vega-Lite version 2 schemas were so different that it required essentially rewriting the code from scratch, which all told took about 8 months to really get correct. Along the way, I dropped a number of other API shortcuts we had created earlier because I saw how unmaintainable they were when it came to schema updates.

I think overall it’s good to have those encoding shorthands available at the top level of the encoding… it’s something that’s used in basically every chart, and so the added maintenance burden is worth it. For any other API changes that require circumventing the grammar of the Vega-Lite schema, I want to make sure we’re carefully weighing the benefit to users vs the costs of the new maintenance burdens they create.

So no, nothing’s off-limits per se, but there’s a lot to keep in mind when making these kinds of decisions.

0reactions

kanitwcommented, May 28, 2019

Maybe… my best attempt at making it modular is here, in the code generation tools, where we automatically generate wrappers for schema objects for which we want to modify the default behavior: /tools/generate_schema_wrapper.py@master#L245-L293

There’s a lot in there that is “hard-coded”, so when the schema changes it takes a bit of hunting to figure out why things aren’t working any more.

I think it’s worth knowing what are the things that Altair still diverges from Vega-Lite, so we can revise our defaults, esp. for the upcoming VL4.

I still think it may be useful to allow a shorter syntax, like sort=‘column’ rather than sort=alt.EncodingSortField(‘column’)

Yep, I have an issue that you can upvote in VL here: https://github.com/vega/vega-lite/issues/4933.

Top Results From Across the Web

Glossary - Default

Abbreviation and Acronym ... East or Eastern or Easting | English | Enhancement Project Roadside (E-1234), Roadway Plans.

Gregg shorthand reporting course [1944]

Gregg shorthand reporting course [1944]. Swem, Charles Lee.; Gregg, John Robert; 1867-1948; ... "Gregg shorthand reporting course"@eng. Loading.

IOAG Organizations

Abbreviation Sort Ascending. Web Site ... Russia, 107996, г. Москва ул. Щепкина, 42. RFSA, http://www.roscosmos.ru/index.asp?Lang=ENG.

VistA Scheduling Enhancements (VSE) VS GUI User Guide

VistA Scheduling Enhancement (VSE) SharePoint (previously VA Pulse pages): ... two-character minimum when searching by clinic abbreviation),.

SAP Table: AES_ADDR - Structure for Transferring Addresses to ...

SAP Table: AES_ADDR - Structure for Transferring Addresses to Enhancement Services ... 30, STREETABBR, (Not Supported) Abbreviation of Street Name, CHAR, 2.