Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Datashader support

See original GitHub issue

The new datashader support in HoloViews is great – we can now easily choose between a “normal” HoloViews plot, a decimated one, or an aggregated one, making it simple to work with large or small datasets. But there are still a few wrinkles to iron out:

Automatic bin array sizing

The height and width of the datashader bin array must currently be set explicitly, but the plot size in pixels is available from the underlying Bokeh plot, and so we should be able to attach some streams that provide that size to datashader for its bin array dimensions instead. It will always be important for users to be able to override such automatic values, but having them be automatic would be vastly more convenient in most cases.

Unifying shading and colormapping

There are now two completely separate mechanisms for “shading”, i.e., rendering a data array stored in an hv.Image object into a set of RGBA pixels for display. The “backend” mechanism is called by the display hook for hv.Image objects, to provide “rich display” of the object in Jupyter. This mechanism provides a colormap to the backend (Matplotlib or Bokeh) and requests that the backend render the image. Presumably the backend mechanism allows Bokeh to provide accurate hover information, as it has direct access to the underlying data values. The backend mechanism is also fully automatic, from the user perspective, being called behind the scenes whenever such an object needs to be displayed.

The other mechanism is provided by hv.operation.datashader.shade(), which explicitly turns an hv.Image object into an hv.RGB object, performing the colormapping from within Python using specialized functions in Datashader. The shade() mechanism has functionality not present in Bokeh or Matplotlib’s colormapping, namely doing histogram equalization, using the alpha channel to provide a mask, and allowing arbitrary data normalization functions if needed. However, it currently must be invoked explicitly, using shade() to create a different HoloViews object type that no longer has information about the original data values, and is thus a “dead end” just for visualization, of the type that we try to avoid in HoloViews. It would be nice to put this functionality into the plot options instead so that it can be invoked automatically on rich display, to avoid such dead ends and make the user experience more straightforward.

Shading categorical aggregates

Categorical aggregates currently end up as an NdOverlay of Images, which can’t normally be viewed directly as the topmost image obscures all the rest. Thus these aggregates have an additional wrinkle similar to the issue for shading above – they must first be converted explicitly into something visible, rather than allowing the user to use the same representation for working with data and for viewing it, as is an explicit goal of HoloViews. Instead, we should be able to generalize the notion of compositors, perhaps covering the shading case above as well, allowing such overlays to be combined and shaded automatically on rich display.

These changes would make using datashading much more convenient, intuitive, and seamless in HoloViews. Tasks:

Automatic plot sizing for aggregate()
Shading as a plot option
Compositors for shading categorical aggregates

Issue Analytics

State:
Created 7 years ago
Comments:5 (4 by maintainers)

Top GitHub Comments

1reaction

jbednarcommented, May 24, 2021

It would be nice to put [shade] functionality into the plot options instead so that it can be invoked automatically on rich display,

Current support in bokeh makes this approach unnecessary, so I’ve marked it “done”. There are still some gaps between Bokeh’s colormapping what shade() can do, namely using alpha to indicate value for a single color and mixing categorical colors, but those are separate issues and should be addressed outside of this one.

0reactions

jlstevenscommented, Mar 18, 2019

Just a note that new bokeh colormappers that will be better able to support datashader are being worked on: https://github.com/bokeh/bokeh/pull/8743

@kidpixo The compositor is a feature that is used occasionally internally but has never been properly documented. You might be able to get it working but I am not really sure about its status atm.

Top Results From Across the Web

FAQ — Datashader v0.14.3

Sample code is provided with Datashader for demonstrating how to use HoloViews (supporting Bokeh, Matplotlib, and Plotly) or native Matplotlib to provide ...

Interactivity — Datashader v0.14.3

In this notebook, we will first look at the HoloViews API, then at Datashader's new native Matplotlib support. Embedding Datashader with HoloViews#. HoloViews...

Releases — Datashader v0.14.3

Support for Dask-based quadmesh rendering for rectilinear and curvilinear mesh types (#885, #913). Support for GPU-based raster mesh rendering via Canvas.

Extending — Datashader v0.14.3

Datashader is directly supported by HoloViews, with interactive exploration supported for its Bokeh extension, and static plots supported for its Matplotlib ...

Installation — Datashader v0.14.3

The API is the definitive guide to each part of Datashader, but the same information is available more conveniently via the help() command...