Better support R users

tl;dr – I’d love thoughts and discussion around this. I suspect this has several components to it; one big component of this is input data formats. I’ve taken some notes below about what (I think?) that landscape looks like based on some initial investigations.

Since we are targeting the loop between experimental and computational users, we could support many more scientists if launch were able to read the output of Seurat and Bioconductor.

For improving data ingress, I'd vote for implementing this in launch rather than prepare, simply because if someone is already working in Seurat/Bioconductor (i.e., in an analytical loop), I assume they don't need prepare to do normalization, embedding, etc. I think this is also where we'd most likely be intercepting pairs of experimental/computational users.

This does get a little tricky. From exploring and consulting with Angela Pisco and Olga Botvinnik, it appears that both Seurat and Bioconductor mostly just output rds and rdata objects. While there are converters between Seurat and SingleCellExperiment R objects, and from those to h5ad, these are still under development and nontrivial to work with.

In the future, I suspect supporting launch from rdata/rds will be necessary. But, it does seem like a larger and more complex chunk of work than I had anticipated due to the absence of an intermediate data file structure.

In the meantime, I’d suggest implementing support for other file formats already supported by anndata.read as detailed here.
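
To make that suggestion concrete, here is a minimal sketch of dispatching on file extension to one of anndata's readers. The reader names (read_h5ad, read_loom, read_csv, read_text, read_mtx) are existing anndata functions, though depending on the anndata version they may live under anndata.io instead of the top-level module. The READERS table, pick_reader, and load are hypothetical names invented for illustration and are not part of launch.

```python
from pathlib import Path

# Hypothetical table: file suffix -> name of an anndata reader function.
# The suffix choices are an assumption; extend as needed.
READERS = {
    ".h5ad": "read_h5ad",
    ".loom": "read_loom",
    ".csv": "read_csv",
    ".txt": "read_text",
    ".tsv": "read_text",
    ".mtx": "read_mtx",
}


def pick_reader(path):
    """Return the anndata reader name for a path, based on its suffix."""
    suffix = Path(path).suffix.lower()
    try:
        return READERS[suffix]
    except KeyError:
        # rds/rdata land here: anndata has no reader for them.
        raise ValueError(f"unsupported input format: {suffix!r}") from None


def load(path):
    """Load a file into an AnnData object using the matching reader."""
    import anndata  # imported lazily so pick_reader works without it

    reader = getattr(anndata, pick_reader(path))
    return reader(path)
```

With this shape, rds and rdata files fail early with a clear error, which matches the point above that they need a separate, larger piece of work.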

Issue Analytics

  • State: closed
  • Created 5 years ago
  • Comments: 6 (5 by maintainers)

Top GitHub Comments

1 reaction
csweaver commented, Mar 8, 2019

I think that supporting the R ecosystem and supporting input from other file formats are going to be two separate tickets. For this ticket, the title matches supporting input from other formats, but the body matches supporting R. And it seems like supporting R is a more involved process that deserves its own prioritization for roadmapping and releases.

0 reactions
signechambers1 commented, Sep 29, 2020

Closing as a duplicate of #1841
