Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Error with launching cellxgene after manually adding metadata to AnnData file (.h5ad)

See original GitHub issue

Hi,

I have an AnnData file adata.h5ad file with certain columns in adata.obs. I have some more metadata that I would like to add to adata.obs. I follow some simple steps and recommendations provided online to add these columns to adata.obs

# df_annot - pandas dataframe with annotations/columns I would like to add
# df_annot has identical index as adata.obs
adata.obs = pd.concat([adata.obs,df_annot],axis=1)

Note: I am using cellxgene==0.14.1 The launching of original adata.h5ad using cellxgene works using the following command - cellxgene launch adata.obs --port 8000 --disable-diffexp

However, when I try to launch adata.h5ad after having added additional columns (metadata) to adata.obs using the same command, I get the following error -

Error: 'dict' object has no attribute 'dtype' - file not found or is inaccessible.  File must be an .h5ad object.  Please check your input and try again.

Initially, I thought there might be an issue with how I am adding metadata/columns to AnnData file - adata.obs. I explored further by re-reading the modified h5ad file using scanpy (sc.read_h5ad) and processing it further by performing steps like ranking gene groups (sc.tl.rank_genes_groups). These steps were successful, so it seems unlikely that this is an AnnData (.h5ad) file issue.

Would love to hear your thoughts on where the issue might be arising from and what would be the best way to debug and getting it to work.

cellxgene installed in a conda environment with the following dependency versions -

python==3.7.3
numpy==1.16.3
pandas==0.24.2
numba==0.43.1

cellxgene==0.14.1
anndata==0.6.22post1
scanpy==1.4.5.1

Update (Mar 10, 2020):

To clarify, I am using different environments for cellxgene, and processing the AnnData file. CELLXGENE CONDA ENVIRONMENT

python==3.7.3
numpy==1.16.3
pandas==0.24.2
numba==0.43.1
cellxgene==0.14.1
anndata==0.6.22post1

ANNDATA PROCESSING CONDA ENVIRONMENT

python==3.8.1
numpy==1.18.1
pandas==1.0.1
numba==0.48.0
anndata==0.7.1
scanpy==1.4.5.1

Issue Analytics

State:
Created 4 years ago
Comments:15 (7 by maintainers)

Top GitHub Comments

2reactions

bkmartinjrcommented, Mar 10, 2020

I believe I understand the issue for @GMaciag, and it may very well be the issue for @shahnirav1005

Background: scanpy and anndata provide backward compatibility, but not forward compatibility. H5AD created by old anndata versions can be read by newer versions, but not vice versa.

In other words:

H5AD created with anndata version “N” are readable by anndata version “N+1” (and later)
H5AD created by anndata version “N+1” are not normally readable by anndata version “N”

In @GMaciag case, you are using a more recent anndata to create the data, and reading it with an old one.

cellxgene 0.14.1 mandates a slightly older version because the latest anndata was incompatible with cellxgene. So we pinned the version dependency.

You can fix this in two ways:

Create the H5AD using the 0.6.22post1 version of anndata (and correspondingly older scanpy)
Wait a few days, and we will release a new version of cellxgene that fixes this (works with the latest anndata/scanpy).

Apologies for the inconvenience!

1reaction

bkmartinjrcommented, Mar 10, 2020

We will announce the release on the CZI Science slack site (#cellxgene-users channel)

Top Results From Across the Web

Users struggle to convert seurat -> anndata or h5ad (which we ...

I tried running my local cellxgene with loom and RDS files and got Error: Dataset does not have an allowed type. Is this...

cellxgene - PyPI

The launch command assumes that the data is stored in the .h5ad ... adding the contents of a csv file with metadata to...

Cellxgene VIP unleashes full power of interactive visualization ...

First, export the following from Seurat object in R: expression matrix (assume normalized), metadata and coordinates (pca, tsne, umap) as separate txt files....

h5ad cellxgene to R - Biostar

Converts to h5seurat, but runs into error when reading in the h5seurat file. 2 R anndata library(anndata) g <- read_h5ad("local.h5ad") ...

Exploration with cellxgene

As input, it takes an h5ad file (AnnData) that contains a pre-computed low-dimensional embedding (e.g., tSNE or UMAP) and, optionally, additional metadata ......