Vitessce supports several data types which denote in an abstract sense the type of observations contained in a file (e.g., matrix, dataframe, image).
For each data type, Vitessce may support multiple file types (
datasets.files.fileType) which denote specific schemas and file formats that Vitessce knows how to read.
For example, a file that conforms to the
obsEmbedding data type must contain embedding coordinates for each cell (assuming each observation represents a cell) computed via e.g. t-SNE or UMAP.
Depending on which file format is more convenient, you may choose either
Data Types and File Types
|Data Type||File Types||Convert from...|
Per-observation 2D embedding coordinates. Typically used to store dimensionality reductions performed on cell-by-biomarker expression matrices.
Spatially-resolved 2D coordinates. For example, individual RNA molecule x-y coordinates measured by FISH, or cell segmentation centroid coordinates.
Lists or hierarchies of sets of observations. For example, cell type annotations or unsupervised clustering results.
Per-observation string labels. For example, alternate cell identifiers.
Multi-scale multiplexed imaging data, including OME-TIFF files and OME-NGFF Zarr stores.
Observation-by-feature matrix. Typically used to store cell-by-gene expression matrices.
Per-feature string labels. For example, alternate gene identifiers.
Per-observation segmentation polygons or bitmasks. Typically used to store cell or organelle segmentations.
Genomic profiles, such as ATAC-seq profiles.
Joint File Types
A joint file type is a pseudo-file type (pseudo in the sense that it does not correspond to any one data type) which allows a single file definition (and therefore a single URL) in the Vitessce configuration to expand to define multiple files of the atomic (i.e., non-joint) types listed in the table above.
This is motivated by the fact that one file may store information corresponding to more than one data type.
For instance, AnnData files may store not only the obs-by-feature matrix (
but also multiple 2D embedding coordinates (
spatial coordinates (
and cell type annotations (e.g.,
Rather than defining five different files
(corresponding to the atomic file types
anndata.zarr joint file definition can be used instead.
Note that joint file type expansion is currently not performed recursively (i.e., a joint file type expansion function must return a list of atomic file definitions).