anndata

anndata is a Python package for handling annotated data matrices in memory and on disk, positioned between pandas and xarray. anndata offers a broad range of computationally efficient features including, among others, sparse data support, lazy operations, and a PyTorch interface.

4
contributors
Get started
1779 commitsLast commit ≈ 6 days ago667 stars176 forks

Description

Tests Conda Coverage Docs PyPI Downloads Downloads Stars Powered by NumFOCUS

anndata - Annotated data

anndata is a Python package for handling annotated data matrices in memory and on disk, positioned between pandas and xarray. anndata offers a broad range of computationally efficient features including, among others, sparse data support, lazy operations, and a PyTorch interface.

anndata is part of the scverse® project (website, governance) and is fiscally sponsored by NumFOCUS. If you like scverse® and want to support our mission, please consider making a tax-deductible donation to help the project pay for developer time, professional services, travel, workshops, and a variety of other needs.

Public API

Our public API is documented in the API section of these docs. We cannot guarantee the stability of our internal APIs, whether it's the location of a function, its arguments, or something else. In other words, we do not officially support (or encourage users to do) something like from anndata._core import AnnData as _core is both not documented and contains a leading underscore. However, we are aware that many users do use these internal APIs and thus encourage them to open an issue or migrate to the public API. That is, if something is missing from our public API as documented, for example a feature you wish to be exported publicly, please open an issue.

Citation

If you use anndata in your work, please cite the anndata publication as follows:

anndata: Annotated data

Isaac Virshup, Sergei Rybakov, Fabian J. Theis, Philipp Angerer, F. Alexander Wolf

JOSS 2024 Sep 16. doi: 10.21105/joss.04371.

You can cite the scverse publication as follows:

The scverse project provides a computational ecosystem for single-cell omics data analysis

Isaac Virshup, Danila Bredikhin, Lukas Heumos, Giovanni Palla, Gregor Sturm, Adam Gayoso, Ilia Kats, Mikaela Koutrouli, Scverse Community, Bonnie Berger, Dana Pe’er, Aviv Regev, Sarah A. Teichmann, Francesca Finotello, F. Alexander Wolf, Nir Yosef, Oliver Stegle & Fabian J. Theis

Nat Biotechnol. 2023 Apr 10. doi: 10.1038/s41587-023-01733-8.

Logo of anndata
Keywords
Programming language
  • Python 100%
License
  • BSD-3-Clause
</>Source code
Packages
pypi.org
anaconda.org

Participating organisations

Helmholtz Zentrum München

Contributors

Related software

scanpy

SC

Scanpy is a scalable toolkit for analyzing single-cell gene expression data built jointly with anndata. It includes preprocessing, visualization, clustering, and differential expression testing. The Python-based implementation efficiently deals with datasets of more than one million cells.

Updated 26 months ago
1 3