phenopype: A phenotyping pipeline for Python

Research output: Contribution to journalArticlepeer-review

Abstract

Digital images are an intuitive way to capture, store and analyse organismal phenotypes. Many biologists are taking images to collect high-dimensional phenotypic information from specimens to investigate complex ecological, evolutionary and developmental phenomena, such as relationships between trait diversity and ecosystem function, multivariate natural selection or developmental plasticity. As a consequence, images are being collected at ever-increasing rates, but extraction of the contained phenotypic information poses a veritable analytical bottleneck. phenopype is a high-throughput phenotyping pipeline for the programming language Python that aims at alleviating this bottleneck. The package facilitates immediate extraction of high-dimensional phenotypic data from digital images with low levels of background noise and complexity. At the core, phenopype provides functions for rapid signal processing-based image preprocessing and segmentation, data extraction, as well as visualization and data export. This functionality is provided by wrapping low-level computer vision libraries (such as OpenCV) into accessible functions to facilitate scientific image analysis. In addition, phenopype provides a project management ecosystem to streamline data collection and to increase reproducibility. phenopype offers two different workflows that support users during different stages of scientific image analysis. The low-throughput workflow uses regular Python syntax and has greater flexibility at the cost of reproducibility, which is suitable for prototyping during the initial stages of a research project. The high-throughput workflow allows users to specify and store image-specific settings for analysis in human-readable YAML format, and then execute all functions in one step by means of an interactive parser. This approach facilitates rapid program-user interactions during batch processing, and greatly increases scientific reproducibility. Overall, phenopype intends to make the features of powerful but technically involved low-level CV libraries available to biologists with little or no Python coding experience. Therefore, phenopype is aiming to augment, rather than replace the utility of existing Python CV libraries, allowing biologists to focus on rapid and reproducible data collection. Furthermore, image annotations produced by phenopype can be used as training data, thus presenting a stepping stone towards the application of deep learning architectures.

Original languageEnglish
Pages (from-to)569-576
JournalMethods in Ecology and Evolution
Volume13
Issue number3
Early online date2021 Nov 19
DOIs
Publication statusPublished - 2022

Bibliographical note

Funding Information:
I conceived in its current form during a laboratory retreat in Vna (Graubünden, Switzerland) that was organized and funded by Jukka Jokela. Its implementation was made possible by Blake Matthews and the Eawag directorate (Discretionary Funding Grant No. 5221.00492.013.11). Additional funding came from the Swiss National Science Foundation through an Early Postdoc. Mobility Fellowship (Grant No. P2EZP3_191804) and from the European Union's Horizon 2020 research and innovation programme through a Marie Skłodowska‐Curie IF (Grant No. 898932). I thank Kim Kaltenbach for being a patient alpha tester, and Cam Hudson, Ryan Greenway, Andres Grolimund, Nare Ngoepe, Anja Merz and Irene Gallego for being helpful beta testers. I would also express my sincere gratitude to Arthur Porto and Seth Donoughe whose comprehensive review for the consortium greatly improved the presentation and documentation of the package. Two anonymous reviewers provided very constructive feedback during review of the manuscript. The stickleback image for 's logo was taken by Angelina Arquint. Finally, this package may not have come into existence without Matt McGee, who encouraged me to learn Python and use it for computer vision. phenopype pyOpenSci phenopype

Publisher Copyright:
© 2021 The Authors. Methods in Ecology and Evolution published by John Wiley & Sons Ltd on behalf of British Ecological Society

Subject classification (UKÄ)

  • Computer Vision and Robotics (Autonomous Systems)
  • Evolutionary Biology

Free keywords

  • automation
  • computer vision
  • image analysis
  • image segmentation
  • phenomics
  • phenotype
  • toolbox
  • trait measurement

Fingerprint

Dive into the research topics of 'phenopype: A phenotyping pipeline for Python'. Together they form a unique fingerprint.

Cite this