TY - JOUR
T1 - phenopype
T2 - A phenotyping pipeline for Python
AU - Lürig, Moritz D.
N1 - Funding Information:
I conceived in its current form during a laboratory retreat in Vna (Graubünden, Switzerland) that was organized and funded by Jukka Jokela. Its implementation was made possible by Blake Matthews and the Eawag directorate (Discretionary Funding Grant No. 5221.00492.013.11). Additional funding came from the Swiss National Science Foundation through an Early Postdoc. Mobility Fellowship (Grant No. P2EZP3_191804) and from the European Union's Horizon 2020 research and innovation programme through a Marie Skłodowska‐Curie IF (Grant No. 898932). I thank Kim Kaltenbach for being a patient alpha tester, and Cam Hudson, Ryan Greenway, Andres Grolimund, Nare Ngoepe, Anja Merz and Irene Gallego for being helpful beta testers. I would also express my sincere gratitude to Arthur Porto and Seth Donoughe whose comprehensive review for the consortium greatly improved the presentation and documentation of the package. Two anonymous reviewers provided very constructive feedback during review of the manuscript. The stickleback image for 's logo was taken by Angelina Arquint. Finally, this package may not have come into existence without Matt McGee, who encouraged me to learn Python and use it for computer vision. phenopype pyOpenSci phenopype
Publisher Copyright:
© 2021 The Authors. Methods in Ecology and Evolution published by John Wiley & Sons Ltd on behalf of British Ecological Society
PY - 2022
Y1 - 2022
N2 - Digital images are an intuitive way to capture, store and analyse organismal phenotypes. Many biologists are taking images to collect high-dimensional phenotypic information from specimens to investigate complex ecological, evolutionary and developmental phenomena, such as relationships between trait diversity and ecosystem function, multivariate natural selection or developmental plasticity. As a consequence, images are being collected at ever-increasing rates, but extraction of the contained phenotypic information poses a veritable analytical bottleneck. phenopype is a high-throughput phenotyping pipeline for the programming language Python that aims at alleviating this bottleneck. The package facilitates immediate extraction of high-dimensional phenotypic data from digital images with low levels of background noise and complexity. At the core, phenopype provides functions for rapid signal processing-based image preprocessing and segmentation, data extraction, as well as visualization and data export. This functionality is provided by wrapping low-level computer vision libraries (such as OpenCV) into accessible functions to facilitate scientific image analysis. In addition, phenopype provides a project management ecosystem to streamline data collection and to increase reproducibility. phenopype offers two different workflows that support users during different stages of scientific image analysis. The low-throughput workflow uses regular Python syntax and has greater flexibility at the cost of reproducibility, which is suitable for prototyping during the initial stages of a research project. The high-throughput workflow allows users to specify and store image-specific settings for analysis in human-readable YAML format, and then execute all functions in one step by means of an interactive parser. This approach facilitates rapid program-user interactions during batch processing, and greatly increases scientific reproducibility. Overall, phenopype intends to make the features of powerful but technically involved low-level CV libraries available to biologists with little or no Python coding experience. Therefore, phenopype is aiming to augment, rather than replace the utility of existing Python CV libraries, allowing biologists to focus on rapid and reproducible data collection. Furthermore, image annotations produced by phenopype can be used as training data, thus presenting a stepping stone towards the application of deep learning architectures.
AB - Digital images are an intuitive way to capture, store and analyse organismal phenotypes. Many biologists are taking images to collect high-dimensional phenotypic information from specimens to investigate complex ecological, evolutionary and developmental phenomena, such as relationships between trait diversity and ecosystem function, multivariate natural selection or developmental plasticity. As a consequence, images are being collected at ever-increasing rates, but extraction of the contained phenotypic information poses a veritable analytical bottleneck. phenopype is a high-throughput phenotyping pipeline for the programming language Python that aims at alleviating this bottleneck. The package facilitates immediate extraction of high-dimensional phenotypic data from digital images with low levels of background noise and complexity. At the core, phenopype provides functions for rapid signal processing-based image preprocessing and segmentation, data extraction, as well as visualization and data export. This functionality is provided by wrapping low-level computer vision libraries (such as OpenCV) into accessible functions to facilitate scientific image analysis. In addition, phenopype provides a project management ecosystem to streamline data collection and to increase reproducibility. phenopype offers two different workflows that support users during different stages of scientific image analysis. The low-throughput workflow uses regular Python syntax and has greater flexibility at the cost of reproducibility, which is suitable for prototyping during the initial stages of a research project. The high-throughput workflow allows users to specify and store image-specific settings for analysis in human-readable YAML format, and then execute all functions in one step by means of an interactive parser. This approach facilitates rapid program-user interactions during batch processing, and greatly increases scientific reproducibility. Overall, phenopype intends to make the features of powerful but technically involved low-level CV libraries available to biologists with little or no Python coding experience. Therefore, phenopype is aiming to augment, rather than replace the utility of existing Python CV libraries, allowing biologists to focus on rapid and reproducible data collection. Furthermore, image annotations produced by phenopype can be used as training data, thus presenting a stepping stone towards the application of deep learning architectures.
KW - automation
KW - computer vision
KW - image analysis
KW - image segmentation
KW - phenomics
KW - phenotype
KW - toolbox
KW - trait measurement
U2 - 10.1111/2041-210X.13771
DO - 10.1111/2041-210X.13771
M3 - Article
AN - SCOPUS:85120485237
SN - 2041-210X
VL - 13
SP - 569
EP - 576
JO - Methods in Ecology and Evolution
JF - Methods in Ecology and Evolution
IS - 3
ER -