Anne E. Carpenter, Thouis Jones, Lee Kamentsky, Beth Cimini, Allen Goodman, Claire McQuin, Madison Swain-Bowden, David Stirling, Nodar Gogoberidze, and others (Broad Institute)
CellProfiler[1][2] is free, open-source software designed to enable biologists without training in computer vision or programming to quantitatively measure phenotypes from thousands of images automatically. Advanced algorithms for image analysis are available as individual modules that can be placed in sequential order together to form a pipeline; the pipeline is then used to identify and measure
biological objects and features in images, particularly those obtained through fluorescence microscopy.
CellProfiler can read and analyze most common microscopy image formats.[5] Biologists typically use CellProfiler to identify objects of interest (e.g. cells, colonies, C. elegans worms) and then measure their properties of interest.[6] Specialized modules for illumination correction may be applied as pre-processing step to remove distortions due to uneven lighting.[7] Object identification (segmentation) is performed through machine learning or image thresholding, recognition and division of clumped objects, and removal or merging of objects on the basis of size or shape.[8] Each of these steps are customizable by the user for their unique image assay.
A wide variety of measurements can be generated for each identified cell or subcellular compartment, including morphology, intensity, and texture among others. These measurements are accessible by using built-in viewing and plotting data tools, exporting in a comma-delimitedspreadsheet format,[9] or importing into a MySQL or SQLite database.[10]
CellProfiler interfaces with the high-performance scientific libraries NumPy and SciPy for many mathematical operations, the Open Microscopy Environment[11] Consortium’s Bio-Formats library for reading more than 100 image file formats, ImageJ for use of plugins and macros, and ilastik for pixel-based classification.[12] While designed and optimized for large numbers of two-dimensional images (the most common high-content screening image format), CellProfiler supports analysis of small-scale experiments and time-lapse movies.[13]
Originally developed in MATLAB,[14] it was re-written in Python and released as CellProfiler 2.0 in 2010.[2] Version 3.0, supporting volumetric analysis of 3D image stacks and optional deep learning modules, was released in October 2017.[16] CellProfiler 4.0 was released in September 2020 and focused on speed, usability, and utility improvements with most notable example of migration to Python 3.[17]
Community
Because CellProfiler is a free, open-source project, anyone can develop their own image processing algorithms as a new module for CellProfiler and contribute it to the project.[18] The CellProfiler website contains a forum for discussion where new users can have their questions answered, usually by the creators of the project.[19]