I will begin posting again on the BRAIN initiative notes running webpage/blog that I started while ago. There is a lot going on in neuroscience and many questions of how it relates to society at large. Below I have included the first of the new posts that goes over methods for identifying cells and reconstructing neural activity traces from calcium imaging movies.
Find article on BRAIN webpage at Calcium imaging cell identification and fluorescence activity trace reconstruction, part 1.
Look forward to more postings!
Find the posting on the BRAIN initiative notes webpage at Calcium imaging cell identification and fluorescence activity trace reconstruction, part 1.
Large-scale calcium (Ca2+) imaging, using the change in Ca2+ concentration inside neurons as a relatively direct measure of neural activity, has become an essential tool for neuroscientists to probe neural ensemble activity and identify coding principles the brain uses to process sensory information, store memories, and produce behaviors (Hamel et al., 2015). Originally, chemical fluorescent dyes such as OGB-1 allowed experimenters to measure Ca2+ concentration in neurons and thus neuronal activity (Paredes et al., 2008). These synthetic dyes allowed researchers to begin studying detailed intracellular Ca2+ dynamics along with activity of multiple neurons at once. However, the dyes had drawbacks: they did not allow long-term (weeks to months) imaging of neural activity, dye loading could be problematic in certain brain cell types (e.g. those with thick cell walls) or regions (deep brain structures), dyes would target non-specifically to various cellular compartments/organelles and would leak out of cells, and they could not be targeted in a genetically defined manner (though there is limited indication that this might be possible (Tour et al., 2007)). On the other hand, genetically encoded Ca2+ indicators (GECIs) have helped expand the types of experiments and questions that can be answered using Ca2+ imaging (Chen et al., 2013; Dana et al., 2018; Mank et al., 2008) by addressing many of the issues with the Ca2+ dyes: using adeno-associated and other viruses, they can be expressed chronically from weeks to years ((Ziv et al., 2013) and personal experiments/observations); specific promoters upstream of the GECI gene can lead to expression in specific cell types; tagging the GECI with certain amino acid sequences can enable targeting to specific regions of the cell (e.g. the axon terminal (Broussard et al., 2018; Dreosti et al., 2009)); and many other advantages. However, once experimenters had collected their Ca2+ imaging movies, they faced a formidable challenge in extracting both cell locations/shapes and their associated fluorescence activity traces from each of these movies.
One method of obtaining cell shapes and activity traces is via manual identification and selection of regions-of-interest (ROI) by humans and then taking the average pixel value within said ROI on each frame to reconstruct the activity trace for that cell (ROI) during the entire movie. The manual identification can take a lot of time, requires humans to browse through the entire movie frame-by-frame, and can lead to false negatives especially for low signal-to-noise ratio cells. The later is especially true for one-photon movies and modern GECIs where the baseline is too low to allow cells to be seen except for when they are active. Beyond missing cells, the ROI approach also tends to introduce crosstalk by contaminating each cell’s activity trace with fluorescent signals from nearby or overlapping cells (while cells themselves do not physically overlap, this occurs due to optical limitations and the fact that many Ca2+ imaging datasets are 2D, which leads to perceived spatial overlap when a 3D volume is flattened), background (e.g. neuropil from axons and dendrites), blood vessels, and other sources. Thus, a good Ca2+ imaging cell extraction algorithm needs to meet several criteria to allow biologists to obtain high-quality data from their movies:
- (1) be automated—reduces time needed to process data, especially as dataset sizes grow, and removes a degree of human introduced variability from manual ROI selection;
- (2) scale well with dataset sizes—this is not required, but preferred as large-scale Ca2+ imaging videos are becoming more widespread;
- (3) maintain high fidelity—finding all Ca2+ events associated with a particular cell;
- (4) have low crosstalk—minimize number of Ca2+ transients or other fluorescent signals in a cell’s activity trace that are not from that cell.
To solve the above problems and meet those criteria, many groups have developed and published new methods. An early popular method used Principal Component Analysis followed by Independent Component Analysis (PCA-ICA) (Mukamel et al., 2009); this was a great step forward and met several "good algorithm" criteria. However, it could fail in cases with a lot of correlated background noise or in regions of high cell density and activity. Over the next several years, additional techniques were published to tackle the problem and among these non-negative matrix factorization (NMF) (Pnevmatikakis et al., 2014; Maruyama et al., 2014) emerged as a promising candidate to meet many of the ”good algorithm” criteria. After publication, variants and enhancements of the method (CNMF, sc-CNMF, CNMF-E, etc.) sought to improve the cell activity traces by integrating Ca2+ dynamics into the model (constrained NMF—CNMF) or to compensate for issues in the original algorithm (e.g. handling of correlated background activity as seen in miniature microscope movies, see CNMF-E). In addition, there is an emerging focus on online (e.g. close as possible to real-time) processing of Ca2+ imaging data for closed-loop experiments and also methods to improve compression of data (Buchanan et al., 2018; Giovannucci et al., 2017).
Beyond those algorithms discussed above, a variety of other techniques have been released that either directly allow experimenters to simultaneously obtain cell images and activity traces or that focus on detecting cells or improving already acquired activity trace accuracy. For example, several publications have initial or limited methods for post-hoc automated removal of false positive signals (e.g. PCA-ICA, CNMF, etc. all output signal sources that upon manual inspection turn out to not be cells or other biologically relevant signals [e.g. dendrites]); look for this to be tackled to a greater degree going forward (I will have more on this in the future).
To help new experimenters gain a clearer picture of the landscape, see below table for a list of many different Ca2+ imaging cell extraction and activity trace reconstruction algorithms. This table will be updated to include any missing algorithms and further discussion about Ca2+ imaging analysis will take place in future posts.
Method | Notes | Source |
ROI | Matrix multiplication; sometimes neuropil/background subtraction. | Kerr et al. 2005; Kuchibhotla et al. 2014; Peron et al. 2015; Romano et al. 2017 |
PCA-ICA | Principal Component Analysis then Independent Component Analysis. | Mukamel et al. 2009 |
CIRF, calcium-behavior | Regressive model to obtain Ca2+ signal based on behavior. | Miri et al. 2011 |
Automated ROI analysis | Automatic ellipses based ROI detection. | Francis et al. 2012 |
ADINA | Sparse dictionary learning. | Diego et al. 2013 |
NMF | Nonnegative matrix factorization (NMF). | Pnevmatikakis et al. 2014; Maruyama et al. 2014 |
Suite2p | Generative model. | Pachitariu et al. 2016 |
CNMF (CaImAn) | Constrained NMF (CNMF). | Pnevmatikakis et al. 2016 |
CNMF-E | CNMF + background model. | Zhou et al. 2016, 2018 |
Apthorpe CNN | Convolutional neural network (CNN). | Apthorpe et al. 2016 |
sc-CNMF | CNMF + GMM/RNN seed cleansing. | Lu et al. 2017 |
OASIS | Generalized pool adjacent violators algorithm. | Friedrich et al. 2017 |
ABLE | Active contours. | Reynolds et al. 2017 |
SCALPEL | Dictionary learning, dissimilarity, and clustering. | Petersen et al. 2017 |
HNCcorr | Combinatorial optimization (correlation space analysis). | Spaen et al. 2017 |
OnACID | NMF variant for online Ca2+ imaging processing. | Giovannucci et al. 2017 |
EXTRACT | Robust statistical estimation. | Inan et al. 2017 |
LSSC | Spectral clustering; variant to find local subset of eigenvectors. | Mishne et al. 2018 |
PMD - PCA | Spatially-localized penalized matrix decomposition for denoising, compression, and improved demixing. | Buchanan et al. 2018 |
MIN1PIPE | Pre-processing + CNMF. | Lu et al. 2018 |
CaImAn | CNMF + several other processing tools. | Giovannucci et al. 2018 |
SEUDO | Mixture of Gaussians + maximum likelihood; post-hoc activity trace correction. | Gauthier et al. 2018 |
Allen Institute ROI | ROIs detected via adaptive thresholding and morphological operations; traces improved with neuropil subtraction and demixing. | de Vries et al. 2018 |
—CELLMax | Maximum likelihood. | Ahanonu et al. 2018, 2017 |