Difference between revisions of "Projects:ParticlesForShapesAndComplexes"

From NAMIC Wiki
Jump to: navigation, search
Line 14: Line 14:
  
 
== Particle-Based Correspondence Optimization ==
 
== Particle-Based Correspondence Optimization ==
The proposed correspondence optimization method is to construct a point-based sampling of a shape ensemble that
+
The proposed correspondence optimization method is to construct a point-based sampling of a shape ensemble that simultaneously maximizes both the geometric accuracy and the statistical simplicity of the model. Surface point samples, which also define the shape-to-shape correspondences, are modeled as sets of dynamic particles that are constrained to lie on a set of implicit surfaces. Sample positions are optimized by gradient descent on an energy
simultaneously maximizes both the geometric accuracy and the statistical simplicity of the model. Surface point
 
samples, which also define the shape-to-shape correspondences, are modeled as sets of dynamic particles that are
 
constrained to lie on a set of implicit surfaces. Sample positions are optimized by gradient descent on an energy
 
 
function that balances the negative entropy of the distribution on each shape with the positive entropy of the ensemble of shapes. We have also extended the method with a curvature-adaptive sampling strategy in order to better approximate the geometry of the objects.
 
function that balances the negative entropy of the distribution on each shape with the positive entropy of the ensemble of shapes. We have also extended the method with a curvature-adaptive sampling strategy in order to better approximate the geometry of the objects.
  
Line 44: Line 41:
 
In the mouse phenotyping work of the previous section, we described a statistical analysis primarily based on univariate shape measures. In this work we expand our hypothesis testing methodology to incorporate multivariate shape features, and explore methods to aid in the interpretation of statistical results and group differences. Analysis of point-based shape models is difficult because point-wise statistical tests are require multiple-comparisons corrections that significantly reduce statistical power.  Analysis in the full shape space, however, is problematic due to the high number of dimensions and the difficulty of obtaining sufficient samples.  A common solution is to employ dimensionality reduction reduction by choosing a subspace in which to project the data for traditional multivariate analysis, such as a nonparametric Hotelling T2 test.  Dimensionality reduction, however, requires both the choice of the type of basis function to use, and also the basis vector, and the choice of ''how many'' basis vectors to use. This latter consideration can be hard to resolve when the choice of different numbers of factors leads to different statistical results.
 
In the mouse phenotyping work of the previous section, we described a statistical analysis primarily based on univariate shape measures. In this work we expand our hypothesis testing methodology to incorporate multivariate shape features, and explore methods to aid in the interpretation of statistical results and group differences. Analysis of point-based shape models is difficult because point-wise statistical tests are require multiple-comparisons corrections that significantly reduce statistical power.  Analysis in the full shape space, however, is problematic due to the high number of dimensions and the difficulty of obtaining sufficient samples.  A common solution is to employ dimensionality reduction reduction by choosing a subspace in which to project the data for traditional multivariate analysis, such as a nonparametric Hotelling T2 test.  Dimensionality reduction, however, requires both the choice of the type of basis function to use, and also the basis vector, and the choice of ''how many'' basis vectors to use. This latter consideration can be hard to resolve when the choice of different numbers of factors leads to different statistical results.
  
For our multivariate metric of shape, we propose to use a standard, data-driven approach to dimensionality
+
For our multivariate metric of shape, we propose to use a standard, data-driven approach to dimensionality reduction and project the correspondences into a lower dimensional space determined by choosing a number of basis vectors from principal component analysis (PCA).  Ideally, we would like to choose only PCA modes that account for variance that cannot be explained by random noise.  ''Parallel analysis'' is commonly recommended for this purpose [see, e.g., L W Glorfeld, Educational and Psych. Measurement, V. 55, 1955], which works by comparing the percent variances of each of the PCA modes with the average percent variances obtained via PCA of Monte Carlo simulations of samplings from isotropic, multivariate, unit Gaussian distributions.  We choose only modes with greater variance than the simulated modes for the dimensionality reduction, and use use a standard, parametric Hotelling T2 test to test for group differences, with the null hypothesis that the two groups are drawn from the same distribution.
reduction and project the correspondences into a lower dimensional space determined by choosing a number of basis vectors from principal component analysis (PCA).  Ideally, we would like to choose only PCA modes that account for variance that cannot be explained by random noise.  ''Parallel analysis'' is commonly recommended for this purpose [see, e.g., L W Glorfeld, Educational and Psych. Measurement, V. 55, 1955], which works by comparing the percent variances of each of the PCA modes with the average percent variances obtained via PCA of Monte Carlo simulations of samplings from isotropic, multivariate, unit Gaussian distributions.
+
 
 +
To visualize group differences that are driving the statistical result, we compute the linear discriminant vector implicit in the Hotelling T2 statistic, which is the is the line along which the between-class variance is maximized with respect to the within-class variance (Fisher’s linear discriminant).  The discriminant vector can be rotated back
 +
from PCA space into the full dimensional shape space, and then mapped onto the mean group shape visualizations to give an indication of the significant morphological differences between groups.  Figure 3 shows a linear discriminant visualization on a mean surface of the P2 bone from the ''Hoxd11'' study in the previous section. 
 +
 
 +
To visualize deformations between the group mean shapes, we can compute metrics on the displacement field describing the mapping from points x on one group mean to corresponding points x0 on the another. Using the set of correspondence points, a smooth transformation T(x) = x0, can be computed using a thin-plate spline interpolation.  We propose to visualize strain, a measure on the Jacobian of the deformation field x − T(x) that describes the local stretching and compression caused by the deformation. An effective visualization for the strain tensor is an ellipsoid with principal axes given by the principal directions and magnitude of strain. Figure 4 is a visualization of the strain tensors computed from the deformation from the mean patient shape to the mean normal control shape for putamen shape data.
  
 
= Key Investigators =
 
= Key Investigators =

Revision as of 00:57, 2 October 2008

Home < Projects:ParticlesForShapesAndComplexes

Back to NA-MIC Collaborations, Utah Algorithms

Shape Modeling and Analysis with Particle Systems

Overview

This work addresses technical challenges in biomedical shape analysis through the development of novel modeling and analysis methodologies, and seeks validation of those methodologies by their application to real-world research problems. The main focus of the work is the development and validation of a new framework for constructing compact statistical point-based models of ensembles of similar shapes that does not rely on any specific surface parameterization. The proposed optimization uses an entropy-based minimization that balances the simplicity of the model (compactness) with the accuracy of the surface representations. This framework is easily extended to handle more general classes of shape modeling, such as multiple-object complexes and correspondence based on functions of position. This work also addresses the issue of how to do hypothesis testing with the proposed modeling framework, since, to date, the shape analysis community has not reached a consensus regarding a systematic approach to statistical analysis with point-based models. Finally, another important issue that remains is how to visualize significant shape differences in a way that allows researchers to understand not only whether differences exist, but what those shape differences are. This latter consideration is obviously of importance in in relating shape differences to scientific hypotheses.

The following list is a summary of research and development results to date.

  • We have implemented a mathematical framework and a robust numerical algorithm implementation for computing optimized correspondence-point shape models using an entropy-based optimization and particle-system technology.
  • We have proposed a general methodology for hypothesis testing with point-based shape models that is suitable for use with the particle-based correspondence algorithm. Additionally, we have proposed ideas for visualization to aid in the interpretation of these shape statistics.
  • The particle-based correspondence algorithm and statistical analysis methodology have been extended to more general classes of shape analysis problems: (a) the analysis of multiple-object complexes and (b) the generalization to correspondences based on generic functions of position.
  • In cooperation with scientists and clinicians, we have published several papers that evaluate the above methodologies in the context of biomedical research.

Particle-Based Correspondence Optimization

The proposed correspondence optimization method is to construct a point-based sampling of a shape ensemble that simultaneously maximizes both the geometric accuracy and the statistical simplicity of the model. Surface point samples, which also define the shape-to-shape correspondences, are modeled as sets of dynamic particles that are constrained to lie on a set of implicit surfaces. Sample positions are optimized by gradient descent on an energy function that balances the negative entropy of the distribution on each shape with the positive entropy of the ensemble of shapes. We have also extended the method with a curvature-adaptive sampling strategy in order to better approximate the geometry of the objects.

Any set of implicitly defined surfaces, such as a set of binary segmentations, is appropriate as input to this framework. A binary mask, for example, contains an implicit shape surface at the interface of the labeled pixels and the background. Because the proposed method is completely generalizable to higher dimensions, we are able to process shapes in 2D and 3D using the same C++ software implementation, which is templated by dimension. Processing time on a 2GHz desktop machine scales linearly with the number of particles in the system and ranges from 20 minutes for a 2D system of a few thousand particles to several hours for a 3D system of tens of thousands of particles.

We have found that the proposed particle method compares favorably to state-of-the-art methods for optimizing surface point correspondences. Figure 1, for example, illustrates an experiment on a set hand shape contours that shows a favorable comparison with the Minimum Description Length (MDL) method of Davies, et al. [R Davies, et al., IEEE Trans Med Imaging, V. 21, No. 5, 2002]. In the figure, we also compare our results with a set of ideal, manually selected correspondences, which introduce anatomical knowledge of the digits. Principal component analysis shows that the proposed particle method and the manually selected points both produce very similar models, while the MDL model differs significantly.

Existing 3D MDL implementations rely on spherical parameterizations, and are therefore only capable of analyzing shapes topologically equivalent to a sphere. The particle-based method does not have this limitation. As a demonstration of this, we applied the proposed method to a set of 25 randomly chosen tori, drawn from a 2D, normal distribution parameterized by the small radius r and the large radius R. An analysis of the resulting model showed that the particle system method discovered the two pure modes of variation, with only 0.08% leakage into smaller modes.

Further preliminary experiments on 2D synthetic data and 3D medical image data (hippocampus shapes) are described in [1] and [2], but are omitted here for brevity. Figure 2 is taken from this work, for example, and illustrates two principal components of a model of the hippocampus.

Particle-Based Correspondences for Genetic Phenotyping

In cooperation with researchers from the Mario Capecchi lab at the University of Utah, we applied the proposed shape modeling framework to a phenotypic study of several of the forelimb bones of mice with a targeted disruption of the Hoxd11 gene, which Boulet, Davis and Capecchi have previously shown is important for the normal development and patterning of the appendicular skeleton [A. Davis, et al., Development, V. 120, 1995]. Our phenotyping study serves as a further evaluation of the particlebased technology described in the previous section, and is among the first to explore the use of high-dimensional, point-based shape modeling and analysis for small animal phenotyping.

Our study focused on three forelimb bones segmented from the CT volumes of 20 male Hoxd11 -/- (mutant) mice and 20 male wild-type mice: the metacarpal (MC), the first phalange (P1), and the second phalange (P2) of digit 2 of the right forepaw of the mouse. We computed combined models (mutant + wild type) of each of the three bones using the particle-system method described in the previous section for each of the three bones of interest, using 1024 correspondence points per shape. The results from the Hoxd11 study suggest that point-based shape modeling can be an effective tool for the study of mouse skeletal phenotype. We detected global and local group differences between normals and the Hoxd11 mutants that are not observable with traditional metrics, and showed how we can use general linear models to control for covariates in global, univariate shape statistics.

The results of this study are published in [3].

Hypothesis Testing and Visualization

In the mouse phenotyping work of the previous section, we described a statistical analysis primarily based on univariate shape measures. In this work we expand our hypothesis testing methodology to incorporate multivariate shape features, and explore methods to aid in the interpretation of statistical results and group differences. Analysis of point-based shape models is difficult because point-wise statistical tests are require multiple-comparisons corrections that significantly reduce statistical power. Analysis in the full shape space, however, is problematic due to the high number of dimensions and the difficulty of obtaining sufficient samples. A common solution is to employ dimensionality reduction reduction by choosing a subspace in which to project the data for traditional multivariate analysis, such as a nonparametric Hotelling T2 test. Dimensionality reduction, however, requires both the choice of the type of basis function to use, and also the basis vector, and the choice of how many basis vectors to use. This latter consideration can be hard to resolve when the choice of different numbers of factors leads to different statistical results.

For our multivariate metric of shape, we propose to use a standard, data-driven approach to dimensionality reduction and project the correspondences into a lower dimensional space determined by choosing a number of basis vectors from principal component analysis (PCA). Ideally, we would like to choose only PCA modes that account for variance that cannot be explained by random noise. Parallel analysis is commonly recommended for this purpose [see, e.g., L W Glorfeld, Educational and Psych. Measurement, V. 55, 1955], which works by comparing the percent variances of each of the PCA modes with the average percent variances obtained via PCA of Monte Carlo simulations of samplings from isotropic, multivariate, unit Gaussian distributions. We choose only modes with greater variance than the simulated modes for the dimensionality reduction, and use use a standard, parametric Hotelling T2 test to test for group differences, with the null hypothesis that the two groups are drawn from the same distribution.

To visualize group differences that are driving the statistical result, we compute the linear discriminant vector implicit in the Hotelling T2 statistic, which is the is the line along which the between-class variance is maximized with respect to the within-class variance (Fisher’s linear discriminant). The discriminant vector can be rotated back from PCA space into the full dimensional shape space, and then mapped onto the mean group shape visualizations to give an indication of the significant morphological differences between groups. Figure 3 shows a linear discriminant visualization on a mean surface of the P2 bone from the Hoxd11 study in the previous section.

To visualize deformations between the group mean shapes, we can compute metrics on the displacement field describing the mapping from points x on one group mean to corresponding points x0 on the another. Using the set of correspondence points, a smooth transformation T(x) = x0, can be computed using a thin-plate spline interpolation. We propose to visualize strain, a measure on the Jacobian of the deformation field x − T(x) that describes the local stretching and compression caused by the deformation. An effective visualization for the strain tensor is an ellipsoid with principal axes given by the principal directions and magnitude of strain. Figure 4 is a visualization of the strain tensors computed from the deformation from the mean patient shape to the mean normal control shape for putamen shape data.

Key Investigators

Joshua Cates, Tom Fletcher, Ross Whitaker

Publications

  1. J Cates, PT Fletcher, and R Whitaker, Entropy-based particle systems for shape correspondence, in Mathematical Foundations of Computational Anatomy Workshop. 2006, pp. 90–99, MICCAI.
  2. J Cates, P T Fletcher, M Styner, M Shenton, and R Whitaker, Shape modeling and analysis with entropy-based particle systems, in Information Processing in Medical Imaging (IPMI 2007), LNCS 4584, 2007, pp. 333–345.
  3. J Cates, P T Fletcher, Z Warnock, and R Whitaker, A shape analysis framework for small animal phenotyping with application to mice with a targeted disruption of Hoxd11, in Proc. 5th IEEE International Symposium on Biomedical Imaging (ISBI ’08), 2008, pp. 512–516.
  4. J Cates, PT Fletcher, and R Whitaker, “A hypothesis testing framework for high-dimensional shape models,” in Mathematical Foundations of Computational Anatomy Workshop. 2008, pp. 170–181, MICCAI.
  5. I Oguz, J Cates, P T Fletcher, Z Warnock, R Whitaker, D Cool, S Aylward, and M Styner, “Cortical correspondence using entropy-based particle systems and local featrues,” in Proc. 5th IEEE International Symposium on Biomedical Imaging (ISBI ’08), 2008, pp. 1637–1640.
  6. J Cates, PT Fletcher, M Styner, H Hazlett, and R Whitaker, “Particle-based shape analysis of multi-object complexes,” in Proceedings of the 11th International Conference on Medical Image Computing and Computer Assisted Intervention. (MICCAI 08).