Statistical tests for large tree-structured data

Bharath, Karthik and Kambadur, Prabhanjan and Dey, Dipak. K. and Rao, Arvind and Baladandayuthapani, Veerabhadran (2016) Statistical tests for large tree-structured data. Journal of the American Statistical Association . ISSN 1537-274X (In Press)

[img] PDF - Repository staff only until 7 April 2018. - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Download (824kB)

Abstract

We develop a general statistical framework for the analysis and inference of large tree-structured data, with a focus on developing asymptotic goodness-of-fit tests. We first propose a consistent statistical model for binary trees, from which we develop a class of invariant tests. Using the model for binary trees, we then construct tests for general trees by using the distributional properties of the Continuum Random Tree, which arises as the invariant limit for a broad class of models for tree-structured data based on conditioned Galton–Watson processes. The test statistics for the goodness-of-fit tests are simple to compute and are asymptotically distributed as χ2 and F random variables. We illustrate our methods on an important application of detecting tumour heterogeneity in brain cancer. We use a novel approach with tree-based representations of magnetic resonance images and employ the developed tests to ascertain tumor heterogeneity between two groups of patients.

Item Type: Article
Schools/Departments: University of Nottingham, UK > Faculty of Science > School of Mathematical Sciences
Identification Number: 10.1080/01621459.2016.1240081
Depositing User: Bharath, Karthik
Date Deposited: 24 Feb 2017 08:38
Last Modified: 12 Oct 2017 14:39
URI: http://eprints.nottingham.ac.uk/id/eprint/40800

Actions (Archive Staff Only)

Edit View Edit View