On the quantification of statistical significance of the extent of association projected on the margins of 2x2 tables, when only the aggregate data is available: A pseudo p-value approach - applied to leukaemia relapse data

Cheema, S, Beh, E and Hudson, I 2015, 'On the quantification of statistical significance of the extent of association projected on the margins of 2x2 tables, when only the aggregate data is available: A pseudo p-value approach - applied to leukaemia relapse data', in Proceedings of the 21st International Congress on Modelling and Simulation (MODSIM2015), Gold Coast, Australia, 29 November - 4 December 2015, pp. 1682-1688.


Document type: Conference Paper
Collection: Conference Papers

Title On the quantification of statistical significance of the extent of association projected on the margins of 2x2 tables, when only the aggregate data is available: A pseudo p-value approach - applied to leukaemia relapse data
Author(s) Cheema, S
Beh, E
Hudson, I
Year 2015
Conference name MODSIM2015
Conference location Gold Coast, Australia
Conference dates 29 November - 4 December 2015
Proceedings title Proceedings of the 21st International Congress on Modelling and Simulation (MODSIM2015)
Publisher The Modelling and Simulation Society of Australia and New Zealand
Place of publication Australia
Start page 1682
End page 1688
Total pages 7
Abstract Aggregate data arises in situations where survey research or other means of collecting individuallevel data are either infeasible or inefficient. The recent increasing use of aggregate data in the statistical and allied fields - including epidemiology, education and social sciences - has arisen due to number of reasons. These include the questionable reliability of estimates when sensitive information is required, the imposition of strict confidentiality policies on data by government and other organisational bodies and in some contexts it is impossible to collect the information that is needed. In this paper we present a novel approach to quantify the statistical significance of the extent of association that exists between two dichotomous variables when only the aggregate data is available. This is achieved by examining a newly developed index, called the aggregate association index (or the AAI), developed by Beh (2008 and 2010) which enumerates the overall extent of association about individuals that may exist at the aggregate level when individual level data is not available. The applicability of the technique is demonstrated by using leukaemia relapse data of Cave et al. (1998). This data is presented in the form of a contingency table that cross-classifies the follow up status of leukaemia relapse by whether cancer traces were found (or not) on the basis of polymerase child reaction (PCR) - a modern method used to detect cancerous cells in the body assumed superior than conventional for that period, microscopic identification. Assuming that the joint cell frequencies of this table are not available, and that the only available information is contained in the aggregate data, we first quantify the extent of association that exists between both variables by calculating the AAI. This index shows that the likelihood of association is high. As the AAI has been developed by exploiting Pearson's chi-squared statistics, the AAI inherently suffers from the well-known la
Subjects Statistics not elsewhere classified
Keyword(s) aggregate data
aggregate association index
pseudo p values
ecological inference
sample size
ISBN 9780987214355
Versions
Version Filter Type
Access Statistics: 4 Abstract Views  -  Detailed Statistics
Created: Fri, 05 Jul 2019, 12:33:00 EST by Catalyst Administrator
© 2014 RMIT Research Repository • Powered by Fez SoftwareContact us