IDENTIFICATION AND ANALYSIS OF CONFOUNDING VARIABLES AND SIMPSON’S PARADOX

Chattopadhyay, Ishani (2022) IDENTIFICATION AND ANALYSIS OF CONFOUNDING VARIABLES AND SIMPSON’S PARADOX. [Dissertation (University of Nottingham only)]

Abstract

This dissertation uses machine learning algorithms and model-agnostic tools to explore the counterintuitive relationship between protein intake from legumes and pass rates in Malawi. It is an exploratory analysis of approaches to creating sub-groups with the K-means clustering algorithm in order to identify Simpson’s Paradox. The negative relationship between protein intake from legumes and pass rates is addressed by identifying confounders using logistic regression and chi-square tests. A Random Forest model and Partial Dependency Plots are then used to study the relationship within sub-groups of the confounders, in order to isolate the effect of these confounders.
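
The sign reversal that defines Simpson’s Paradox can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the data are synthetic (not the Malawi data), the group labels stand in for sub-groups that might come from K-means clustering, and the helper names (ols_slope, slopes_by_group, shows_sign_reversal) are hypothetical. A plain least-squares slope is used here in place of the Random Forest and Partial Dependency Plots described in the abstract.

```python
from collections import defaultdict


def ols_slope(xs, ys):
    """Least-squares slope of y on x (assumes x is not constant)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


def slopes_by_group(rows):
    """Slope of y on x computed separately inside each sub-group."""
    groups = defaultdict(lambda: ([], []))
    for group, x, y in rows:
        groups[group][0].append(x)
        groups[group][1].append(y)
    return {g: ols_slope(xs, ys) for g, (xs, ys) in groups.items()}


def shows_sign_reversal(rows):
    """True if every sub-group slope has the opposite sign to the pooled slope."""
    pooled = ols_slope([r[1] for r in rows], [r[2] for r in rows])
    within = slopes_by_group(rows)
    return all(s * pooled < 0 for s in within.values())


# Synthetic rows: (sub-group label, legume protein intake, pass rate).
# Group "B" has higher intake but a lower baseline pass rate, as a
# confounder such as poverty might produce.
rows = [
    ("A", 1, 60), ("A", 2, 62), ("A", 3, 64), ("A", 4, 66),
    ("B", 6, 30), ("B", 7, 32), ("B", 8, 34), ("B", 9, 36),
]

pooled_slope = ols_slope([r[1] for r in rows], [r[2] for r in rows])
group_slopes = slopes_by_group(rows)
print("pooled slope:", pooled_slope)
print("sub-group slopes:", group_slopes)
print("sign reversal:", shows_sign_reversal(rows))
```

In this toy data the pooled slope is negative while the slope inside each sub-group is positive, which is the pattern the sub-group analysis is meant to detect.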

This dissertation follows a waterfall method: whenever a sub-group indicates a negative relationship between legumes and pass rates, the analysis goes a level deeper to identify further confounders. The dissertation attempts to explain certain trends in the relationship and suggests possible ways to understand the problem. The analysis helps identify areas that could be explored further in order to provide better amenities and improve the standard of living in the poorer areas of Malawi.

Keywords: Simpson’s Paradox, Partial Dependency Plots, Individual Conditional Expectation Plot, K-means clustering, counter-intuitive behaviour, confounder identification, confounder analysis.

Item Type: Dissertation (University of Nottingham only)
Depositing User: Chattopadhyay, Ishani
Date Deposited: 06 Jul 2023 11:48
Last Modified: 06 Jul 2023 11:48
URI: https://eprints.nottingham.ac.uk/id/eprint/70466
