Test process optimisation through big data analysis

Kho, Xiang Juan (2023) Test process optimisation through big data analysis. MPhil thesis, University of Nottingham.

[img] PDF (Thesis for reader access - any sensitive & copyright infringing material removed) - Repository staff only - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Download (2MB)


The goal of the research is to adopt artificial intelligence techniques for regression test case selection in an industrial setting. Currently, the selection of test cases is made manually by test engineers referring to software change documentation. A machine learning-based solution which selects test cases with high potential in uncovering software defects is proposed and results were analysed.

Past historical results of test cases and various metadata, such as the ID, name and priority of test cases, were used in data training. At the pre-processing stage, data were analysed, cleaned and normalised. Then, a two-part balancing method, comprising of outliers removal and resampling with an algorithm, was applied to the imbalanced data before it can be fitted to various machine learning models.

The model that best fits the system requirement, which recommends at most 50% of the total test cases with no false negative predictions and as few false positive predictions as possible, is selected to be implemented as an executable application. The finalised model, based on random forest, recommends 1,626 test cases (7.35% of total test cases) for execution with no false negative and 882 false positive predictions (3.98% of total test cases) out of 22,137 test cases. This fulfils the two objectives of this research, which is to construct a recommendation system capable of reducing at least 50% of all test cases, and to ensure that there are no false negative and minimal false positive predictions.

The current test system is a black-box system, meaning the software’s functionalities are tested without accessing its internal code structure. The model’s performance can be improved with a white-box test system, where the software source code can be accessed, and information such as which specific code segments are causing test case failure is available.

Item Type: Thesis (University of Nottingham only) (MPhil)
Supervisors: Khan, Nafizah
Keywords: artificial intelligent; big data analysis; software development; advance technology
Subjects: T Technology > TK Electrical engineering. Electronics Nuclear engineering
Faculties/Schools: University of Nottingham, Malaysia > Faculty of Science and Engineering — Engineering > Department of Electrical and Electronic Engineering
Item ID: 74011
Depositing User: Kho, Xiang
Date Deposited: 22 Jul 2023 04:40
Last Modified: 22 Jul 2023 04:40
URI: https://eprints.nottingham.ac.uk/id/eprint/74011

Actions (Archive Staff Only)

Edit View Edit View