Preprocessing perceptrons

University dissertation from Umeå : Datavetenskap, Umeå universitet

Abstract: Reliable results are crucial when working with medical decision support systems. A decision support system should be not only reliable but also interpretable, i.e. able to show how it has inferred its conclusions. In this thesis, the preprocessing perceptron is presented as a simple but effective and efficient analysis method to consider when creating medical decision support systems. The preprocessing perceptron has the simplicity of a perceptron combined with a performance comparable to the multi-layer perceptron.

The research in this thesis has been conducted within the fields of medical informatics and intelligent computing. The original idea of the production line as a tool for a domain expert to extract information, build decision support systems and integrate them in the existing system is described. In the introductory part of the thesis, an introduction to feed-forward neural networks and fuzzy logic is given as a background to work with the preprocessing perceptron. Input to a decision support system is crucial, and it is described how to gather a data set and decide how many and what kind of inputs to use. Outliers, errors and missing data are covered, as well as normalisation of the input. Training is done in a backpropagation-like manner, where the division of the data set into a training and a test set can be done in several different ways, just as the training itself can have variations. Three major groups of methods to estimate the discriminance effect of the preprocessing perceptron are described, and a discussion of the trade-off between complexity and approximation strength is included.

Five papers are presented in this thesis. Case studies are shown where the preprocessing perceptron is compared to multi-layer perceptrons, statistical approaches and other mathematical models. The model is extended to a generalised preprocessing perceptron, and the performance of this new model is compared to traditional feed-forward neural networks. Results concerning the preprocessing layer and its connection to multivariate decision limits are included. The well-known ROC curve, together with an improved curve, the QROC curve, is described and introduced fully into the field of computer science. Finally, a tutorial on the program trainGPP is presented. It describes how to work with the preprocessing perceptron from the moment when a data file is provided to the moment when a new decision support system is built.
