Information Services banner Edinburgh Research Archive The University of Edinburgh crest

Edinburgh Research Archive >
Informatics, School of >
Informatics thesis and dissertation collection >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1842/284

This item has been viewed 88 times in the last year. View Statistics

Files in This Item:

File Description SizeFormat
WynnejonesSect1-2.pdfChapters 1 & 2270.09 kBAdobe PDFView/Open
WynnejonesSect3-4.pdfChapters 3 & 43.96 MBAdobe PDFView/Open
WynnejonesSect5-end.pdfChapter 5, refs and appendices338.96 kBAdobe PDFView/Open
Title: Model building in neural networks with hidden Markov models
Authors: Wynne-Jones, Michael
Supervisor(s): Wallace, David
Willshaw, David
Issue Date: Jul-1994
Publisher: University of Edinburgh. College Science and Engineering. Scool of Informatics
Abstract: This thesis concerns the automatic generation of architectures for neural networks and other pattern recognition models comprising many elements of the same type. The requirement for such models, with automatically determined topology and connectivity, arises from two needs. The first is the need to develop commercial applications of the technology without resorting to laborious trial and error with different network sizes; the second is the need, in large and complex pattern processing applications such as speech recognition, to optimise the allocation of computing resources for problem solving. The state of the art in adaptive architectures is reviewed, and a mechanism is proposed for adding new processing elements to models. The scheme is developed in the context of multi-layer perceptron networks, and is linked to the best network-pruning mechanism available using a numerical criterion with construction required at one extreme and pruning at the other. The construction mechanism does not work in the multi-layer perceptron for which it was developed, owing to the long-range effects occurring in many applications of these networks. It works demonstrably well in density estimation models based on Gaussian mixtures, which are of the same family as the increasingly popular radial basis function networks. The construction mechanism is applied to the initialization of the density estimators embedded in the states of a hidden Markov model for speaker-independent speech recognition, where it offers a considerable increase in recogniser performance, provided cross-validation is used to prevent over-training.
Description: Centre for Intelligent Systems and their Applications
URI: http://hdl.handle.net/1842/284
Appears in Collections:Informatics thesis and dissertation collection

Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback