Handwriting Recognition
Google Scholar
You can also find out about my published research on Google Scholar, which, as well as providing links to the papers and their abstracts, additionally provides the latest citation counts and index values.
Overview
These papers are the outcome of a collaboration between myself and three colleagues at Griffith University in Australia: Jolon Faichney, Michael Blumenstein and Trevor Hine. Jolon was primarily responsible for the coding work, while Michael brought in the expertise with handwriting recognition and Trevor provided a brain science perspective from his work in psychology. While the practical focus of the work was on handwriting recognition, the deeper intention was to investigate the performance of various brain-inspired algorithms – particularly Jeff Hawkins’ hierarchical temporal memory architecture – on a task that the human brain handles relatively easily. This work expanded into the creation of Griffith University’s Cognitive Computing Unit (see the Brain-Inspired Algorithms page).
2009
Thornton, J. R., Faichney, J., Blumenstein, M., Nguyen, V. & Hine, T. (2009). Offline Cursive Character Recognition: A state-of-the-art comparison. In: IGS 2009: Proceedings of the 14th Conference of the International Graphonomics Society, Dijon, France, pp. 148-151.
Abstract: Recent research has demonstrated the superiority of SVM-based approaches for offline cursive character recognition. In particular, Camastra’s 2007 study showed SVM to be better than alternative LVQ and MLP approaches on the large C-Cube data set. Subsequent work has applied hierarchical vector quantization (HVQ) with temporal pooling to the same data set, improving on LVQ and MLP but still not reaching SVM recognition rates.
In the current paper, we revisit Camastra’s SVM study in order to explore the effects of using an alternative modified direction feature (MDF) vector representation, and to compare the performance of a RBF-based approach against both SVM and HVQ. Our results show that SVMs still have the better performance, but that much depends on the feature sets employed. Surprisingly, the use of more sophisticated MDF feature vectors produced the poorest results on this data set despite their success on signature verification problems.
2008
Thornton, J. R., Faichney, J., Blumenstein, M. & Hine, T. (2008). Character Recognition using Hierarchical Vector Quantization and Temporal Pooling. AI 2008: 21st Australasian Joint Conference on Artificial Intelligence. Lecture Notes in Computer Science, 5360, 562-572, Springer, ISSN 0302-9743.
Abstract: In recent years, there has been a cross-fertilization of ideas between computational neuroscience models of the operation of the neocortex and artificial intelligence models of machine learning. Much of this work has focussed on the mammalian visual cortex, treating it as a hierarchically-structured pattern recognition machine that exploits statistical regularities in retinal input. It has further been proposed that the neocortex represents sensory information probabilistically, using some form of Bayesian inference to disambiguate noisy data.
In the current paper, we focus on a particular model of the neocortex developed by Hawkins, known as hierarchical temporal memory (HTM). Our aim is to evaluate an important and recently implemented aspect of this model, namely its ability to represent temporal sequences of input within a hierarchically structured vector quantization algorithm. We test this temporal pooling feature of HTM on a benchmark of cursive handwriting recognition problems and compare it to a current state-of-the-art support vector machine implementation. We also examine whether two preprocessing techniques can enhance the temporal pooling algorithm’s performance. Our results show that a relatively simple temporal pooling approach can produce recognition rates that approach the current state-of-the-art without the need for extensive tuning of parameters. We also show that temporal pooling performance is surprisingly unaffected by the use of preprocessing techniques.
2006
Thornton, J. R., Gustafsson, T., Blumenstein, M. & Hine, T. (2006). Robust Character Recognition using a Hierarchical Bayesian Network. AI 2006: 19th Australian Joint Conference on Artificial Intelligence. Lecture Notes in Computer Science, 4304, 1259-1264, Springer, ISSN 0302-9743.
Abstract: There is increasing evidence to suggest that the neocortex of the mammalian brain does not consist of a collection of specialised and dedicated cortical architectures, but instead possesses a fairly uniform, hierarchically organised structure. As Mountcastle has observed [1], this uniformity implies that the same general computational processes are performed across the entire neocortex, even though different regions are known to play different functional roles. Building on this evidence, Hawkins has proposed a top-down model of neocortical operation [2], taking it to be a kind of pattern recognition machine, storing invariant representations of neural input sequences in hierarchical memory structures that both predict sensory input and control behaviour. The first partial proof of concept of Hawkins’ model was recently developed using a hierarchically organised Bayesian network that was tested on a simple pattern recognition problem [3]. In the current study we extend Hawkins’ work by comparing the performance of a backpropagation neural network with our own implementation of a hierarchical Bayesian network in the well-studied domain of character or handwriting recognition. The results show that even a simplistic implementation of Hawkins’ model can produce recognition rates that exceed a standard neural network approach. Such results create a strong case for the further investigation and development of Hawkins’ neocortically-inspired approach to building intelligent systems.