Here is a list of recent posts and major other updates of this website .

31 Aug 2016 PRTools 5.3.2
PRTools Minor updates
14 Oct 2015 The role of densities in pattern classification
Post Fundamentally pattern recognition is not about statistics. So why are probabilities and densities important?
12 Oct 2015 Pattern Recognition: Introduction and Terminology
eBook An ebook introducing concepts and terminology, including a large glossary with many internal and external links and examples.
 8 Sep 2015 Discovered by accident
Post An accidental observation by a student pointed to interesting possibilities of sequentially combining multi-class classifiers.
 2 Sep 2015 PRTools 5.3.1
PRTools Minor updates
16 Aug 2015 How to handle memory and time available for computations?
FAQ Describes the possible control of memory and computing time in PRTools.
16 Aug 2015 How to switch off the prwarning messages?
FAQ Introduces the new,  streamlined version of the prwarning system.
16 Aug 2015 PRTools 5.3.0
PRTools Handling of very small training sets, training time control, streamlined warning system.
15 July 2015 Cross-validation
Post How to define a proper cross-validation: How many folds? Do we need repeats? How to determine the significance?. Here are some considerations.
12 July 2015 How to generate a multi-dimensional banana set?
 FAQ The standard routine for generating the 2-dimensional banana set may be used to generate a multi-dimensional problem.
19 June 2015 How should the PRTools ROC plot be interpreted?
FAQ The relation between the PRTools ROC plot and the more standard one is discussed. A conversion statement is supplied.
17 June 2015 PRTools advanced examples
Examples A set of advanced examples has been added that may be copied and run by the user.
8 June 2015 Adaboost and the Random Fisher Combiner
Post The significant elements in the Adaboost classifier are the generation of base classifiers and the combining rule. Can they be simplified? A simple experiment.
5 May 2015 Using the test set for training
Post Never use the test set for training? Sometimes this rule can be neglected. It might be proper and helpful.
3 April 2015 My classifier scores 50% error. How bad is that?
Post What error rates can we expect for a trained classifier? How good or bad is a 50% error?Some observations and bounds.
23 February 2015 Is every pattern recognition problem a small sample size problem?
Post Also applications with large training sets have to face the small sample size problem.
21 December 2014 PRTools 5.2.3
PRTools Minor bog fixes
24 November 2014 Aristotle and the ugly duckling theorem
Post If Aristotelian designers of PR systems do not make a step in the Platonic direction, they may suffer from the ugly duckling theorem: all differences are equal.
13 October 2014 Why is the nearest neighbor rule so good?
Post Because it matches the problems of interest.
13 September 2014 PRTools 5.2
PRTools Using categorical data, converting cell arrays into datasets and reverse and more, see updates.
12 September 2014 There is no best classifier
Post Every problem has its own best classifier. Every classifier has at least one dataset for which it is the best. So every classifier is sometimes the best.
14 August 2014 The ten Aristotelian categories, features and dissimilarities
Post A relation between the ten categories and the problem of pattern recognition.
20 July 2014 Surprisingly good results in flow-cytometry classification
 Post Surprisingly good results might be a warning of something that is wrong. A report of a mistake.
 3 July 2014 Are football results random?
Post Sometimes the result of a football match seems arbitrary. Watching the FIFA world cup gives clear examples. Is a test on significance possible?
18 June 2014 Good recognition is non-metric: true or false?
 Post A recent paper in Pattern Recognition claims that good recognition is non-metric. Is this statement true or false?
14 May 2014 The Eurovision Song Contest analyzed
Post The results of the 2014 Eurovision Song Festival are analyzed by a clustering procedure to detect possible cultural similarities between countries.
29 April 2014 PRTools 5.1 available
PRTools Beautifications and additions
20 March 2014 Regularization and invariants
Post Regularization is equivalent to the use of invariants. Knowledge about invariants is thereby helpful for choing an effective regularization.
28 January 2014 Who invented the nearest neighbor rule?
Post Discussion on a paper by Pelillo on Alhazen.
 8 January 2014 Random representations
Post Why and when are random representations good?
24 November 2013 Hume’s fork in pattern recognition
Post Facts can be true, or just happen to be true. This results in two essentially different lines of research in pattern recognition: on the models or on the observed world.
 4 November 2013 Choosing or learning a representation?
Post Learning a representation needs another, initial representation on which learning can be based. A human choice is inevitable.
30 October 2013 How can I control classifier parameter optimization?
FAQ Classifier parameters, in particular the ones needed for regularization, can be optimized by the PRTools routine regoptc. Here some details are discussed.
28 August 2013 PRTools Examples added to the user guide
PRTools A large set of examples introducing the use of PRTools
28 August 2013 PRTools 5.0.2 available
PRTools Minor upgrade, generator and fixed_cell mappings introduced
14 August 2013 PRTools user guide
PRTools Major upgrade of the user guide, important for PRTools use on the command line
10 August 2013 Mapping types in user guide
PRTools The user guide has been extended with sections on the various mapping types: fixed, fixed_cell, untrained, trained, combiner, generator
 3 August 2013 How should I interpret the outcomes of a classifier?
FAQ What do the numbers mean that are shown as classifier outputs? Distances? Densities? Posteriors? Confidences?
23 July2013 The error in the error
Post How large is the classification error? How large should the test set be to have a small error in the error. The worn out test set.
16 July 2013 PRTools5 introduction
Page A summary of the whys and whats of the upgrade and transition information
15 July 2013 PRTools5 available
 PRTools This small but significant upgrade makes an integration with Stats, the Matlab Statsitical Toolbox possible. It integrates some of its classifiers as PRTools routines
 9 July 2013 PRTools 4.2.5 now available
PRTools A summary of changes
 8 July 2013 Pattern recognition and the art of naming
 Post Finding names for concepts makes them useful as building blocks in our thinking.
17 June 2013 Pattern recognition, for better or worse?
Post It starts with an innocent curiosity, then it is applied. The applications may be used for targets that the scientist and the engineer did not foresee.
 3 June 2013 Qualities and Quantities
Post Properties can be distinguished in qualities and quantities. Human decision making is based on the first, automatic pattern recognition on the second. Can they do the same thing?
27 May 2013 Classifying the exception
Post Exceptions do not follow the rules. That is their nature. Humans know how to handle them. Can that be learned?
20 May 2013 Fraud and pattern recognition
Post More and more reports appear about fraud in science. Does the field of pattern recognition suffer from fraud as well? Or, does it profit from it?
13 May 2013 Recognition, belief or knowledge?
 Post Pattern recognition may be based on machine learning. But what constitutes the training of the machine? Belief or knowledge? Nilsson is writing a book on beliefs.
 6 May 2013 Pattern recognition and neural networks, Kuhn and Popper
 Post Is the neural network model good for pattern recognition? Can this be decided by conjectures and refutations? Or is the answer determined by paradigm shifts?
28 April 2013 Peaking summarized
 Post Peaking (overtraining) of the real, expected and mean classification error.
21 April 2013 Platonic thinking
Post Using ideas and concepts as a basis for research.
14 April 2013 Trunk’s example of the peaking phenomenon
Post A discussion on the clearest peaking example.
 7 April 2013 Pattern recognition at eastern
Post The search for eastern eggs and a cycling tour with grandma show the pattern recognition abilities of grandchildren.
 1 April 2013 A crisis in the theory of pattern recognition
Post The Russian scientist A. Lerner published in 1972 a paper under the title: “A crisis in the theory of Pattern Recognition”. What was the crisis? The answer is surprising and still of actual interest.
30 March 2013 PRTools cheat sheet
PRTools PRTools in a glance, an active pdf sheet
25 March 2013 The curse of dimensionality
Post Imagine a two-class problem represented by 100 training objects in a 100-dimensional feature space. In anyway they are labeled a perfect linear classifier can be found. It is thereby not to be expected that such a classifier does generalize. So, 100 objects in a 100-dimensional space should be avoided. Or not?
18 March 2013 Hughes phenomenon
Post Hughes explanation of peaking and why it was wrong.
11 Marcch 2013 How to prepare my data for PRTools?
FAQ A basic question: what are the first steps to take from raw data towards the use of PRTools?
11 March 2013 The peaking paradox
 Post Why is the intuitive truth: “to measure is to know”, limited by statistics?
 4 March 2013 Non-metric dissimilarities are all around
 Post Some examples are given showing that non-metric dissimilarities arise easily, both, in daily life as well as in science.
25 February 2013 Metric learning, a problem in consciousness
Post Defining a proper distance measure is a consciouness problem. Can this be done by automatic means?
18 February 2013 Kernel-induced space versus the dissimilarity space
Post  The dissimilarity representation has a strong resemblance to a kernel. There are, however, essential differences in assumptions and usage. Here they are summarized and illustrated by some examples.
11 February 2013 Personal history on the dissimilarity representation
Post The previous post briefly explains arguments for the steps taken by us between 1995 and 2005. From the perspective we have now, it has become much more clear what we did in those years. In this post historical remarks are made that may sketch how research proceeds.
 9 February 2013 Two FAQ’s answerred on scatter plots: usage and gridsize effects
FAQ Discussions on the limited usage of scatter plots for 2D datasets only and accuracy effects on classification boundaries caused by changes of the gridsize setting.
 4 February 2013 The dissimilarity space – a step into the darkness
Post Representation by features may neglect relevant aspects, leading to class overlap. Pixels describe everything but tear the objects apart because their structure is not encoded in the representation. Structural descriptions are rich and describe the structure well, yet they do not construct a vector space. As a result, we lack a proper representation for learning from examples. Is there a way out, or are we trapped?
28 January 2013 What is new in PRTools
Post In the recent PRTools updates of September 2012 (4.2.2) November 2012 (4.2.3) and January 2013 (4.2.4) a number of tools have been added and changed of which not everybody might be aware. Here we will pay more attention to them and give some background information about their use
26 January 2013 PRTools 4.2.4 now available
PRTools A summary of changes
23 January 2013 The user guide in progress
PRTools A number of pages has been added to the user guide, in particular on elementary operations on and between datasets and mappings
21 January 2013 Non-Euclidean embedding
Post Non-Euclidean dissimilarities may be good for including knowledge about the objects in the dissimilarity measure, but how to embed them in a vector space if we want to use the standard linear algebra tools for generalization? Here the so-called pseudo-Euclidean space will be discussed.
20 January 2013 Follow us by RSS, Twitter or Recent Updates
Follow us icons on sidebar added.
14 January 2013 Non-euclidean and non-metric dissimilarities
Post Dissimilarities measures may be defined as distances in an euclidean space or such that they can be understood as euclidean distances. Euclidean distances satisfy the triangle inequality: the direct distance between two points is smaller than any detour. They are thereby metric.
 7 January 2013 Generalization by dissimilarities
Post Dissimilarities have the advantage over features that they potentially consider the entire objects and thereby may avoid class overlap. Dissimilarities have the advantage over pixels that they potentially consider the objects as connected totalities, where pixels tear them apart in thousands of pieces. Consequently, the use of dissimilarities may result in better classification performances and may require smaller training sets. But how should this be realized? How to generalize from dissimilarities?
 1 January 2013 Batch processing
PRTools Some mappings handling datasets or datafiles create large internal arrays. By this the speedup offered by Matlab array processing is maximized. Sometimes however these arrays become too large. Datasets applied to fixed mappings or to trained mappings may be split into smaller arrays without affecting the final result. Usually this is not possible for untrained mappings as during training all objects have to be related to each other.
31 December 2012 Dissimilarity measures
Post It has been argued that dissimilarities are potentially be a good alternative for features. How to build a  good representation will be discussed later. Here the question will be faced: what is a good measure? What type of measurement device should be used? What properties do we demand?
20 December 2012 How can PRTools be used for studying the dissimilarity representation?
FAQ Many studies have been presented on the dissimilarity representation in which experiments are based on PRTools. A large collection of research tools has thereby been developed. As research in this direction is still in progress by various researchers, a stable and consistent toolbox is not yet ready. There are however many possibilities to develop and run dissimilarity based experiments directly on PRTools. Here follows a very short introduction.
17 December 2012 Dissimilarities
Post In previous posts the usages of features and pixels are discussed for representing objects. Pros and cons are sketched. Here a third alternative will be considered: the direct use of dissimilarities. First the conclusions on features and pixels will be summarized.