B. Burlacu, M. Affenzeller, M. Kommenda, G. K. Kronberger, S. M. Winkler - Schema Analysis in Tree-based Genetic Programming in Genetic Programming in Theory and Practice XV (Contributions to Book: Part/Chapter/Section 2), - Springer International Publishing, 2018, pp. 17-37
In this chapter we adopt the concept of schemata from schema theory and use it to analyze population dynamics in genetic programming for symbolic regression. We define schemata as tree-based wildcard patterns and we empirically measure their frequencies in the population at each generation. Our methodology consists of two steps: in the first step we generate schemata based on genealogical information about crossover parents and their offspring, according to several possible schema definitions inspired from existing literature. In the second step, we calculate the matching individuals for each schema using a tree pattern matching algorithm.We test our approach on different problem instances and algorithmic flavors and we investigate the effects of different selection mechanisms on the identified schemata and their frequencies.