A new formulation for symbolic regression to identify physico-chemical laws from experimental data
- Pascal Neumannab
- Liwei Cao bc
- Danilo Russob
- Vassilios S. Vassiliadisb
- Alexei A.Lapkinbc
- a Aachener Verfahrenstchnik – Process Systems Engineering, RWTH Aachen University, Aachen, Germany
- b Department of Chemical Engineering and Biotechnology, University of Cambridge, Cambridge CB3 0AS, UK
- c Cambridge Centre for Advanced Research and Education in Singapore, CARES Ltd., 1 CREATE Way, CREATE Tower #05-05, 138602 Singapore, Singapore
Read the publication that featured this abstractA modification to the mixed-integer nonlinear programming (MINLP) formulation for symbolic regression was proposed with the aim of identification of physical models from noisy experimental data. In the proposed formulation, a binary tree in which equations are represented as directed, acyclic graphs, is fully constructed for a pre-defined number of layers. The introduced modification results in the reduction in the number of required binary variables and removal of redundancy due to possible symmetry of the tree formulation. The formulation was tested using numerical models and was found to be more efficient than the previous literature example with respect to the numbers of predictor variables and training data points. The globally optimal search was extended to identify physical models and to cope with noise in the experimental data predictor variable. The methodology was proven to be successful in identifying the correct physical models describing the relationship between shear stress and shear rate for both Newtonian and non-Newtonian fluids, and simple kinetic laws of chemical reactions. Future work will focus on addressing the limitations of the present formulation and solver to enable extension of target problems to larger, more complex physical models.
Get in touch
For more information on flow chemistry systems and services please use the contact methods below.
Call us on +44 (0)1284 728659 or Email us
Resource Centre
R-Series

The Vapourtec R-Series is, quite simply, unrivalled for flow chemistry
- Flexible |
- Precise |
- Automatable
The R-Series is undoubtedly the most versatile, modular flow chemistry system available today.
E-Series

The Vapourtec E-Series is the perfect introductory system for flow chemistry
- Robust |
- Easy to use |
- Affordable
The E-Series is a robust and affordable, entry level flow chemistry system designed for reliability and ease of use.