TCM
UoC crest

Research Highlights

Unveil the unseen: exploit information hidden in noise

Bogdan Zviazhynski, Gareth Conduit

We develop a computer program that looks for patterns in data. These patterns are difficult to notice because of the noise. However, our program utilizes the noise to see these patterns in greater detail. We apply the program to predict two phenomena in Physics.

Prediction of the phase transition of PbZr<sub>0.7</sub>Sn<sub>0.3</sub>O<sub>3</sub> crystal at <span>~</span>440 K. Blue points 
are experimental data. The plot includes predictions using uncertainty in dielectric constant (red, error region shaded, R<sup>2</sup> = 0.85 on validation) and without 
uncertainty (orange, error region shaded, R<sup>2</sup> = <span>−</span>0.04 on validation). The grey shaded region is the validation region, 
where predictions are compared against the experimental data.
Prediction of the phase transition of PbZr0.7Sn0.3O3 crystal at ~440 K. Blue points are experimental data. The plot includes predictions using uncertainty in dielectric constant (red, error region shaded, R2 = 0.85 on validation) and without uncertainty (orange, error region shaded, R2 = 0.04 on validation). The grey shaded region is the validation region, where predictions are compared against the unseen experimental data.

Noise and uncertainty are usually the enemy of machine learning, noise in training data leads to uncertainty and inaccuracy in the predictions. However, we develop a machine learning architecture that extracts crucial information out of the noise itself to improve the predictions. The phenomenology computes and then utilizes uncertainty in one target variable to predict a second target variable. We apply this formalism to PbZr0.7Sn0.3O3 crystal, using the uncertainty in dielectric constant to extrapolate heat capacity, correctly predicting a phase transition that otherwise cannot be extrapolated. For the second example – single-particle diffraction of droplets – we utilize the particle count together with its uncertainty to extrapolate the ground truth diffraction amplitude, delivering better predictions than when we utilize only the particle count.

Our generic formalism enables the exploitation of uncertainty in machine learning, which has a broad range of applications in the physical sciences and beyond. For example, the formalism can utilize the noise in the images of concrete to predict its physical properties. Another exemplar application is to utilize the information in genetic chaos for early cancer detection. Furthermore, there are many other fields, including autonomous vehicles, additive manufacturing, and information engineering, where the information embedded in noise can be exploited to accelerate development, understanding, and impact.

Unveil the unseen: Exploit information hidden in noise

Bahdan Zviazhynski and Gareth Conduit, Applied Intelligence (2022)

More TCM Research Highlights