To switch or not to switch – a machine learning approach for ferroelectricity

Sabine M. Neumayer; Stephen Jesse; Gabriel Velarde; Andrei L. Kholkin; Ivan Kravchenko; Lane W. Martin; Nina Balke; Peter Maksymovych

doi:10.1039/C9NA00731H

View PDF VersionPrevious ArticleNext Article

Open Access Article

This Open Access Article is licensed under a
Creative Commons Attribution 3.0 Unported Licence

DOI: 10.1039/C9NA00731H (Paper) Nanoscale Adv., 2020, 2, 2063-2072

To switch or not to switch – a machine learning approach for ferroelectricity†

Sabine M. Neumayer ^a, Stephen Jesse ^a, Gabriel Velarde ^bc, Andrei L. Kholkin ^de, Ivan Kravchenko ^a, Lane W. Martin ^bc, Nina Balke ^a and Peter Maksymovych *^a
^aCenter for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA. E-mail: maksymovychp@ornl.gov
^bDepartment of Materials Science and Engineering, University of California, Berkeley, CA 94720, USA
^cMaterials Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
^dDepartment of Physics & CICECO – Aveiro Institute of Materials, University of Aveiro, Aveiro, Portugal
^eSchool of Natural Sciences and Mathematics, Ural Federal University, Ekaterinburg, Russia

Received 19th November 2019 , Accepted 6th April 2020

First published on 15th April 2020

Abstract

With the advent of increasingly elaborate experimental techniques in physics, chemistry and materials sciences, measured data are becoming bigger and more complex. The observables are typically a function of several stimuli resulting in multidimensional data sets spanning a range of experimental parameters. As an example, a common approach to study ferroelectric switching is to observe effects of applied electric field, but switching can also be enacted by pressure and is influenced by strain fields, material composition, temperature, time, etc. Moreover, the parameters are usually interdependent, so that their decoupling toward univariate measurements or analysis may not be straightforward. On the other hand, both explicit and hidden parameters provide an opportunity to gain deeper insight into the measured properties, provided there exists a well-defined path to capture and analyze such data. Here, we introduce a new, two-dimensional approach to represent hysteretic response of a material system to applied electric field. Utilizing ferroelectric polarization as a model hysteretic property, we demonstrate how explicit consideration of electromechanical response to two rather than one control voltages enables significantly more transparent and robust interpretation of observed hysteresis, such as differentiating between charge trapping and ferroelectricity. Furthermore, we demonstrate how the new data representation readily fits into a variety of machine-learning methodologies, from unsupervised classification of the origins of hysteretic response via linear clustering algorithms to neural-network-based inference of the sample temperature based on the specific morphology of hysteresis.

Introduction

Nano- and mesoscale electromechanical behavior underpins the performance of sensors, actuators, energy harvesters, ferroelectric field effect transistors and electrocaloric devices.^1–9 Interesting phenomena arise from local defect chemistry, chemical domains and interfaces such as domain walls, grain boundaries and the surface itself, necessitating local probing techniques to study functional material response.^10–20 Therefore, piezoresponse force microscopy (PFM) and related techniques that probe piezoelectric properties on the nm to μm scale have become increasingly popular.^21–26 In these techniques, AC and DC voltages are applied to a conductive tip in contact with the sample surface and the electromechanical response of the surface to the voltage stimuli is detected. The periodic cantilever deflection D_ac in response to the applied voltage provides information on the strength of electromechanical interaction and the direction of the ferroelectric polarization.

Typically, electromechanical response is studied as a univariate function of DC voltage, producing a characteristic hysteresis loop which is reminiscent of macroscopic polarization-switching measurements. Hysteresis exhibits variability, however, with the number of voltage cycles, time and temperature. In order to analyze and extract meaning from these data sets, machine learning has become increasingly important.^27–34 Dimensionality reduction without loss of important information, de-noising, clustering and identifying characteristic features in data sets have been achieved using supervised and unsupervised machine-learning algorithms.

Despite the growing number of successful statistical analyses, the previous efforts to probe and analyze hysteresis attempt to separate interdependency of control parameters, typically by varying one parameter while keeping the rest of the conditions constant (for example measuring piezoresponse at a fixed voltage along the switching cycle – so-called switching spectroscopy PFM³⁵). To what extent such decoupling of control variables can be systematically achieved is up to debate. But more importantly, it is natural to ask whether there exist more encompassing approaches, where coupling of control parameters can be explicitly investigated.

For a model setting, we focus on hysteresis measured with the technique of contact Kelvin probe force microscopy^36,37 (cKPFM). cKPFM is conceptually analogous to commonly used open-loop Kelvin probe force microscopy (KPFM), albeit with the tip in mechanical contact with the measured surface. As is the case with KPFM, cKPFM detects net changes of electrostatic forces, this time acting on the leads of the capacitor, that sandwiches the studied dielectric between the scanning probe and the bottom electrode. Although less sensitive than KPFM owing to much larger contact stiffness in comparison to the unclamped cantilever, the demonstrated utility of cKPFM is to tease out electrostatic contributions to the net force acting on the capacitor, which can be complementary or competing to other forces – for example piezoelectric deformation of the dielectric. Therefore, cKPFM found application in distinguishing between piezoelectric and electret response of dielectrics to applied voltage,³⁸ particularly relevant for new and/or nanoscale materials, where piezoelectric response can be small and electromechanical properties unknown.

Here we introduce a two-dimensional (2D) representation of the field-induced hysteresis measured by cKPFM, which explicitly considers two-parameter dependence of the net measured signal on the “write” and “read” bias signals. Remarkably, just the transition to 2D dependencies and representation vastly simplifies qualitative interpretation of the measured signal, clearly separating, for example, ferroelectric and electret behaviors. Subsequently, we demonstrate how this 2D approach holistically facilitates application and interpretation of a variety of machine-learning algorithms, including artificial neural networks (ANN), that extend the applicability of this methodology to detect incomplete switching and ferroelectric-relaxor phase transitions with minimum human input. Even inference of the temperature of the sample from its hysteretic response appears to be possible with fairly simple neural networks. We believe that this data representation technique will help advance the experimental methodologies of hysteretic spectroscopy and spectro-microscopy to characterize memory functions and hysteretic materials in general and will also help bridge the theory and experiment in a more statistically robust setting.

Results and discussion

Hysteresis loops measured with cKPFM represent a basic example of dependency of the response on a combination of two electrical signals – one required to “write” the hysteretic state of the system, and the other one – to “read” the state. And, somewhat ironically, this two-parameter dependence presents one of the primary difficulties with observational interpretation of the cKPFM data. Indeed, a successful interpretation requires considering not only the opening of the hysteresis loop, but also the evolution of the loop with varying read-bias. Without rigorous statistical metrics, such an interpretation is quite challenging, especially in cases where electromechanical and electrostatic signal contributions are of comparable strength. This is particularly true for intermediate cases where subtle as well as dramatic features may arise in the hysteresis loop.

To demonstrate our approach, we utilize cKPFM response for different ratios of V_read and V_write on a model ferroelectric lead zirconium titanate (PbZr_0.2Ti_0.8O₃, PZT) film before discussing typical cKPFM response on a non-piezoelectric amorphous hafnium dioxide (aHfO₂) sample, where electrostatic effects can lead to observation of hysteresis loops in standard switching spectroscopy PFM.³⁹ In a next step, we analyze spatially varying cKPFM response measured on macroscopically pre-poled lithium niobate (LN) that is subject to strong electrostatic interactions typical for that ferroelectric material.¹⁰ To further corroborate the applicability of our analysis approach we process cKPFM response measured across the ferroelectric-relaxor phase transition on multiple grains of lanthanum zirconate titanate (PLZT), which even in the ferroelectric state exhibits peculiarities in hysteresis loop, as commonly observed for relaxors.³¹

Fig. 1(a) schematically depicts the DC voltage waveform used in cKPFM, which consists of triangular write pulses V_write to initiate ferroelectric switching and a probe voltage V_read that is applied between the write pulses and stepwise changed with every write cycle. Therefore, the measured response is a function of read and write voltage and spans a 2D parameter space for each probed location. The cKPFM data (Fig. 1) were measured on PZT, one of the most common ferroelectrics.


	Fig. 1 cKPFM on ferroelectric PZT. (a) Schematic of sequence of DC voltage pulses applied during cKPFM. V_write pulses are increasing and decreasing in a triangular envelope, whereas the V_read is applied between write pulses and sequentially increased with each cycle. (b) Traditional cKPFM diagram where response D_ac is plotted as a function of V_read with V_write steps color coded. (c) Hysteresis loop extracted from V_read = 0 as a function of V_write (top) and unfolded as a function of V_write step #. (d) Unfolded loops for all V_read stacked along a third dimension. These loops are projected on a 2D map where D_ac is represented by the color scale, rows correspond to response during a certain V_read and columns correspond to V_write steps. (e) cKPFM with decreased V_read window as indicated in the green box in panel (d), (f) cKPFM map with a further decreased the probing window corresponding to the blue box in panel (e). The white dashed line in cKPFM maps in panels (d–f) indicate V_write.

In the previous representation of the cKPFM data (Fig. 1(b)), response D_ac is depicted versus V_read and data acquired during each V_write step is overlaid on the same plot, with different line colors indicating the preceding write step.^37,39 Interpretation of these diagrams, however, can be challenging. cKPFM diagrams typically show 40–100 lines in one graph, which often overlap and can therefore be hard to interpret. Moreover, it can be difficult to exactly distinguish between color nuances that correspond to the preceding V_write step. The classification as ferroelectric switching or electrostatically driven artefacts, which is often the main reason to apply this technique, has been only vaguely and qualitatively defined as the presence/absence of formation of “two bands” and the cKPFM curve shapes resembling a hysteresis loop, as opposed to electrostatic artefacts which would appear as one band of straight lines.^10,39 Such an assignment is highly dependent on the probing versus writing voltage ratios, as well as the vagueness of the definition of a “good” hysteresis loop.

Our proposed alternative is a 2D representation of the cKPFM experiment, which (1) unfolds the individual hysteresis loops so that they are a function of time or voltage step, rather than applied bias; and (2) stacks the progression of the unfolded loops into a matrix, where each row corresponds to a certain V_read and each column to a certain V_write step. The unfolding and stacking procedure is shown in Fig. 1(c and d). The strength of the response at each V_read/V_write parameter pair value is represented by the color scale. The magnitude of V_write is indicated by the white, dashed triangular line for reference purposes. Slicing the cKPFM map along the columns corresponds to D_ac as a function of V_read, i.e., the traditional cKPFM representation. A single slice at V_read = 0 (followed by conversion of the x-axis to V_write) corresponds to the often used “remnant” switching spectroscopy hysteresis loop. The diagonal slice at V_read = V_write produces the “in-field” hysteresis loop. Of course, many other forms of 1D hysteresis can also be created, via cuts in arbitrary directions in V_read–V_write space.

Therefore, right away, the cKPFM map generalizes the measurement of the hysteretic response of the dielectric to applied stimuli, encompassing, in principle, the response to all possible combinations of read-write waveforms.

For the specific case of PZT, the cKPFM map shows sharp horizontal and vertical edges where the measured D_ac abruptly changes, indicating ferroelectric switching events. Obviously, because V_write and V_read both apply electric field across the dielectric, the response is not fully independent as a function of these parameters. For example, if V_read exceeds the coercive voltage, polarization will necessarily switch into the preferred orientation. This is shown in Fig. 1(d), (e) and (f), where V_read was changed from values between ±4 V (Fig. 1(e)) and ±2 V (Fig. 1(f)), while V_write was kept the same triangular wave between ±10 V. The coercive voltages are approximately 4 V for the film (Fig. 1(c)). Thus, switching is induced by both V_read and V_write in Fig. 1(e) whereas in Fig. 1(f), only V_write is ramped above the coercive voltage. This is not, however, a limiting factor for statistical analysis, since a properly trained classifier can easily incorporate the additional information on when switching occurs during the V_write and V_read steps. One point of potential inquiry is the time-dependence of the cKPFM maps. We will not address it here, but naturally there is always some level of time-dependence for hysteretic processes, and under the right approach this will enhance the level of understanding even more. For now, we will proceed with the assumption that cKPFM maps are not strongly time-dependent, representing the case of measurement much slower than the characteristic response time of the material.

The immediate utility of cKPFM maps is that they represent spectral information as images. Therefore, the data can be more intuitive to interpret for the human mind and is more suitable for image-based deep-learning algorithms that are becoming increasingly important. To illustrate some of these advantages, we begin with the simplest task of data interpolation.

For experimental reasons, such as acquisition time and wear of the conductive tip coating, response maps can only be acquired for a limited amount of sampling points in voltage space. 2D interpolation, however, provides higher-resolution maps from which gradients can be obtained to further highlight response function characteristics. Interpolation can be calculated with a wide variety of algorithms (e.g., Gaussian progress regression, linear, cubic, spline), many of which are conveniently implemented in open-source packages such as scikit-image, scipy.interpolate, etc.^40,41

Interpolation of cKPFM maps for ferroelectric PZT and non-ferroelectric aHfO₂ are shown in Fig. 2. A simple check of the quality of the interpolation, other than the 2D image itself, is seen in the extracted 1D remnant hysteresis loop (ESI, Fig. S1†), revealing arguably excellent quality of interpolated values. With the interpolated values, one can extract gradients of the dielectric response to V_read and V_write. The gradients present a transparent and simple approach to differentiate characteristic behaviors, e.g., to compare piezoresponse of ferroelectric to electrostatic forces of a dielectric.


	Fig. 2 Maps of experimental cKPFM data, interpolated data and the gradients in x- and y-direction calculated from interpolated maps. (a) For PZT where ferroelectric switching events are characterized by the sharp contrast transition corresponding to thin lines in the x-gradient map and (b) for non-ferroelectric aHfO₂ where D_ac changes gradually dependent on V_read and V_write, resulting in broad bands in the x-gradient map.

Fig. 2 shows experimental data, interpolated maps and the gradients calculated from the interpolated data for ferroelectric PZT using V_read amplitudes of 2 V and V_write amplitudes of 10 V (Fig. 2(a)) and non-ferroelectric aHfO₂ (Fig. 2(b)). The corresponding gradients reveal sharp edges in the x-gradient for ferroelectric switching and little contrast in the y-gradient, as expected, based on abrupt switching of the sign of the piezoelectric response at specific voltages on one hand, and weak dependence of piezoelectric response with applied voltage on the other. A composite figure showing cKPFM maps of experimental data, interpolated data, x- and y-gradients for PZT upon applying different V_read and V_write ratios is shown in the ESI, Fig. S2.† An interesting case is the hysteresis of aHfO₂, which previously was assigned to transient charging of the probed volume by rechargeable traps.³⁶ The aHfO₂ cKPFM map is sharply contrasting that of a ferroelectric. Fig. 2(b) shows that there is no abrupt change of contrast in the x-direction (of varying V_write), but rather a gradual wave-like modulation of the response along the y-direction (of varying V_read). The corresponding gradients in x-direction are very smooth (unlike the sharp transitions of the ferroelectric case). Yet, there is also a notable feature in the y-gradient, corresponding to the position of the contact potential minimum, that shifts with applied V_write. Applying higher read and write voltages to increase charge injection does not significantly change the overall contrast of experimental and interpolated data maps and x-gradients still do not exhibit sharp vertical contrast but rather exhibit V-shaped tilted lines of similar values as the background (see Fig. S3 in the ESI†). Therefore, cKPFM maps in the absence of polarization switching as in the presented aHfO₂ data are very clearly distinguishable from the case of ferroelectric switching, and reveal useful features, such as the modulation of the contact potential difference.

Due to the distinctive features of non-switching versus switching cKPFM data, statistical clustering techniques can be utilized to identify functional behavior in large data sets dependent on material, spatial region within a sample, temperature or time. In general, clustering algorithms find similarities in data and use those similarities to group data. Two examples for suitable methods to break down the different types of detected response are hierarchical agglomerative clustering (HAC) and density based spatial clustering (DBSC).⁴² These algorithms can either be applied directly to the measured response or after de-noising and dimensionality reduction, e.g., through principal component analysis (PCA).

HAC initially assigns a cluster to each cKPFM map in a given data set. In a next step, the most similar clusters where the response is closest in the feature space are paired. These grouped clusters are then clustered again dependent on their similarities. Clustering into a decreasing number of clusters containing an increasing amount of cKPFM maps is continued iteratively until all the data is grouped into two main clusters. We apply HAC directly to a data set consisting of cKPFM data for three types of behavior: (i) electret response measured on aHfO₂, (ii) ferroelectric switching measured on PZT and (iii) response from ferroelectric PZT upon applying DC voltages below the coercive voltage, which does not initiate polarization switching. The total number of data sets was 3000, comprising 400 measured on PZT upon switching with different ratios V_read/V_write ratios, 100 data sets for sub-coercive voltages on PZT and 2500 measured on aHfO₂ representing the electret response. The results of HAC are depicted in Fig. 3. The dendrogram in Fig. 3(a) shows the relationship between the identified clusters with the vertical lines indicating similarities between the grouped data. The shorter the vertical line, the more similar is the response. The numbers in parentheses on the x-axis correspond to the number of cKPFM data sets within the clusters associated with the vertical lines. The two most distinct clusters are indicated in blue in the dendrogram and represent the cKPFM data in Fig. 3(b) for point 1 and 2 at a numeric distance of 150. Clearly, cluster #1 exhibits non-ferroelectric behavior similar to the map for aHfO₂ depicted in Fig. 3(b). Contrary, cluster #2 corresponds to ferroelectric switching associated with the sharp contrast as discussed previously. Thus, unsupervised clustering immediately answers the proverbial “To switch or not to switch” question, which continues to dominate the studies of nanoscale ferroelectrics, 2D materials and other emerging members of the ferroelectric family, where the signals are typically quite weak.³⁸


	Fig. 3 Hierarchical agglomerative clustering. (a) Dendrogram indicating distribution of clusters and their similarities (indicated by the length of vertical lines), (b) cKPFM maps of mean response for clusters indicated by color and numbers in panel (a). Cluster #1 identifies characteristics where no polarization switching occurs either because a V_write below the coercive voltage is applied to a ferroelectric sample (cluster #5) or the response is purely electrostatic (cluster #6). Cluster #2 groups data of switching events occurring at different V_write steps (cluster #3 and #4).

If we further look into the next level of clustering, however, both clusters #1 and #2 separate into methodologically meaningful and useful categories. Cluster #1 correctly subdivides cKPFM data where no polarization switching occurs either due to sub-coercive voltages (cluster #5) or a non-ferroelectric sample where contrast is governed by electrostatic interactions (cluster #6), as shown in Fig. 3(b). Indeed, although the material is itself ferroelectric, this fact cannot be inferred with sub-coercive voltage spectroscopy, which generates no switching by electric field (see maps in Fig. S2,† first row). At the same time, cluster #2, partitions the data depending on the extent by which V_read exceeds the coercive voltage (cluster #3 and #4), which is a valuable methodological distinction, as discussed earlier. Due to the hierarchical approach, clusters #3 to #6 can be subdivided into more clusters that show similar behavior to each of the parent clusters.

Apart from grouping data of different materials, clustering is also able to identify local variability of hysteretic response within a single sample. To this end, we combine PCA and clustering to categorize cKPFM data measured on a spatial grid of 40 × 40 pixels on LN. The LN sample had been macroscopically pre-poled and dynamic response on this material is typically subject to strong electrostatic interactions even in the presence of polarization switching.¹⁰

In a first step, PCA projects cKPFM maps into a lower dimensional feature space.^28,42 This way, component #0 contains the most information on variance in the data, followed by component #1 and so on. PCA scores are a projection of the data points on the found eigenvectors. Fig. 4(a) shows scores for PCA components #0 to #3 represented as 2D cKPFM maps as discussed previously. The scree plot (Fig. 4(b)) shows that the variance within the data is successfully captured by about 3 components, with PCA components #0 through #2 accounting for almost 100% of total variance.


	Fig. 4 Dimensionality reduction and density based spatial clustering of cKPFM measured on LN. (a) Score maps of PCA components and (b) scree plot indicating the data explained with each PCA component. (c) 3D representation of PCA components #0 to #2, color indicates DBSC clusters. (d) Spatial map of DBSC cluster assignment, (e) maps of mean response for each cluster. Cluster #0 (blue) corresponds to ferroelectric switching, cluster #1 and #2 are characteristic for electrostatic response.

In a next step, DBSC is applied to the first three PCA components. DBSC algorithms group data based on the density of data points. Unlike k-means, where the user defined number of clusters dominates the clustering result, the most important parameters in DBSC are the maximum distance between two data points to be considered in the same neighborhood and the minimum number of datapoints within a neighborhood. DBSC on the first three PCA components identified 3 clusters (shown in blue, red and green) in Fig. 4(c). Reverting back to the original grid of points across the LN surface, we observe that the clustering identifies distinct hexagonal areas (red) on the surface, while the small green cluster is primarily at the boundary of this area. This observation matches the history of the sample, which was oriented in +z direction and macroscopically pre-poled in hexagonal areas with −z polarization orientation.¹⁰ The mean response for each cluster is shown in Fig. 4(e). Clearly, the difference between the clusters corresponds to switching (cluster #0) and non-switching (cluster #1 and #2) behaviors within the applied voltage range. Note that while no polarization switching occurs for these clusters, the similarity to cKPFM maps acquired on aHfO₂ indicates charge injection and strong electrostatic contributions to the measured signal. The not assigned data points appear to be a mixture between switching and non-switching characteristics and correspond to data points at the decision boundaries in Fig. 4(c). Apart from HAC and DBSC, other algorithms like k-means (e.g., Li et al.,³² Neumayer et al.³¹) can be used to group response to find trends, e.g., dependent on material, location, temperature, etc.

Successful differentiation of switching properties via machine learning of cKPFM maps motivated us to apply non-linear clustering methods, such as neural networks, that could potentially reveal even more details of dielectric behavior. Moreover, while previously discussed clustering algorithms implicitly assume linear separability between members of different clusters in Euclidian space, multilayer perceptron ANNs are able to separate data where this requirement is not fulfilled.⁴³

First, we trained an autoencoder network to reveal how effective simple ANN structures are at capturing and reproducing cKPFM maps. Subsequently, we clustered the maps into characteristic types of behavior, using the autoencoder's latent space as a low-dimensional representation of the cKPFM dataset. We utilized temperature-dependent cKPFM data across the ferroelectric-relaxor phase transition of PLZT.³¹ The electromechanical response of relaxor ferroelectrics can be particularly challenging to analyze due to their peculiarities in hysteresis loops, even in the ferroelectric state.⁴⁴ The proximity of the phase transition point, however, allows us to see how effective machine learning is in capturing the relevant changes of the hysteretic response.

The experimentally acquired cKPFM maps are depicted in Fig. 5(a), with the temperature as a third dimension. Fig. 5(b) shows the topology of the autoencoder network ANN. We intentionally minimized the complexity of the network, which essentially consists of two sequential multiperceptrons (layers 2 and 4), separated by non-linear activation layers. Yet remarkably, such a network can efficiently capture a high level of detail in the cKPFM map, as shown in Fig. 5(c), where we compare a random selection of measured cKPFM maps extracted at different locations and temperatures to their reconstructions by the trained autoencoder.


	Fig. 5 Training ANN autoencoder to represent cKPFM data. (a) Experimental cKPFM data on PLZT measured at 7 different temperatures across the ferroelectric-relaxor phase transition. (b) The graph of autoencoder network, starting with flattening of the input array (1), followed by mapping into 4 layer perceptron (2), rectifying activation (3), mapping into 4 layer perceptron (4), sigmoid activation (5) and reshaping (6). “MS” corresponds to mean-squared loss layer. The dimensionality of tensors is shown above the graph edges. (c) Randomly chosen measured and cKPFM maps corresponding to cKPFM data from single pixels at the indicated temperature, (d) DBSC on resulting encoded vectors. Clustering was performed in 4D latent space, for representation purposes only vector 1 to 3 are plotted in the 3D graph.

Subsequently, we truncated the autoencoder to layer 5, so as to project the measured cKPFM maps onto the encoder's latent 4D space – a common practice for such techniques.

Equally noteworthy is that ANNs can then be used to infer certain experimental conditions from the cKPFM map. To this end, we trained a linear network shown in Fig. S4(a)† to predict the temperature of the sample from the cKPFM map, using one subset of the temperature-dependent data-set for training of the cKPFM – temperature relationship, and another subset for validation.

As seen in Fig. S4(b),† the network can not only predict the data for the low (5 °C) and high (95 °C) temperatures with a high degree of certainty, but it is also effective in partitioning the whole data set into 5 °C increments. The least accurate predictions (with an accuracy of ∼70%) correspond to temperatures of ∼80 °C. But even in this case, the prediction error concentrates within ±5 °C proximity of the given temperature.

Given the simplicity of the networks employed, we can readily envision a rich spectrum of applications for this type of machine learning, enabled by increasing complexity as well as flexibility of the network functions – for example with the user of convolutional variational autoencoders, as well as a platform for matching models of the electromechanical response to observations.

Conclusions

In conclusion, we introduced a 2D representation of electromechanical response measured by scanning probe microscopy on ferroelectric and electret samples and demonstrated the ability of machine learning algorithms to distinguish between functional material characteristics. The two-dimensional maps are much easier to interpret by qualitative inspection, than the corresponding one-dimensional representations of dielectric and hysteretic response, and they provide a fertile opportunity for machine-learning techniques to capture and infer material properties. In particular, we provide now statistically robust and automated differentiation between electrostatic charging and ferroelectric switching, which is particularly important for the emerging fields of nanoscale ferroelectrics and energy efficient electronics. Moreover, even simple neural networks trained on this representation can detect phase transitions across distinct dielectric properties and even infer experimental parameters. While discussed here for cKPFM data, the new representation and its analysis can be extended for other dielectric and electromechanical spectroscopy measurements, such as first-order reversal curves or relaxation data. Moreover, the representation is completely general, and can be easily applied to any dynamic and hysteretic response, with likely effectiveness in systematic analysis of magnetic and resistive hysteresis, as well as higher dimensionality of the measurement, for example to capture relaxation behavior in time as a function of temperature and applied voltages. Overall, we believe that higher dimensional data representation coupled with machine learning will provide numerous advances in microscopy and spectroscopy, particularly in the areas of noisy and complex response as well as theory-experiment matching.

Methods

The 150 nm PbZr_0.2Ti_0.8O₃ thin-film was grown on 25 nm of SrRuO₃ bottom electrode and DyScO₃ (110) single-crystal substrate via pulsed laser deposition. With use of a Kr–F excimer laser (Coherent LPX-300), the SrRuO₃ bottom electrode was first deposited at a heater temperature of 640 °C in a dynamic oxygen-partial pressure of 100 mTorr with a laser repetition rate and fluence of 14 Hz and 1.23 J cm⁻², respectively. Next, the chamber was adjusted to a heater temperature of 630 °C and dynamic oxygen-partial pressure of 200 mTorr before depositing 150 nm PbZr_0.2Ti_0.8O₃ at a laser repetition rate and fluence of 2 Hz and 1.43 J cm⁻², respectively. Lastly, the ferroelectric heterostructure was cooled to 25 °C at 5 °C min⁻¹ under a static oxygen pressure of ∼700 Torr.

See references for details on the aHfO₂,³⁹ LN¹⁰ and PLZT³¹ samples.

cKPFM was measured using Nanosensor PPP-EFM tips (aHfO₂) or Budget sensors (all other samples) with a nominal force constant around 3 N m⁻¹ on commercial atomic force microscopes (aHfO₂: Bruker Icon, lithium niobate: Asylum Research MFP-3D, PLZT and PZT: Asylum Research, Cypher). Custom LabView codes and National Instruments data acquisition hardware was used to acquire cKPFM data.

All analysis was performed in Python⁴⁵ and Mathematica (ANN).

Conflicts of interest

The authors declare no competing interests.

Acknowledgements

Experiments on PZT, data analysis and manuscript preparation were supported by the Division of Materials Science and Engineering, Basic Energy Sciences, US Department of Energy (S. N., N. B., P. M.). Experiments were conducted at and supported by (S. J.) the Center for Nanophase Materials Sciences, which is a DOE Office of Science User Facility. S. M. N. would like to thank Katia Gallo and Mohammad Amin Baghban for providing the lithium niobate sample. G. V. acknowledges support from the National Science Foundation under grant DMR-1708615. L. W. M. acknowledges support from the Army Research Office under grant W911NF-14-1-0104. Part of this work was developed within the scope of the project CICECO-Aveiro Institute of Materials, refs. UIDB/50011/2020 & UIDP/50011/2020, financed by national funds through the FCT/MEC.

References

X. Chen, S. Xu, N. Yao and Y. Shi, Nano Lett., 2010, 10, 2133–2137 CrossRef CAS PubMed.
Z. Kutnjak, J. Petzelt and R. Blinc, Nature, 2006, 441, 956–959 CrossRef CAS PubMed.
S. Lu, Q. Liao, J. Qi, S. Liu, Y. Liu, Q. Liang, G. Zhang and Y. Zhang, Nano Res., 2016, 9, 372–379 CrossRef CAS.
S. Trolier-McKinstry, F. Griggio, C. Yaeger, P. Jousse, D. Zhao, S. S. Bharadwaja, T. N. Jackson, S. Jesse, S. V. Kalinin and K. Wasa, IEEE Trans. Ultrason. Ferroelectrics Freq. Contr., 2011, 58, 1782–1792 Search PubMed.
G. A. Boni, L. D. Filip, C. Chirila, I. Pasuk, R. Negrea, I. Pintilie and L. Pintilie, Nanoscale, 2017, 9, 19271–19278 RSC.
M. Si, P. Y. Liao, G. Qiu, Y. Duan and P. D. Ye, ACS Nano, 2018, 12, 6700–6705 CrossRef CAS PubMed.
M. Si, A. K. Saha, P. Y. Liao, S. Gao, S. M. Neumayer, J. Jian, J. Qin, N. Balke Wisinger, H. Wang, P. Maksymovych, W. Wu, S. K. Gupta and P. D. Ye, ACS Nano, 2019, 13, 8760–8765 CrossRef CAS PubMed.
N. Ledermann, P. Muralt, J. Baborowski, M. Forster and J.-P. Pellaux, J. Micromech. Microeng., 2004, 14, 1650–1658 CrossRef CAS.
Y. Yan, J. E. Zhou, D. Maurya, Y. U. Wang and S. Priya, Nat. Commun., 2016, 7, 13089 CrossRef CAS PubMed.
S. M. Neumayer, A. V. Ievlev, L. Collins, R. Vasudevan, M. A. Baghban, O. Ovchinnikova, S. Jesse, K. Gallo, B. J. Rodriguez and S. V. Kalinin, ACS Appl. Mater. Interfaces, 2018, 10, 29153–29160 CrossRef CAS PubMed.
S. M. Neumayer, E. Strelcov, M. Manzo, K. Gallo, I. I. Kravchenko, A. L. Kholkin, S. V. Kalinin and B. J. Rodriguez, J. Appl. Phys., 2015, 118, 244103 CrossRef.
A. N. Morozovska, E. A. Eliseev, N. V. Morozovsky and S. V. Kalinin, Phys. Rev. B, 2017, 95, 195413 CrossRef.
A. V. Ievlev, P. Maksymovych, M. Trassin, J. Seidel, R. Ramesh, S. V. Kalinin and O. S. Ovchinnikova, ACS Appl. Mater. Interfaces, 2016, 8, 29588–29593 CrossRef CAS PubMed.
Y. Kim, A. N. Morozovska, A. Kumar, S. Jesse, E. A. Eliseev, F. Alibart, D. Strukov and S. V. Kalinin, ACS Nano, 2012, 6, 7026–7033 CrossRef CAS PubMed.
P. Maksymovych, J. Seidel, Y. H. Chu, P. Wu, A. P. Baddorf, L. Q. Chen, S. V. Kalinin and R. Ramesh, Nano Lett., 2011, 11, 1906–1912 CrossRef CAS PubMed.
B. J. Rodriguez, Y. H. Chu, R. Ramesh and S. V. Kalinin, Appl. Phys. Lett., 2008, 93, 142901 CrossRef.
J. Seidel, L. W. Martin, Q. He, Q. Zhan, Y. H. Chu, A. Rother, M. E. Hawkridge, P. Maksymovych, P. Yu, M. Gajek, N. Balke, S. V. Kalinin, S. Gemming, F. Wang, G. Catalan, J. F. Scott, N. A. Spaldin, J. Orenstein and R. Ramesh, Nat. Mater., 2009, 8, 229–234 CrossRef CAS PubMed.
E. Strelcov, A. V. Ievlev, S. Jesse, I. I. Kravchenko, V. Y. Shur and S. V. Kalinin, Adv. Mater., 2014, 26, 958–963 CrossRef CAS PubMed.
E. Strelcov, S. M. Yang, S. Jesse, N. Balke, R. K. Vasudevan and S. V. Kalinin, Nanoscale, 2016, 8, 13838–13858 RSC.
S. M. Yang, A. N. Morozovska, R. Kumar, E. A. Eliseev, Y. Cao, L. Mazet, N. Balke, S. Jesse, R. K. Vasudevan, C. Dubourdieu and S. V. Kalinin, Nat. Phys., 2017, 13, 812 Search PubMed.
N. Balke, I. Bdikin, S. V. Kalinin and A. L. Kholkin, J. Am. Ceram. Soc., 2009, 92, 1629–1647 CrossRef CAS.
S. Jesse, R. Vasudevan, L. Collins, E. Strelcov, M. B. Okatan, A. Belianinov, A. P. Baddorf, R. Proksch and S. V. Kalinin, Annu. Rev. Phys. Chem., 2014, 65, 519–536 CrossRef CAS PubMed.
S. V. Kalinin, S. Jesse, A. Tselev, A. P. Baddorf and N. Balke, ACS Nano, 2011, 5, 5683–5691 CrossRef CAS PubMed.
A. Kumar, Y. Ehara, A. Wada, H. Funakubo, F. Griggio, S. Trolier-McKinstry, S. Jesse and S. V. Kalinin, J. Appl. Phys., 2012, 112, 052021 CrossRef.
A. Gruverman and S. V. Kalinin, J. Mater. Sci., 2006, 41, 107–116 CrossRef CAS.
E. Soergel, J. Phys. D: Appl. Phys., 2011, 44, 464003 CrossRef.
L. A. Griffin, I. Gaponenko, S. Zhang and N. Bassiri-Gharb, npj Comput. Mater., 2019, 5, 85 CrossRef.
A. Belianinov, R. Vasudevan, E. Strelcov, C. Steed, S. M. Yang, A. Tselev, S. Jesse, M. Biegalski, G. Shipman, C. Symons, A. Borisevich, R. Archibald and S. Kalinin, Adv. Struct. Chem. Imaging, 2015, 1, 6 CrossRef PubMed.
J. C. Agar, Y. Cao, B. Naul, S. Pandya, S. van der Walt, A. I. Luo, J. T. Maher, N. Balke, S. Jesse, S. V. Kalinin, R. K. Vasudevan and L. W. Martin, Adv. Mater., 2018, 30, e1800701 CrossRef PubMed.
N. Borodinov, S. Neumayer, S. V. Kalinin, O. S. Ovchinnikova, R. K. Vasudevan and S. Jesse, npj Comput. Mater., 2019, 5, 25 CrossRef.
S. M. Neumayer, L. Collins, R. Vasudevan, C. Smith, S. Somnath, V. Y. Shur, S. Jesse, A. L. Kholkin, S. V. Kalinin and B. J. Rodriguez, ACS Appl. Mater. Interfaces, 2018, 10, 42674–42680 CrossRef CAS PubMed.
L. Li, Y. Yang, D. Zhang, Z.-G. Ye, S. Jesse, S. V. Kalinin and R. K. Vasudevan, Sci. Adv., 2018, 4, eaap8672 CrossRef PubMed.
H. Trivedi, V. V. Shvartsman, M. S. A. Medeiros, R. C. Pullar and D. C. Lupascu, npj Comput. Mater., 2018, 4, 28 CrossRef.
X. Li, L. Collins, K. Miyazawa, T. Fukuma, S. Jesse and S. V. Kalinin, Nat. Commun., 2018, 9, 2428 CrossRef PubMed.
S. Jesse, A. P. Baddorf and S. V. Kalinin, Appl. Phys. Lett., 2006, 88, 062908 CrossRef.
N. Balke, P. Maksymovych, S. Jesse, A. Herklotz, A. Tselev, C.-B. Eom, I. I. Kravchenko, P. Yu and S. V. Kalinin, ACS Nano, 2015, 9, 6484–6492 CrossRef CAS PubMed.
N. Balke, P. Maksymovych, S. Jesse, I. I. Kravchenko, Q. Li and S. V. Kalinin, ACS Nano, 2014, 8, 10229–10236 CrossRef CAS PubMed.
R. K. Vasudevan, N. Balke, P. Maksymovych, S. Jesse and S. V. Kalinin, Appl. Phys. Rev., 2017, 4, 021302 Search PubMed.
N. Balke, P. Maksymovych, S. Jesse, A. Herklotz, A. Tselev, C.-b. Eom, I. I. Kravchenko, P. Yu and S. V. Kalinin, ACS Nano, 2015, 9, 6484–6492 CrossRef CAS PubMed.
P. Virtanen, R. Gommers, T. E. Oliphant, M. Haberland, T. Reddy, D. Cournapeau, E. Burovski, P. Peterson, W. Weckesser, J. Bright, S. J. van der Walt, M. Brett, J. Wilson, K. J. Millman, N. Mayorov, A. R. J. Nelson, E. Jones, R. Kern, E. Larson, C. J. Carey, İ. Polat, Y. Feng, E. W. Moore, J. VanderPlas, D. Laxalde, J. Perktold, R. Cimrman, I. Henriksen, E. A. Quintero, C. R. Harris, A. M. Archibald, A. H. Ribeiro, F. Pedregosa, P. van Mulbregt, A. Vijaykumar, A. P. Bardelli, A. Rothberg, A. Hilboll, A. Kloeckner, A. Scopatz, A. Lee, A. Rokem, C. N. Woods, C. Fulton, C. Masson, C. Häggström, C. Fitzgerald, D. A. Nicholson, D. R. Hagen, D. V. Pasechnik, E. Olivetti, E. Martin, E. Wieser, F. Silva, F. Lenders, F. Wilhelm, G. Young, G. A. Price, G.-L. Ingold, G. E. Allen, G. R. Lee, H. Audren, I. Probst, J. P. Dietrich, J. Silterra, J. T. Webber, J. Slavič, J. Nothman, J. Buchner, J. Kulick, J. L. Schönberger, J. V. de Miranda Cardoso, J. Reimer, J. Harrington, J. L. C. Rodríguez, J. Nunez-Iglesias, J. Kuczynski, K. Tritz, M. Thoma, M. Newville, M. Kümmerer, M. Bolingbroke, M. Tartre, M. Pak, N. J. Smith, N. Nowaczyk, N. Shebanov, O. Pavlyk, P. A. Brodtkorb, P. Lee, R. T. McGibbon, R. Feldbauer, S. Lewis, S. Tygier, S. Sievert, S. Vigna, S. Peterson, S. More, T. Pudlik, T. Oshima, T. J. Pingel, T. P. Robitaille, T. Spura, T. R. Jones, T. Cera, T. Leslie, T. Zito, T. Krauss, U. Upadhyay, Y. O. Halchenko, Y. Vázquez-Baeza and C. SciPy, Nat. Methods, 2020, 17, 261–272 CrossRef CAS PubMed.
S. J. van der Walt, J. L. Schönberger, J. Nunez-Iglesias, F. Boulogne, J. Warner, N. Yager, E. Gouillart, T. Yu and the scikit-image contributors, PeerJ, 2014, 2, e453 CrossRef PubMed.
G. James, D. Witten, T. Hastie and R. Tibshirani, An Introduction to Statistical Learning, SpringerNew York, 2013 Search PubMed.
P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio and P.-A. Manzagol, J. Mach. Learn. Res., 2010, 11, 3371–3408 Search PubMed.
R. K. Vasudevan, H. Khassaf, Y. Cao, S. Zhang, A. Tselev, B. Carmichael, M. B. Okatan, S. Jesse, L.-Q. Chen, S. P. Alpay, S. V. Kalinin and N. Bassiri-Gharb, Adv. Funct. Mater., 2016, 26, 478–486 CrossRef CAS.
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot and E. Duchesnay, J. Mach. Learn. Res., 2011, 12, 2825–2830 Search PubMed.

Footnote

† Electronic supplementary information (ESI) available. See DOI: 10.1039/c9na00731h

Click here to see how this site uses Cookies. View our privacy policy here.