Decoding non-linearity and complexity: deep tabular learning approaches for materials science
Abstract
Materials datasets, particularly those capturing high-temperature properties, pose significant challenges for learning tasks due to their skewed distributions, wide feature ranges, and multimodal behaviors. While tree-based models such as XGBoost are inherently non-linear and often perform well on tabular problems, their reliance on piecewise constant splits can limit effectiveness when modeling the smooth, long-tailed, or higher-order relationships prevalent in advanced materials data. To address these challenges, we investigate the effectiveness of encoder–decoder models for data transformation using regularized Fully Dense Networks (FDN-R), Disjunctive Normal Form Networks (DNF-Net), 1D Convolutional Neural Networks (CNNs), and Variational Autoencoders, along with TabNet, a hybrid attention-based model. Our results indicate that while XGBoost remains competitive on simpler tasks, encoder–decoder models, particularly those based on regularized FDN-R and DNF-Net, generalize better on highly skewed targets such as creep resistance across small, medium, and large datasets. TabNet's attention mechanism offers moderate gains but underperforms on extreme values. These findings emphasize the importance of aligning model architecture with feature complexity and demonstrate the promise of hybrid encoder–decoder models for robust, generalizable materials property prediction from composition data.