Text Serialization is the process of turning tabular data into textual representations, first proposed by TabLLM.
In this project, we are interested in addressing two questions in regards to text serialization. In the first part of our research, we compare how text serialization compares to traditional tabular machine learning paradigms in data curation. In the second part of our research, we explore how text serialization can be used to address common challenges in tabular machine learning and whether they are better than existing machine learning methodologies.
We apologize for the inconvenience but we will be at ICML 2024 AI4Science Workshop for poster session #2 in Hall A8 to present our paper. We will be updating this webpage with more information soon.
@article{ono2024text,
title={Text Serialization and Their Relationship with the Conventional Paradigms of Tabular Machine Learning},
author={Ono, Kyoka and Lee, Simon A},
journal={arXiv preprint arXiv:2406.13846},
year={2024}}
If there are any questions or concerns, please feel free to reach out to us at simonlee711@g.ucla.edu.