Unified machine learning model for predicting polymer properties through human language instructions

hkqiu/Unified_ML4Polymers

PolyNC - the Unified Polymer Design Workflow with Language

This is the official repository of the paper "PolyNC: a natural and chemical language model for the prediction of unified polymer properties", published in Chemical Science. The work presents a model that enables rapid and precise prediction of Polymer properties through the power of Natural language and Chemical language (PolyNC). It extends the natural language understanding capabilities of AI to the field of polymer research, a step towards expert models and human-level AI for understanding polymer knowledge.

All data and code for model training and the polyScreen software are in this repository.

TMAP: HTML files of the Polymer Tree for each structure in the $T_g$ training and test datasets.

data: The training, test, and validation datasets used in this work.

notebooks: Jupyter notebooks (.ipynb, with code history) for training and running inference with PolyNC and the baseline models, and for attention analysis.

src: The Python file and configuration file for t-SNE analysis.

Our trained model is stored at this repo, where you can run some simple demos with the Hugging Face API.

Q&A

How to deploy and use?

  1. Access this website🤗, where you can run some demos with the Hugging Face API. For example, try: Predict the Tg of the following SMILES: c1cc(Oc2ccc(Oc3ccc(-n4c(=O)c5cc6c(c(=O)n()c6=O)cc5c4=O)cc3)cc2)ccc1 or Predict the heat resistance class of the following SMILES: c1cc(Oc2ccc(Oc3ccc(-n4c(=O)c5cc6c(c(=O)n()c6=O)cc5c4=O)cc3)cc2)ccc1. Just enjoy it!
  2. Polymer development is a central concern for polymer scientists. To support more customized use (custom tasks, custom max length, and other parameters), we are developing a Hugging Face playground for PolyNC. Stay tuned!🤗
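
For local use, the demo prompts above could be sent to the model with the `transformers` library. The following is only a minimal sketch under stated assumptions, not the authors' inference code: the model identifier `hkqiu/PolyNC` is a placeholder to be replaced with the ID shown on the model page, the checkpoint is assumed to load as a sequence-to-sequence model, and the helper names (`build_prompt`, `load_generator`, `predict`) and the `max_length` default are hypothetical. The authors' own inference code is in the notebooks folder.

```python
from typing import Callable, Optional

# Assumption: placeholder model identifier; replace it with the ID from the model page.
MODEL_ID = "hkqiu/PolyNC"


def build_prompt(task: str, smiles: str) -> str:
    """Format an instruction in the same wording as the demo prompts in this README."""
    return f"Predict the {task} of the following SMILES: {smiles.strip()}"


def load_generator(model_id: str = MODEL_ID, max_length: int = 64) -> Callable[[str], str]:
    """Load tokenizer and model, and return a function mapping a prompt to generated text.

    Assumes the checkpoint is a seq2seq model loadable with AutoModelForSeq2SeqLM.
    The import is deferred so that build_prompt works without transformers installed.
    """
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

    def generate(prompt: str) -> str:
        inputs = tokenizer(prompt, return_tensors="pt")
        output_ids = model.generate(**inputs, max_length=max_length)
        return tokenizer.decode(output_ids[0], skip_special_tokens=True)

    return generate


def predict(task: str, smiles: str, generate: Optional[Callable[[str], str]] = None) -> str:
    """Build the prompt for one task and one SMILES string, then return the model output."""
    if generate is None:
        generate = load_generator()  # downloads the checkpoint on first use
    return generate(build_prompt(task, smiles))
```

A call such as `predict("Tg", smiles)` or `predict("heat resistance class", smiles)` mirrors the two demo prompts. Passing a preloaded `generate` function avoids reloading the model for every query.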

If you have any issues with this work, please feel free to email hkqiu@ciac.ac.cn.

Cite this:

@Article{D3SC05079C,
author ="Qiu, Haoke and Liu, Lunyang and Qiu, Xuepeng and Dai, Xuemin and Ji, Xiangling and Sun, Zhao-Yan",
title  ="PolyNC: a natural and chemical language model for the prediction of unified polymer properties",
journal  ="Chem. Sci.",
year  ="2024",
volume  ="15",
issue  ="2",
pages  ="534-544",
publisher  ="The Royal Society of Chemistry",
doi  ="10.1039/D3SC05079C",
url  ="http://dx.doi.org/10.1039/D3SC05079C"}
