Skip to content

Converts list of materials from molecular formula to one hot vectors

Notifications You must be signed in to change notification settings

mcsorkun/Material-2-One-Hot-Vector

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Formula to One Hot Vector

This python script converts set of molecular formulas into the One Hot Vector.

It reads a CSV file that contains a column named "Formula" . (Example file: 2d-materials-list.csv )

Then it converts it into a One Hot Vector (actually it is a count vector: it contains number of elements).

Writes the result in to a CSV file (OneHotVector.csv).

This information can be used as a descriptors for machine learning processes.

Code also plots the occurence counts of each element:

alt text

About

Converts list of materials from molecular formula to one hot vectors

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages