Identification of catalysts #92

Open
dswigh opened this issue May 5, 2023 · 0 comments

Comments

Collaborator

dswigh commented May 5, 2023

Therapeutics Data Commons (TDC) provides a list of catalysts (https://tdcommons.ai/multi_pred_tasks/catalyst/) that could potentially be used to identify catalysts, in a similar way to how we identify solvents. However, identifying catalysts from a list is not necessarily a good idea, since the same organic molecules can play many different roles depending on the context (so identifying solvents by list is probably less error-prone than identifying catalysts this way).

One way to improve catalyst identification would be to segment the catalyst list by reaction class, so that only a subset of all catalysts is associated with a particular reaction class (potentially by clustering catalysts and associating the clusters with reaction classes from name rxn). However, this matching process would be very laborious and goes beyond the scope of ORDerly, which is built to be primarily computationally extensible and not to rely on hand-crafted rules.
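To make the idea concrete, here is a minimal sketch of list-based identification restricted by reaction class. Everything in it is an assumption for illustration: the function `identify_catalysts`, the `CATALYSTS_BY_CLASS` mapping, the class names, and the SMILES strings are placeholders and are not taken from TDC or from ORDerly's code. It also assumes the molecule strings have already been canonicalised the same way as the list entries.

```python
from typing import Dict, FrozenSet, Iterable, List, Optional

# Hypothetical, hand-written mapping from reaction class to catalyst SMILES.
# In practice this is the laborious part: it would have to be built by
# clustering catalysts and matching clusters to reaction classes.
CATALYSTS_BY_CLASS: Dict[str, FrozenSet[str]] = {
    "suzuki_coupling": frozenset({"[Pd]", "Cl[Pd]Cl"}),
    "hydrogenation": frozenset({"[Pd]", "[Pt]", "[Ni]"}),
}

# The unsegmented list is just the union of all per-class subsets.
ALL_CATALYSTS: FrozenSet[str] = frozenset().union(*CATALYSTS_BY_CLASS.values())


def identify_catalysts(
    molecules: Iterable[str], reaction_class: Optional[str] = None
) -> List[str]:
    """Return the molecules that appear in the catalyst list.

    If the reaction class is known, only that class's subset is used;
    otherwise the function falls back to the full (more error-prone) list.
    """
    allowed = CATALYSTS_BY_CLASS.get(reaction_class, ALL_CATALYSTS)
    return [mol for mol in molecules if mol in allowed]


if __name__ == "__main__":
    agents = ["[Pt]", "CCO", "[Pd]"]
    print(identify_catalysts(agents, "suzuki_coupling"))
    print(identify_catalysts(agents))
```

With the class given, only molecules in that class's subset are labelled as catalysts. Without it, anything on the full list matches, which is where the context problem described above shows up.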
