Reaction Prediction Models: Chapter 14 – Large Language Models

Large Language Models (LLMs or, in the parlance of our times, “AI”) have some potential value for reversing the translation of chemical reaction experiments into scientific English, into something more digitally friendly. Or put another way, there is an enormous amount of chemical reaction data that exists only as text, and if there was a less labour-intensive way to extract it, we would be much better off.

Continue reading →

Reaction Prediction Models: Chapter 13 – Using with ELNs

Filling out content for an electronic lab notebook (ELN) is one of the main high value workflows for reaction drawing. This includes drawing the outline of a reaction that is to be performed in the lab and writing up experiments that have been completed. Either way having the reaction scheme as complete as possible, viewable by chemists and meaningful to digital archives, is important.

Continue reading →

Reaction Prediction Models: Chapter 12 – Synthesis Planning Markup

There are a number of capable synthesis planning tools available to chemists. A retrosynthesis algorithm usually returns the barebones reactant/product pair for each step, which leaves plenty of work to do before the experiment is ready for the lab. Reaction prediction tools can help get these suggestions closer to a proposed experiment.

Continue reading →

Reaction Prediction Models: Chapter 11 – Exporting Graphics

The reaction editor described in these articles can read & write a number of different formats, which means it does not exist in isolation. Reactions can be exported using informatics formats or as publication-ready presentation graphics.

Continue reading →

Reaction Prediction Models: Chapter 10 – Training Data

The technology described in these articles is based on models that were built using a custom training set, or in the case of procedural algorithms, validated using that same content. The training set is unusual and proprietary, as it has been curated to a very high degree of completion and accuracy relative to any other available reaction data.

Continue reading →

Reaction Prediction Models: Chapter 9 – Component-Based Reaction Editing

There are two main approaches to drawing reactions: the canvas model, where everything is drawn onto a flat page, and the component model, where each reaction participant is treated as a discrete object. Both of these approaches have important advantages, and significant problems which have to be addressed.

Continue reading →

Reaction Prediction Models: Chapter 8 – Aligned Depiction

Sometimes the world of informatics overlaps strongly with the universe of human comprehension, and reaction component alignment is one of these cases: when reactants and products are drawn with a common orientation, it can be made very easy and immediately apparent to anyone with basic chemistry experience what is going on in the reaction. An arbitrary molecule layout on the other hand can impose a fairly high cognitive burden.

Continue reading →

ai artificial-intelligence chemical reactions chemistry reagent-proposal retrosynthesis science

Reaction Prediction Models: Chapter 7 – Backward/Forward Synthesis and Reagents

Using models to propose recommendations for chemical reactions is appropriate for many of the steps needed to fill out the scheme, but sometimes it’s more effective to pick an existing reaction and use it as a template to build out the missing pieces. This approach can be applied to forward and backward syntheses (starting from a reactant or product respectively) and also to finding and proposing stoichiometric reagents.

Continue reading →

chemical reactions chemistry machine-learning yield

Reaction Prediction Models: Chapter 6 – Yield Contours

Once the reaction scheme components are all present and marked up, the last step in this toolchain is to propose a coarse-grained set of conditions, namely the duration, temperature and catalyst concentration that is likely to provide the best yield.

Continue reading →

chemical reactions literature-lookup machine-learning

Reaction Prediction Models: Chapter 5 – Literature

Sometimes the best way to validate a prediction is to go check the literature and look for the most similar examples, to see if it looks reasonable. Fortunately the data that these reaction prediction tools are based only mostly comes with a DOI attached, so this feature can be added automagically.

Continue reading →

Cheminformatics 2.0

A blog about chemical information software for next generation computing environments.

Author: Dr. Alex M. Clark

Reaction Prediction Models: Chapter 14 – Large Language Models

Reaction Prediction Models: Chapter 13 – Using with ELNs

Reaction Prediction Models: Chapter 12 – Synthesis Planning Markup

Reaction Prediction Models: Chapter 11 – Exporting Graphics

Reaction Prediction Models: Chapter 10 – Training Data

Reaction Prediction Models: Chapter 9 – Component-Based Reaction Editing

Reaction Prediction Models: Chapter 8 – Aligned Depiction

Reaction Prediction Models: Chapter 7 – Backward/Forward Synthesis and Reagents

Reaction Prediction Models: Chapter 6 – Yield Contours

Reaction Prediction Models: Chapter 5 – Literature