The 1.5 release of the Mobile Molecular DataSheet (MMDS) introduced a couple of major features, including a minimum viable feature deployment of OpenPHACTS assay integration. That is being quite liberal with the meaning of viable: building this into a practical scientific workflow is more of an ongoing campaign than a specific new piece of functionality, and so other features are being improved in lock step. The next one to get an upgrade is the searching capability, which has an additional preflight configuration block (shown to the right).
The search functionality relies on a middleware service called MetaSearch, which runs on the molsync.com server, along with various other supporting cheminformatics functionality. Its initial task is to farm out searches to PubChem, ChEBI and ChemSpider, each of which has its own strengths and weaknesses; its second task is to reconcile the results and provide a series of deduplication, homogenisation and useful post-processing: limited in scope at first, but growing to fit the needs of various front end workflows (e.g. scaffold analysis).
The newly added options block allows selection of the underlying database. By default it searches them all, but sometimes you know what you want (or rather, where you want it from). While performance is not much impacted (the sub-searches are submitted concurrently), in the case of substructure or similarity searches, it is often useful to restrict it to the most relevant database, so that the precious 100-item maximum cutoff is likely to keep more of the results you want (e.g. ChEBI tends to offer the most heavily curated drug-relevant compounds, but is much smaller than the other two; PubChem has the greatest diversity, but also the most junk; ChemSpider offers uniquely crowdsourced post-curation).
Awhile back, the ability to lookup and return vendor information for records was added. This still applies only to content that can be mapped to PubChem (though it will branch out someday). Unfortunately the lookup process is kinda slow, and so making the fetch optional (and off by default) is a useful feature. And for that matter, the results can also be restricted to include only those which have some available vendor information: if you only want to see compounds you can buy, select the Require option.
There is an innocuous-looking option right at the end entitled Assays. If this is switched to Require, what happens is that each potential result candidate is quickly checked against the OpenPHACTS service, to make sure that the corresponding molecule identifier has been recorded in that database, and for which there is at least one assay record (where assay is defined fairly liberally). It does not actually return the assay information (there’s a separate feature for that), but rather it allows you to gather a collection of molecules and strip out the ones that are unknown to OpenPHACTS. The overall nefarious plan here is to make it increasingly viable (and convenient) to find a bunch of related compounds, grab all relevant known information about their properties and activities, then continue on toward various kinds of SAR analysis and model building.
Other important improvements are: importing of resulting content from the search uses the more sophisticated assimilation process introduced recently to make importing of content more flexible. And, the source identifiers from the underlying databases are also imported (i.e. PubChem, ChEBI and ChemSpider ID codes for the content that was folded into each search result).