Markush Structures & Combinatorial LibrariesFingerprinting & DictionariesClusteringDiversity AnalysisChemical Query Conversion
Torus™ToolkitsMain ProgramsWeb ServicesThird Party Integration
  About Us
  Products
  Consulting
  Support
  News & Events
  Contact Us
  Sitemap
 
Click here to login
 
 
... Diversity Analysis
This page gives an introduction to Diversity Analysis and includes:
An Introduction to Diversity Analysis
Digital Chemistry Diversity Analysis Tools
How to get more Information and Evaluation Software
An Introduction to Diversity Analysis

The concept of similarity between compounds leading to grouping of a dataset based on the similarities between structures, clustering, has been discussed elsewhere under Digital Chemistry Clustering Tools. The opposite of this is the concept of dataset analysis based on measures of dissimilarity, Diversity Analysis.

Diversity Analysis includes the calculation of the structural or property diversity of datasets. It can be used in diverse subset selection methods that enable the extraction of subsets of maximally dissimilar compounds from a dataset.

This is of particular importance in constructing combinatorial libraries and for biological screening programmes. Diversity Analysis finds much application therefore in the pharmaceutical industry, where one needs to select a small subset of compounds that best cover a very large dataset. The cost and time savings in needing to synthesise only this small number of compounds to investigate potential drug candidates over vast chemical space are quite apparent.

Diversity Analysis is assisted by the generation of two general dataset representations, the centroid fingerprint and the modal fingerprint. The centroid fingerprint is a form of average fingerprint for the whole dataset and can be used to calculate a measure of the average dataset dissimilarity and also to calculate the change in average dataset dissimilarity if two datasets were to be merged.The modal fingerprint is only applicable to datasets in which the compounds are represented by fingerprints and are very useful in analysing the frequency of incidence of each element of a fingerprint across the whole dataset. Click here for more information about fingerprints.

Digital Chemistry Diversity Analysis Tools

Digital Chemistry offers a comprehensive Diversity Analysis package for the rapid analysis of dataset diversity and the selection of subsets of maximally dissimilar compounds.

The key features of Digital Chemistry Diversity Analysis Tools include:

  • Generation and use of centroid and modal fingerprints
  • Output of diversity information based on centroid and modal fingerprints
  • Extraction of a user-requested number of maximally dissimilar compounds from a dataset.

Digital Chemistry Diversity Analysis is available in 2 formats as listed below, if you would like more detailed information about these please click on the links:

Digital Chemistry also supports the following operating systems, for a full list of hardware and software requirements for Digital Chemistry products please click here.

  • Windows
  • SUN Solaris
  • Linux
How to get more Information and Evaluation Software
If you would like any more information about Digital Chemistry's software or if you would like to request an evaluation copy please contact us, our details are given opposite.
Top

 

 
   
  search :
     
  For general enquiries, contact:
T: +44 (0)113 2181851
F: +44 (0)113 2181869
E: info@digitalchemistry.co.uk

The Iron Shed
Harewood House Estate
Harewood
Leeds LS17 9LF
United Kingdom