Then consider an embedding of the chemical space into the unit sphere of dimension n (typically, n ~ 300), for example Mol2Vec or Smiles2Vec (PNNL). Such an embedding is analogous to Word2Vec in Natural Language Processing. We can then identify each molecule in the chemical space with its image on the sphere under the embedding, and compute the entropy of A with the formula: