NextInnovation. Powered by Blogger.
RSS

SMILES notation




Assalamualaikum, today I'm will introduce you about SMILES application

Simplified molecular input line entry specification (SMILES)

it is a software describing the structure of chemical molecular using short ASCII strings. Smile strings can be imported by most molecule editors for conversion back into two-dimensional drawings or three-dimensional models of the molecules.

the image is the example of SMILES application.


Canonical SMILES and Isomeric SMILES

The term Canonical SMILES refers to the version of the SMILES specification that includes rules for ensuring that each distinct chemical molecule has a single unique SMILES representation
     - A common application of Canonical SMILES is for indexing and ensuring uniqueness of molecules in a          database

The term Isomeric SMILES refers to the version of the SMILES specification that includes extensions to support the specification of isotopes, chirality, and configuration about double bonds
      - A notable feature of these rules is that they allow rigorous partial specification of chirality.

SMILES Bonds

SINGLE -
DOUBLE =
TRIPLE #
AROMATIC  :

SMILES Branches

- Represented by enclosure in parentheses
- Can be nested or stacked
- Examples:

CC(O)CC is 2-Butanol
OCC(C)C is iso-Butanol
OC(C)(C)C is tert-Butanol

SMILES BONDS


Ethene C=C
Chloroethene ClC=C
1,1-Dichloroethene ClC(Cl)=C
cis-1,2-Dichloroethene ClC=CCl
Trichloroethene ClC(Cl)=CCl
Perchloroethene ClC(Cl)=C(Cl)Cl

Table below show some of molecular structures and its SMILES formula.

Application on some molecules

MoleculeStructureSMILES Formula
DinitrogenN≡NN#N
Methyl isocyanate (MIC)CH3–N=C=OCN=C=O
Copper(II) sulfateCu2+ SO42-[Cu+2].[O-]S(=O)(=O)[O-]
Œnanthotoxin (C17H22O2)Molecular structure of œnanthotoxinCCC[C@@H](O)CC\C=C\C=C\C#CC#C\C=C\CO
Pyrethrin II (C22H28O5)Molecular structure of pyrethrin IICOC(=O)C(\C)=C\C1C(C)(C)[C@H]1C(=O)O[C@@H]2C(C)=C(C(=O)C2)CC=CC=C
Aflatoxin B1 (C17H12O6)Molecular structure of aflatoxin B1O1C=C[C@H]([C@H]1O2)c3c2cc(OC)c4c3OC(=O)C5=C4CCC(=O)5
Glucose (glucopyranose) (C6H12O6)Molecular structure of glucopyranoseOC[C@@H](O1)[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O)1
Bergenin (cuscutin) (a resin) (C14H16O9)Molecular structure of cuscutine (bergenin)OC[C@@H](O1)[C@@H](O)[C@H](O)[C@@H]2[C@@H]1c3c(O)c(OC)c(O)cc3C(=O)O2
pheromone of the Californian scale insect(3Z,6R)-3-methyl-6-(prop-1-en-2-yl)deca-3,9-dien-1-yl acetateCC(=O)OCCC(/C)=C\C[C@H](C(C)=C)CCC=C
2S,5R-Chalcogran: a pheromone of the bark beetle Pityogenes chalcographus[8](2S,5R)-2-ethyl-1,6-dioxaspiro[4.4]nonaneCC[C@H](O1)CC[C@@]12CCCO2
VanillinMolecular structure of vanillinO=Cc1ccc(O)c(OC)c1
Melatonin (C13H16N2O2)Molecular structure of melatoninCC(=O)NCCC1=CNc2c1cc(OC)cc2
Flavopereirin (C17H15N2)Molecular structure of flavopereirinCCc(c1)ccc2[n+]1ccc3c2Nc4c3cccc4
Nicotine (C10H14N2)Molecular structure of nicotineCN1CCC[C@H]1c2cccnc2
Alpha-thujone (C10H16O)Molecular structure of thujoneCC(C)[C@@]12C[C@@H]1[C@@H](C)C(=O)C2
Thiamin (C12H17N4OS+)
(vitamin B1)
SMolecular structure of thiaminOCCc1c(C)[n+](=cs1)Cc2cnc(C)nc(N)2


to get more detail about SMILES and how to use it, click this link http://en.wikipedia.org/wiki/Simplified_molecular-input_line-entry_system.
to download the software go to this http://chemsketch.en.softonic.com/.

  • Digg
  • Del.icio.us
  • StumbleUpon
  • Reddit
  • RSS

0 comments:

Post a Comment