NextInnovation. Powered by Blogger.
RSS

PDB Protein Data Bank

 
 
 
 
 
 
 
 
Assalamualaikum.Today we will explain about PBD protein Data Bank.
 
The Protein Data Bank (PDB) is a repository for the three-dimensional structural data of large biological molecules, such as proteins and nucleic acids. (See also crystallographic database.) The data, typically obtained by X-ray crystallography or NMR spectroscopy and submitted by biologists and biochemists from around the world, are freely accessible on the Internet via the websites of its member organisations (PDBe, PDBj, and RCSB). The PDB is overseen by an organization called the Worldwide Protein Data Bank, wwPDB.
The PDB is a key resource in areas of structural biology, such as structural genomics. Most major scientific journals, and some funding agencies, such as the NIH in the USA, now require scientists to submit their structure data to the PDB. If the contents of the PDB are thought of as primary data, then there are hundreds of derived (i.e., secondary) databases that categorize the data differently. For example, both SCOP and CATH categorize structures according to type of structure and assumed evolutionary relations; GO categorize structures based on genes.

HISTORY


Two forces converged to initiate the PDB: 1) a small but growing collection of sets of protein structure data determined by X-ray diffraction; and 2) the newly available (1968) molecular graphics display, the Brookhaven RAster Display (BRAD), to visualize these protein structures in 3-D. In 1969, with the sponsorship of Walter Hamilton at the Brookhaven National Laboratory, Edgar Meyer (Texas A&M University) began to write software to store atomic coordinate files in a common format to make them available for geometric and graphical evaluation. By 1971, one of Meyer's programs, SEARCH, enabled researchers to remotely access information from the database to study protein structures offline. SEARCH was instrumental in enabling networking, thus marking the functional beginning of the PDB.
Upon Hamilton's death in 1973, Tom Koeztle took over direction of the PDB for the subsequent 20 years. In January 1994, Joel Sussman of Israel's Weizmann Institute of Science was appointed head of the PDB. In October 1998,[ the PDB was transferred to the Research Collaboratory for Structural Bioinformatics (RCSB);the transfer was completed in June 1999. The new director was Helen M. Berman of Rutgers University (one of the member institutions of the RCSB). In 2003, with the formation of the wwPDB, the PDB became an international organization. The founding members are PDBe (Europe),[ RCSB (USA), and PDBj (Japan). The BMRB joined in 2006. Each of the four members of wwPDB can act as deposition, data processing and distribution centers for PDB data. The data processing refers to the fact that wwPDB staff review and annotate each submitted entry. The data are then automatically checked for plausibility (the source code for this validation software has been made available to the public at no charge).

Contents

The PDB database is updated weekly (UTC+0 Wednesday). Likewise, the PDB holdings list is also updated weekly. As of 1 October 2013 (2013-10-01), the breakdown of current holdings is as follows:

Experimental
Method
ProteinsNucleic AcidsProtein/Nucleic Acid
complexes
OtherTotal
X-ray diffraction7778114844074383342
NMR88671047193710114
Electron microscopy474451290648
Hybrid5232158
Other1514613174
Total:87325258344042494336
72,884 structures in the PDB have a structure factor file.
7,424 structures have an NMR restraint file.
1,179 structures in the PDB have a chemical shifts file.
602 structures in the PDB have a 3DEM map file.
These data show that most structures are determined by X-ray diffraction, but about 10% of structures are now determined by protein NMR. When using X-ray diffraction, approximations of the coordinates of the atoms of the protein are obtained, whereas estimations of the distances between pairs of atoms of the protein are found through NMR experiments. Therefore, the final conformation of the protein is obtained, in the latter case, by solving a distance geometry problem. A few proteins are determined by cryo-electron microscopy. (Clicking on the numbers in the original table will bring up examples of structures determined by that method.)
The significance of the structure factor files, mentioned above, is that, for PDB structures determined by X-ray diffraction that have a structure file, the electron density map may be viewed. The data of such structures is stored on the "electron density server".

Growth trend

In the past, the number of structures in the PDB has grown at an approximately exponential rate.However, since 2007, the rate of accumulation of new proteins appears to have plateaued:

Number of searchable structures per year
 
Year# addedTotal
2012897887,089
2011807278,111
2010789770,039
2009738062,142
2008695654,762
2007719847,806
2006647340,608
2005535934,135
2004518028,776
2003416723,596
2002300119,429
2001283116,428
2000262713,597
1999236010,970
199820578,610
199715656,553
199611724,988
19959453,816
199412892,871
19936961,582
1992192886
1991187694
1990142507
198974365
198853291
198725238
198618213
198520195
198422175
198336153
198232117
19811685
19801669
19791153
1978642
19772336
19761313
197500
197400
197300
197200

Note: searchable structures vary over time as some become obsolete and are removed from the database. Template:SVG Chart
 

Software Tools

Various software tools can minimize the amount of manual labor.Help scientists deposit their results quicker.Also help validate their results

Commonly used software:
– pdb_extract
– ADIT
– PDB Validation Suite
 
PDB Extract Tool
pdb_extract can extract information from the output of standard crystallographic programs. It merges the information into mmC.IF files at each step of the structure-determination process.These mmCIF files are then ready for validation and deposition.

 Data Deposition

AutoDep Input Tool

ADIT is available both as web-based tool and a standalone tool.It is used for assembling, editing, validating and deposition structural data.ADIT is built on top of the mmCIF dictionary.Data will go through series of computerized validation procedures.

PDB Validation Suite

PDB Validation Suite creates reports based upon the validation results.Reports are generated in plain text and PostScript formats.It also calculates derived information that could be used for assessing the quality of a structure.

PDB Validation

The geometry of the macromolecule is checked against known standards for distances and angles.Coordinate data are checked against results from other experiments.Results of the validations are reviewed by the annotation staff and then returned to the author for review.

Data Deposition
Processing time, including all author correspondence average about two weeks.ADIT along with other data collection and validation tools enables the depositors to pre-check all aspects of the structure before submission.Besides the RCSB site, data are processed by PDBj in Osaka and European Bioinformatics Institute
 

Introduction To Rasmol
 
  RasMol is an important scientific tool for visualisation of molecules created by Roger Sayle in 1992. RasMol is used by hundreds of thousands of users world-wide to view macromolecules and to prepare publication-quality images. Science is best served when the tools we use are fully understood by those who wield those tools and by those who make used of results obtained with those tools. When a scientific tool exists as software, access to source code is an important element in achieving full understanding of that tool. As our field evolves and new versions of software are required, access to source allows us to adapt our tools quickly and effectively.

You can download from this website RaSmOL
 
 
 


protein picture description
Glucose is a major source of energy in your body, but unfortunately, free glucose is relatively rare in our typical diet. Instead, glucose is locked up in many larger forms, including lactose and sucrose, where two small sugars are connected together, and long chains of glucose like starches and glycogen. One of the major jobs of digestion is to break these chains into their individual glucose units, which are then delivered by the blood to hungry cells throughout your body.
Bacteria pull no punches when they fight to protect themselves. Some bacteria build toxins so powerful that a single molecule can kill an entire cell. This is far more effective than chemical poisons like cyanide or arsenic. Chemical poisons attack important molecules one by one, so many, many molecules of cyanide are needed to kill a cell. Bacterial toxins use two strategies to make their toxins far more deadly than this.
During the holiday season, we often place greater demands on our digestive enzymes than at other times of the year. Our digestive system contains a host of tough, stable enzymes designed to seek out those rich holiday treats and break them into small pieces. Pepsin is the first in a series of enzymes that digest proteins. In the stomach, protein chains bind in the deep active site groove of pepsin, seen in the upper illustration (from PDB entry 5pep), and are broken into smaller pieces. Then, a variety of proteases and peptida...
As you read this Molecule of the Month, the light from the page is being focused in your eyes by a concentrated solution of crystallin proteins. The lenses in your eyes are built of long cells that, early in their development, filled themselves with crystallins and then made the major sacrifice, ejecting their nuclei and mitochondria and leaving only a smooth, transparent solution of protein. We then rely on these proteins to see for the rest of our lives.
Bacteria pose a continual threat of infection, both to humans and to other higher organisms. Thus, when looking for new ways to fight infection, it is often productive to look at how other plants, animals and fungi protect themselves. This is how penicillin was discovered. Through a chance observation in 1928, Alexander Fleming discovered that colonies of Penicillium mold growing in his bacterial cultures were able to stave off infection. With more study, he found that the mold was flooding the culture with a molecule that killed the bacteria, penicillin.

  • Digg
  • Del.icio.us
  • StumbleUpon
  • Reddit
  • RSS

SMILES notation




Assalamualaikum, today I'm will introduce you about SMILES application

Simplified molecular input line entry specification (SMILES)

it is a software describing the structure of chemical molecular using short ASCII strings. Smile strings can be imported by most molecule editors for conversion back into two-dimensional drawings or three-dimensional models of the molecules.

the image is the example of SMILES application.


Canonical SMILES and Isomeric SMILES

The term Canonical SMILES refers to the version of the SMILES specification that includes rules for ensuring that each distinct chemical molecule has a single unique SMILES representation
     - A common application of Canonical SMILES is for indexing and ensuring uniqueness of molecules in a          database

The term Isomeric SMILES refers to the version of the SMILES specification that includes extensions to support the specification of isotopes, chirality, and configuration about double bonds
      - A notable feature of these rules is that they allow rigorous partial specification of chirality.

SMILES Bonds

SINGLE -
DOUBLE =
TRIPLE #
AROMATIC  :

SMILES Branches

- Represented by enclosure in parentheses
- Can be nested or stacked
- Examples:

CC(O)CC is 2-Butanol
OCC(C)C is iso-Butanol
OC(C)(C)C is tert-Butanol

SMILES BONDS


Ethene C=C
Chloroethene ClC=C
1,1-Dichloroethene ClC(Cl)=C
cis-1,2-Dichloroethene ClC=CCl
Trichloroethene ClC(Cl)=CCl
Perchloroethene ClC(Cl)=C(Cl)Cl

Table below show some of molecular structures and its SMILES formula.

Application on some molecules

MoleculeStructureSMILES Formula
DinitrogenN≡NN#N
Methyl isocyanate (MIC)CH3–N=C=OCN=C=O
Copper(II) sulfateCu2+ SO42-[Cu+2].[O-]S(=O)(=O)[O-]
Œnanthotoxin (C17H22O2)Molecular structure of œnanthotoxinCCC[C@@H](O)CC\C=C\C=C\C#CC#C\C=C\CO
Pyrethrin II (C22H28O5)Molecular structure of pyrethrin IICOC(=O)C(\C)=C\C1C(C)(C)[C@H]1C(=O)O[C@@H]2C(C)=C(C(=O)C2)CC=CC=C
Aflatoxin B1 (C17H12O6)Molecular structure of aflatoxin B1O1C=C[C@H]([C@H]1O2)c3c2cc(OC)c4c3OC(=O)C5=C4CCC(=O)5
Glucose (glucopyranose) (C6H12O6)Molecular structure of glucopyranoseOC[C@@H](O1)[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O)1
Bergenin (cuscutin) (a resin) (C14H16O9)Molecular structure of cuscutine (bergenin)OC[C@@H](O1)[C@@H](O)[C@H](O)[C@@H]2[C@@H]1c3c(O)c(OC)c(O)cc3C(=O)O2
pheromone of the Californian scale insect(3Z,6R)-3-methyl-6-(prop-1-en-2-yl)deca-3,9-dien-1-yl acetateCC(=O)OCCC(/C)=C\C[C@H](C(C)=C)CCC=C
2S,5R-Chalcogran: a pheromone of the bark beetle Pityogenes chalcographus[8](2S,5R)-2-ethyl-1,6-dioxaspiro[4.4]nonaneCC[C@H](O1)CC[C@@]12CCCO2
VanillinMolecular structure of vanillinO=Cc1ccc(O)c(OC)c1
Melatonin (C13H16N2O2)Molecular structure of melatoninCC(=O)NCCC1=CNc2c1cc(OC)cc2
Flavopereirin (C17H15N2)Molecular structure of flavopereirinCCc(c1)ccc2[n+]1ccc3c2Nc4c3cccc4
Nicotine (C10H14N2)Molecular structure of nicotineCN1CCC[C@H]1c2cccnc2
Alpha-thujone (C10H16O)Molecular structure of thujoneCC(C)[C@@]12C[C@@H]1[C@@H](C)C(=O)C2
Thiamin (C12H17N4OS+)
(vitamin B1)
SMolecular structure of thiaminOCCc1c(C)[n+](=cs1)Cc2cnc(C)nc(N)2


to get more detail about SMILES and how to use it, click this link http://en.wikipedia.org/wiki/Simplified_molecular-input_line-entry_system.
to download the software go to this http://chemsketch.en.softonic.com/.

  • Digg
  • Del.icio.us
  • StumbleUpon
  • Reddit
  • RSS