article

PubChem is a database of chemical molecules. The system is maintained by the National Center for Biotechnology Information (NCBI) which belongs to the United States National Institutes of Health (NIH). PubChem can be accessed for free through a web user interface. PubChem contains mostly small molecules with a molecular mass below 500. The American Chemical Society have tried to get the U.S. Congress to restrict the operation of PubChem, because they claim it competes with their Chemical Abstracts Service.*.

Databases


PubChem consists of three primary databases:

  • Compounds, 5.2 million entries, contains pure and characterized chemical compounds.
  • Substances, 7.7 million entries, contains also mixtures, extracts, complexes and uncharacterized substances.
  • BioAssay, bioactivity results from 176 high throughput screening programs with several million values.

Searching


Searching the databases is possible for a broad range of properties including chemical structure, name fragments, chemical formula, molecular weight, XLogP, and hydrogen bond donor and acceptor count.

PubChem contains an own online molecule editor with SMILES/SMARTS and InChI support that allows the import and export of all common chemical file formats to search for structures and fragments.

Each hit provides information about synonyms, chemical properties, chemical structure including SMILES and InChI strings, bioactivity, and links to structurally related compounds and other NCBI databases like PubMed.

In the text search form the database fields can be searched by adding the field name in square brackets to the search term. A numeric range is represented by two numbers separated by a colon. The search terms and field names are case-insensitive. Parentheses and the logical operators AND, OR, and NOT can be used. AND is assumed if no operator is used.

Example (Lipinski's Rule of Five):

0:5000:5*" target="_blank" >-5:5[logp

Database fields



Identification numbers
Identification number in current database *
Substance identification number *
Compound identification number *
BioAssay identification number [AID

General
Any database field *
Comment *
Deposition date [DEPDAT
Depositor's external ID [SRCID
Source name *
Source release date *
Medical Subject Heading (MeSH) term [MESHT
MeSH tree node [MESHTN
MeSH pharmacological actions [PHARMA

Substance properties
Substance synonyms *
IUPAC name [IUPAC
International Chemical Identifier (InChI) *
Molecular weight *
Chemical elements [EL
Non-Hydrogen atoms [HACNT
Isotope count [IACNT
Total formal charge *
Chiral atom count [ACCNT
Defined chiral atom count [ACDCNT
Undefined chiral atom count [ACUCNT
Hydrogen bond acceptor count [HBACNT
Hydrogen bond donor count [HBDCNT
Tautomer count *
Rotatable bond count [RBCNT
XLogP [LOGP

Compound properties
Compound synonyms [CSYNO
Component count [CCNT
Covalent unit (molecule) count [CUCNT
Total bioactivity count *

See also


External links


Chemistry | National Institutes of Health | Online databases

PubChem | PubChem

 

This article is licensed under the GNU Free Documentation License. It uses material from the "PubChem".

Home Pageartsbusinesscomputersgameshealthhospitalshomekids & teensnewsphysiciansrecreationreferenceregionalscienceshoppingsocietysportsworld