NCBI
PubMed
A service of the
U.S. National Library of Medicine
and the
National Institutes of Health
My NCBI
[Sign In]
[Register]
All Databases
PubMed
Nucleotide
Protein
Genome
Structure
OMIM
PMC
Journals
Books
Search
Database name
PubMed
Protein
Nucleotide
GSS
EST
Structure
Genome
Books
CancerChromosomes
Conserved Domains
dbGaP
3D Domains
Gene
Genome Project
GENSAT
GEO Profiles
GEO DataSets
HomoloGene
Journals
MeSH
NCBI Web Site
NLM Catalog
OMIA
OMIM
PMC
PopSet
Probe
Protein Clusters
PubChem BioAssay
PubChem Compound
PubChem Substance
SNP
Taxonomy
ToolKit
UniGene
UniSTS
for
Search term
Go
Clear
Advanced Search
Limits
Preview/Index
History
Clipboard
Details
Your browser version may not work well with NCBI's Web applications. More information
here...
Display
Summary
Brief
Abstract
AbstractPlus
Citation
MEDLINE
XML
UI List
LinkOut
ASN.1
Related Articles
Cited in Books
CancerChrom Links
Domain Links
3D Domain Links
dbGaP Links
GEO DataSet Links
Gene Links
Gene (OMIM) Links
Gene (GeneRIF) Links
Genome Links
Project Links
GENSAT Links
GEO Profile Links
HomoloGene Links
Nucleotide Links
Nucleotide (RefSeq) Links
Nucleotide (Weighted) Links
EST Links
EST (RefSeq) Links
GSS Links
GSS (RefSeq) Links
OMIA Links
OMIM (calculated) Links
OMIM (cited) Links
BioAssay Links
Compound Links
Compound (MeSH Keyword)
Compound (Publisher) Links
Substance Links
Substance (MeSH Keyword)
Substance (Publisher) Links
PMC Links
Cited in PMC
PopSet Links
Probe Links
Protein Links
Protein (RefSeq) Links
Protein (Weighted) Links
Protein Cluster Links
Cited Articles
SNP Links
SNP (Cited)
Structure Links
Taxonomy via GenBank
UniGene Links
UniSTS Links
Show
5
10
20
50
100
200
500
Sort By
Pub Date
First Author
Last Author
Journal
Title
Send to
Text
File
Printer
Clipboard
Collections
E-mail
Order
All: 1
Review: 0
Click to change filter selection through MyNCBI.
1:
BMC Bioinformatics.
2005;6 Suppl 1:S17. Epub 2005 May 24.
Related Articles
,
Links
An evaluation of GO annotation retrieval for BioCreAtIvE and GOA.
Camon EB
,
Barrell DG
,
Dimmer EC
,
Lee V
,
Magrane M
,
Maslen J
,
Binns D
,
Apweiler R
.
European Molecular Biology Laboratory, European Bionformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK. goa@ebi.ac.uk
BACKGROUND: The Gene Ontology Annotation (GOA) database http://www.ebi.ac.uk/GOA aims to provide high-quality supplementary GO annotation to proteins in the UniProt Knowledgebase. Like many other biological databases, GOA gathers much of its content from the careful manual curation of literature. However, as both the volume of literature and of proteins requiring characterization increases, the manual processing capability can become overloaded. Consequently, semi-automated aids are often employed to expedite the curation process. Traditionally, electronic techniques in GOA depend largely on exploiting the knowledge in existing resources such as InterPro. However, in recent years, text mining has been hailed as a potentially useful tool to aid the curation process. To encourage the development of such tools, the GOA team at EBI agreed to take part in the functional annotation task of the BioCreAtIvE (Critical Assessment of Information Extraction systems in Biology) challenge. BioCreAtIvE task 2 was an experiment to test if automatically derived classification using information retrieval and extraction could assist expert biologists in the annotation of the GO vocabulary to the proteins in the UniProt Knowledgebase. GOA provided the training corpus of over 9000 manual GO annotations extracted from the literature. For the test set, we provided a corpus of 200 new Journal of Biological Chemistry articles used to annotate 286 human proteins with GO terms. A team of experts manually evaluated the results of 9 participating groups, each of which provided highlighted sentences to support their GO and protein annotation predictions. Here, we give a biological perspective on the evaluation, explain how we annotate GO using literature and offer some suggestions to improve the precision of future text-retrieval and extraction techniques. Finally, we provide the results of the first inter-annotator agreement study for manual GO curation, as well as an assessment of our current electronic GO annotation strategies. RESULTS: The GOA database currently extracts GO annotation from the literature with 91 to 100% precision, and at least 72% recall. This creates a particularly high threshold for text mining systems which in BioCreAtIvE task 2 (GO annotation extraction and retrieval) initial results precisely predicted GO terms only 10 to 20% of the time. CONCLUSION: Improvements in the performance and accuracy of text mining for GO terms should be expected in the next BioCreAtIvE challenge. In the meantime the manual and electronic GO annotation strategies already employed by GOA will provide high quality annotations.
Publication Types:
Evaluation Studies
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, P.H.S.
Validation Studies
PMID: 15960829 [PubMed - indexed for MEDLINE]
PMCID: PMC1869009
Display
Summary
Brief
Abstract
AbstractPlus
Citation
MEDLINE
XML
UI List
LinkOut
ASN.1
Related Articles
Cited in Books
CancerChrom Links
Domain Links
3D Domain Links
dbGaP Links
GEO DataSet Links
Gene Links
Gene (OMIM) Links
Gene (GeneRIF) Links
Genome Links
Project Links
GENSAT Links
GEO Profile Links
HomoloGene Links
Nucleotide Links
Nucleotide (RefSeq) Links
Nucleotide (Weighted) Links
EST Links
EST (RefSeq) Links
GSS Links
GSS (RefSeq) Links
OMIA Links
OMIM (calculated) Links
OMIM (cited) Links
BioAssay Links
Compound Links
Compound (MeSH Keyword)
Compound (Publisher) Links
Substance Links
Substance (MeSH Keyword)
Substance (Publisher) Links
PMC Links
Cited in PMC
PopSet Links
Probe Links
Protein Links
Protein (RefSeq) Links
Protein (Weighted) Links
Protein Cluster Links
Cited Articles
SNP Links
SNP (Cited)
Structure Links
Taxonomy via GenBank
UniGene Links
UniSTS Links
Show
5
10
20
50
100
200
500
Sort By
Pub Date
First Author
Last Author
Journal
Title
Send to
Text
File
Printer
Clipboard
Collections
E-mail
Order
About Entrez
Text Version
Entrez PubMed
Overview
Help
|
FAQ
Tutorials
New/Noteworthy
E-Utilities
PubMed Services
Journals Database
MeSH Database
Single Citation Matcher
Batch Citation Matcher
Clinical Queries
Special Queries
LinkOut
My NCBI
Related Resources
Order Documents
NLM Mobile
NLM Catalog
NLM Gateway
TOXNET
Consumer Health
Clinical Alerts
ClinicalTrials.gov
PubMed Central
Write to the Help Desk
NCBI
|
NLM
|
NIH
Department of Health & Human Services
Privacy Statement
|
Freedom of Information Act
|
Disclaimer