Title of article :
A comparative study of patent sequence databases (Original Research Article)
Author/Authors :
Piet Jan Andree, Mark F. Harper, Stéphane Nauche, Robert A. Poolman, Jo Shaw, Joop C. Swinkels, Sally Wycherley
Issue Information :
Journal with serial issue number, year 2008
Abstract :
Nucleic acid and protein sequence data from patent publications is available from a plurality of commercial and public sources. As the searching and analysis of this data is of crucial importance to the life sciences industry, the Patent Documentation Group’s Biotechnology Information Working Group conducted a study to critically compare and evaluate patent sequence databases for data content. A series of sequences were searched to find similar sequences from several well-known sources: GENESEQ™, CAS REGISTRY/CAplus℠, PCTGEN, NCBI GenBank®, EMBL-Bank and the EBI Fasta databases. The study highlights some differences between GENESEQ™ and REGISTRY/CAplus℠ results within the context of indexing policy and patent coverage. In comparison to the proprietary databases, the authors have identified important deficiencies in the content of the public databanks. This paper also discusses database timeliness and the choice of algorithm as potential reasons for missing data.
Keywords :
Sequence databases , Sequence searching , Patent sequences , REGISTRY , GENESEQ , CAplus , GenBank , PCTGEN , EMBL-Bank , EBI Fasta , PDG , Patent Documentation Group , Biosequences
Journal title :
World Patent Information