Report:QPAT/Data Coverage/Patent Coverage/Bibliographic Coverage

From Intellogist

Jump to: navigation, search
  Report          
This search system report was created by the Intellogist Team and is available for viewing only. If you'd like to share your knowledge on Intellogist, please visit the Best Practices, Glossary, or Community Reports pages. Registered users may be notified of any substantial changes to this report by placing a "watch" on the Revisions page, which is the last page listed in the table of contents. To learn more about using the Intellogist "watchlist," see the Watchlist Help page.
As of January 1, 2013, both QPAT and PatentExaminer have been discontinued, and they have been replaced by the Orbit.com portal.

Bibliographic Coverage

QPAT’s bibliographic records are stored within the FamPat and PlusPat files, which draw from the EPO's INPADOC/DOCDB files, and also from the historical EDOC file (formerly produced by INPI, the French Patent Office, and now no longer available). The difference between FamPat and PlusPat is in their family organization structures; the data content should be the same. Where available, original or machine-translated abstracts originally present in the source data will be available in the FamPat/PlusPat files.

At the end of 2007, Questel began adding machine translated English titles and abstracts for German (DE), French (FR), Japanese (JP), South Korean (KR) and Chinese (CN) documents (this data is from 2007 forward; a backfile has not been produced). In addition, at the same time the company began receiving Asian data from JP, KR and CN (and later Taiwan, TW) on a “fast track” (from an outside producer): bibliographic data for documents from these countries will appear in the FamPat and PlusPat files within 2-3 weeks of publication. In 2009 Questel added enhanced bibliographic data collections for Asian countries, including:

  • Original English-language bibliographic data for Indian (IN) patent documents, which is comprehensive back to 2005 but contains some data back to 1975,
  • Machine-translated bibliographic data for Taiwan (TW) back to 1993, and
  • Human translated bibliographic data for Korean (KR) documents back to 1978.


In addition, QPAT notes that they have invested additional time and energy into supplementing their Korean (KR) collection. According to a QPAT representative, machine translations for the KR collection are replaced with hand translated English abstracts as soon as they are available from KIPRIS.[1] Further, a June 2009 press release announced that hand translations assignee data for both the JP and KR collections will be produced from May 2009 onward.[2]


editors note iconEditor's Note:

The 2008 addition of machine-translated English titles and abstracts to the FamPat file is a unique advantage to the data collection. The machine translation data is produced in-house at Questel, making this resource a potentially independent one from other search services that add their own machine translated data to bibliographic records, such as Minesoft PatBase.


FamPat

Produced by Questel-Orbit, the FamPat database is a simple family database (type 1 as described in the Patent Families article). It does apply some additional corrective techniques to Japanese documents, which allows it to identify Japanese publications which really do belong to a certain family, even if their priority information is not exactly the same. According to a Questel representative, these corrective measures are performed by an automated system, not an intellectual re-grouping effort.

FamPat will display a representative family member for each family in the database – this is usually done in a way that ensures the English-language document is chosen. However, via the system settings in QPAT, users can set their own preferences for what type of document will be shown as the representative.

Another notable feature about the FamPat file is that it contains text excerpts from the patents themselves, called “Key Content” by Questel. These excerpts are produced by linguistic extraction from the full text documents for families containing EP, US, and WO records, and consist of "Patent Object," "Advantages and Previous Patent Drawbacks," as well as "Independent Claims." Questel has partnered with Lingway,[3] a company that specializes in linguistic analysis and semantic text processing, to be able to extract this key content from the patent full text. Key content is only extracted from EP, US, and WO patent family members. The figure below shows an example of this content in a FamPat family record.


An example of semantically extracted "Key Content" in a FamPat family record. The content has been extracted from a US family member (US 2008/0224473A1). The paragraph numbers from the original document are visible in the figure.


The goal of including this extracted "Key Content" in the FamPat bibliographic/family file is to distill the important content of the major family members. The purpose of including only these targeted text excerpts is to reduce the white noise and false hits that can occur when searching full text files.

The FamPat file does not include significant additional bibliographic data sources beyond what is included in DOCDB. In addition to its "Key Content" text excerpts, the file contains title and abstract data, clipped images, citations with relevancy indicators, and current classification data (including US, IPC, ECLA, ICO (in-computer-only codes), and Japanese F-I/F-terms).

FamPat was introduced a few years after the PlusPat file, and offers an alternative to the restrictive family limitations in PlusPat. To learn more about FamPat families, see the Family Data section of this article.


editors note iconEditor's Note:

With FamPat, Questel can be suspected of following the successful model of the Derwent World Patents Index, a bibliographic/family file that contains re-written titles and abstracts created by a professional indexing team. However, FamPat's key content is extracted by machine, rather than created by human indexers, making the file much cheaper to produce, but without human indexing value added.


PlusPat

The PlusPat file was the original patent family database offered by Questel-Orbit. The family structures was based on strict patent family standards used internally for the EPO’s examiner database, EDOC/EPODOC (EDOC was the commercial version of EPODOC, and was produced by the French Patent Office). PlusPat contains only corresponding publication stages related to the same application; for example, a US published application and the corresponding US patent are grouped together into a “domestic” patent family (type 5 as described in the Patent Families article). The file contains title and abstract data, clipped images, citations with relevancy indicators, and current classification data (including US, IPC, ECLA, ICO (in-computer-only codes), and Japanese F-I/F-terms). PlusPat is currently created from DOCDB data. The file does not contain the "Key Content" text excerpts that are present in FamPat.

The original PlusPat database came from the logic used in the European patent examiner’s database, EPODOC. In organizing their database, the EPO grouped together all publication stages for a single patent, thereby preventing the unnecessary duplication caused by retrieving the published application, and the granted patent, as two separate hits. Thus, the PlusPat family definition is not really a theory of patent families; it is really just a convenient way to prevent coming across multiple publication stages of the same document during a search.

PlusPat does not contain any “Key content” text as FamPat does, but the records do contain bibliographic information, abstracts, and classifications, including ECLA, IPC, and US classes. Patent and non-patent citation data is also available. Search reports, patent and bibliographic cited references are available for:

  • Australia (AU)
  • Belgium (BE)
  • Cyprus (CY)
  • Denmark (DK)
  • European Patent Office (EP)
  • Finland (FI)
  • France (FR)
  • Germany (DE)
  • Italy (IT)
  • Japan, published applications (JP-A) and grants (JP-B)
  • The Netherlands (NL)
  • Patent Cooperation Treaty (WO/PCT)
  • Spain (ES)
  • Switzerland (CH)
  • Turkey (TR)
  • United Kingdom (GB)
  • United States (US)

In addition, relevancy indicators are also searchable; see Citation Coverage for more information. As with FamPat, the file also contains machine translated English titles and abstracts for DE, FR, JP, KR, TW and CN documents from 2007 forward. PlusPat also receives the “fast track” Asian bibliographic data from JP, KR, CN, and TW within 2-3 weeks of publication, as well as the English-language bibliographic collections for IN (original English), KR (human translations) and TW (machine translations) added in 2009.

PlusPat is produced by Questel, and it is accessible through QPAT; by changing the default settings users can make PlusPat their primary family database. Using PlusPat groups hits by only including various publication stages; in contrast, using FamPat groups hits (including their various publication stages) into larger families of related documents which all share the same original priority.

Even when PlusPat is chosen as the preferred family database, FamPat data is still used to group hits into patent families. There is no way to use PlusPat as a grouping tool in QPAT.

To learn more about patent family definitions, see the Patent Families article. To learn more about PlusPat families, see the Family Data section of this article.


FamPat/PlusPat Country Coverage

The following coverage dates were obtained from the system's user guide materials and represent the reported start dates for each collection. Although it seems clear that the system uses INPADOC as a major source of its bibliographic data, the system also provides this table of coverage information, which contains coverage dates that may not exactly match those provided by the EPO for the current INPADOC file. Bibliographic coverage information in the table below which appears to differ from what is available in INPADOC is denoted by an asterisk * symbol. Please note the date that this information was last updated, which is shown above the table. See Downloading INPADOC Coverage for the current coverage in the INPADOC bibliographic file.

Please note that gaps in coverage may exist in any collection due to a number of factors, such as missing documents, scanning errors, OCR errors, even failure of national and regional patent offices to report data. No data source can be considered 100% complete, and searchers should consult as many unique data sources as possible. See this article for a discussion of unique worldwide bibliographic and family data collections, such as INPADOC, the Derwent World Patents Index, and CAplus.


editors note iconEditor's Note:

Notably, although the EPO's INPADOC/DOCDB file is the major source for QPAT's FamPat and PlusPat files, the QPAT user materials show slightly different coverage than the most current EPO coverage data. For example, QPAT claims coverage of Trinidad and Tobago (TT), while the EPO does not. Upon investigation, it seems that only three records are present with a TT country code in QPAT, and these are present in the EPO's esp@cenet database also. So it seems fair to say that the QPAT bibliographic files do not offer substantial coverage beyond EPO data.

Also notably, when the EPO introduced Costa Rica (CR), Ecuador (EC), and Peru (PE) into its data feeds beginning in week 52/2008, QPAT had already loaded this new INPADOC content on the Monday of that introductory week, 12/22/2008. Empirical testing showed that this data was available on QPAT earlier than it was available from competitors Thomson Innovation, PatBase, and Delphion. However, QPAT did not list these countries as covered by the FamPat/PlusPat files for many weeks after this data's introduction.

Given these factors, it is generally safe for users to assume that despite somewhat out-of-date user coverage information, FamPat/PlusPat data corresponds to no more or less than the most recent INPADOC/DOCDB coverage, and that very timely updates to these files occur from EPO data feeds.


Bibliographic coverage table last updated September 02, 2008[4][5]
(Year in parenthesis indicates the year of earliest documents, column indicates comprehensive coverage date)

Country Country Code Coverage (from)
Argentina AR 1973
ARIPO AP 1984
Australia AU 1966
Austria AT 1969
Belgium (1926) BE 1964
Bosnia and Herzegovina BA 1998
Brazil BR 1973
Bulgaria BG 1973
Canada CA 1973
China CN 1986
Croatia HR 1994
Cuba CU 1974
Cyprus CY 1975
Czech Republic CZ 1993
Czechoslovakia CS 1973
Denmark DK 1968
Estonia EE 1995
Egypt EG 1976
EPO citations to Publications XP 1835
Eurasian Patents EA 1997
Finland FI 1968
France (1902) FR 1920
Germany (1877) DE 1968
Germany, East DD 1973
Great Britain (1909) GB 1963
Gulf Council GC 2002
Greece GR 1977
Hong Kong HK 1976
Hungary HU 1994
India IN 1975
Ireland IE 1973
Israel IL 1968
Italy IT 1973
Japan JP 1973
Kenya KE 1975
Korea, Republic of KR 1978
Latvia LV 1994
Lithuania LT 1994
Luxembourg (1946) LU 1960
Malawi MW 1973
Malta MT 1968
Mexico MX 1981
Moldova MD 1994
Monaco MC 1975
Mongolia MN 1972
Morocco MA 1977
Netherlands (1912) NL 1964
New Zealand NZ 1979
Norway NO 1968
OAPI OA 1966
Philippines PH 1975
Poland PL 1973
Portugal PT 1976
Romania RO 1973
Russian Federation RU 1972
Singapore SG 1983
Slovakia SK 1993
Slovenia SI 1992
South Africa ZA 1971
Soviet Union SU 1972
Spain ES 1968
Sweden SE 1968
Switzerland (1920) CH 1969
Tajikistan TJ 1998
Taiwan TW 2000
Trinidad & Tobago TT 1994
Turkey TR 1973
United States (1920) US 1968
Vietnam CN 1984
WIPO (PCT) Applications WO 1978
Yugoslavia YU 1973
Zambia ZM 1968
Zimbabwe ZW 1980


Sources

  1. E-mail correspondence from Questel representative, received June 18, 2009.
  2. "Summer e-communique: Focus on Asia." Questel website, http://www.questel.com/emailing/ecommunique_summer2009.htm. Accessed on June 23, 2009.
  3. "Questel•Orbit & Lingway sign Exclusive Agreement to Incorporate Technology Improving Full-Text Patent Reading Efficiency." Press Release, December 02, 2003. Questel website, http://www.questel.com/en/news/press/Lingway.pdf. Accessed on September 23, 2008.
  4. "QPAT's FamPat coverage information." QPAT website, http://www.qpat.com/help/en/fampat.htm. Accessed on September 02, 2008.
  5. "Questel’s FamPat Fact Sheet." Questel website, http://www.questel-orbit.fr/EN/customersupport/userdoc/fctsht/FamPat.pdf. Accessed on September 02, 2008.
Patent search questions. Expert answers.  Brought to you by Landon IP
HOT Items

Intellogist is brought to you by the patent search experts at Landon IP.

Welcome to Intellogist!

To network with our international community of patent info pros, please create an account.

For a list of our current members, see our Community Page.