Technical information

SwePub is a service that harvests scientific publication metadata from the institutional repositories of Swedish higher education institutions (HEIs) and research organisations.

The SwePub metadata is accessible to end users via the SwePub search service and SwePub Analysis. It is also free to harvest or access through the OAI-PMH and SRU protocols. In addition, we have developed a lightweight API, Xsearch, that gives simple access to the data over HTTP. The metadata is available as data dumps as well.

List of data providers (Swedish HEIs and research organisations).

OAI-PMH

The data is exactly as we receive it from the data providers without any data manipulation.

About OAI-PMH: Open Archives Initiative

Supported metadata formats:

Base URL: http://api.libris.kb.se/swepub/oaipmh/SWEPUB

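Update frequency: every night. Some of the data providers support selective harvesting, while others perform a total reload every night. As a consequence, datestamps vary between providers, and records from a provider that reloads everything may carry a new datestamp each night.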
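A nightly incremental harvest against the base URL above could be sketched as follows. This is a minimal illustration, not the service's documented client: the function names are hypothetical, and the metadata prefix oai_dc is an assumption (the OAI-PMH specification requires every repository to support it, but this page does not list the supported formats). The verb, from and resumptionToken parameters come from the OAI-PMH protocol itself. The code only builds request URLs and parses a response page; fetching is left to the caller.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

BASE_URL = "http://api.libris.kb.se/swepub/oaipmh/SWEPUB"
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"


def list_records_url(metadata_prefix="oai_dc", from_date=None, resumption_token=None):
    """Build a ListRecords request URL (hypothetical helper).

    metadata_prefix="oai_dc" is an assumption, see the text above.
    When a resumption token is given, OAI-PMH requires it to be the
    only argument besides the verb.
    """
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if from_date:
            params["from"] = from_date  # selective harvesting by datestamp
    return BASE_URL + "?" + urlencode(params)


def parse_page(xml_text):
    """Return ([(identifier, datestamp), ...], next_token_or_None)."""
    root = ET.fromstring(xml_text)
    headers = [
        (h.findtext(OAI_NS + "identifier"), h.findtext(OAI_NS + "datestamp"))
        for h in root.iter(OAI_NS + "header")
    ]
    token = root.findtext(".//" + OAI_NS + "resumptionToken")
    # An empty or missing resumptionToken means the list is complete.
    return headers, (token or None)
```

A harvester would call list_records_url with the date of its last run, fetch the page, and keep requesting with the returned token until parse_page reports None. Because some providers reload everything nightly, a from-date filter may still return records whose content has not changed.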

SRU

SRU is an XML-based protocol for searching.

About SRU: Library of Congress standards

Base URL: http://api.libris.kb.se/sru/swepub
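As an illustration, a search request against this base URL might be built as below. This sketch assumes the endpoint accepts the standard SRU searchRetrieve parameters (version, operation, query, startRecord, maximumRecords) and a CQL query string; the protocol version "1.1" and the helper name are assumptions, not something this page confirms.

```python
from urllib.parse import urlencode

SRU_BASE = "http://api.libris.kb.se/sru/swepub"


def sru_search_url(cql_query, start_record=1, maximum_records=10, version="1.1"):
    """Build an SRU searchRetrieve URL (hypothetical helper).

    The parameter names follow the SRU standard; the version value
    is an assumption about what the endpoint supports.
    """
    params = {
        "version": version,
        "operation": "searchRetrieve",
        "query": cql_query,
        "startRecord": start_record,      # 1-based index of first record
        "maximumRecords": maximum_records,
    }
    return SRU_BASE + "?" + urlencode(params)
```

The response is an XML document, so paging through results amounts to increasing startRecord by maximumRecords on each request.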

Xsearch lightweight API


About Xsearch: http://librishelp.libris.kb.se/help/xsearch_eng.jsp

Base URL: http://libris.kb.se/xsearch?database=swepub
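A request to Xsearch could be assembled roughly like this. The parameter names query, format, n and start are assumptions based on the Xsearch help page linked above and should be checked against it; the helper name is hypothetical. Since the base URL already carries database=swepub, further parameters are appended with "&".

```python
from urllib.parse import urlencode

XSEARCH_BASE = "http://libris.kb.se/xsearch?database=swepub"


def xsearch_url(query, fmt="json", n=10, start=1):
    """Build an Xsearch URL (hypothetical helper).

    Parameter names (query, format, n, start) are assumed from the
    Xsearch documentation; verify them before relying on this.
    """
    params = {"query": query, "format": fmt, "n": n, "start": start}
    return XSEARCH_BASE + "&" + urlencode(params)
```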

Data dumps

Data dumps are available both from the SwePub search service and from SwePub Analysis via FTP.

Data dumps from SwePub search service

The dumps are formatted as OAI-PMH ListRecords responses with SwePub-modified MODS as the metadata format (see the OAI-PMH section above). For the deduplicated data, a repeatable non-standard element, <identifier2>, in the OAI-PMH record header lists the identifiers in a duplicate tuple.

Update frequency: every night.
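Reading the duplicate tuples out of a deduplicated dump might look like the sketch below. It assumes each record header holds one identifier element followed by zero or more identifier2 elements, as described above. Because this page does not say which XML namespace identifier2 belongs to, the code matches on local element names only. The function names are hypothetical.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip a '{namespace}' prefix from an ElementTree tag."""
    return tag.rsplit("}", 1)[-1]


def duplicate_groups(xml_text):
    """Map each header's identifier to its list of identifier2 values."""
    root = ET.fromstring(xml_text)
    groups = {}
    for elem in root.iter():
        if local_name(elem.tag) != "header":
            continue
        primary = None
        duplicates = []
        for child in elem:
            name = local_name(child.tag)
            if name == "identifier":
                primary = (child.text or "").strip()
            elif name == "identifier2":
                duplicates.append((child.text or "").strip())
        if primary is not None:
            groups[primary] = duplicates
    return groups
```

This loads the whole document into memory, which is fine for a sample but probably not for a full dump; for large files, xml.etree.ElementTree.iterparse would be the more suitable tool.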

Data dumps from SwePub Analysis

Listed below are the data dumps available via FTP from SwePub Analysis (the English name for SwePub för analys och bibliometri, the Swedish bibliometrics service based on aggregated publication posts from Swedish repositories delivered via OAI-PMH).

These dumps are provided with every monthly update and reindexing of SwePub Analysis. Although the dumps are replaced monthly, the old dumps are also kept so that historical data can be retrieved.

The following FTP address leads to an index page containing the SwePub Analysis data dump files, which are based on a complete export of all posts from all years with all available fields. In addition to a file with the full data dump, the page contains smaller zipped files divided by year.

Available data dumps

2016
December 

Index page
  • Full datadump (zipped file: all posts, years and fields)
  • - 1999 (zipped file: all posts and fields)
  • 2000-2005 (zipped file: all posts and fields)
  • 2006-2010 (zipped file: all posts and fields)
  • 2011 (zipped file: all posts and fields)
  • 2012 (zipped file: all posts and fields)
  • 2013 (zipped file: all posts and fields)
  • 2014 (zipped file: all posts and fields)
  • 2015 (zipped file: all posts and fields)
  • 2016- (zipped file: all posts and fields, from 2016 and posts with future dates)

November

Index page
  • Full datadump (zipped file: all posts, years and fields)
  • - 1999 (zipped file: all posts and fields)
  • 2000-2005 (zipped file: all posts and fields)
  • 2006-2010 (zipped file: all posts and fields)
  • 2011 (zipped file: all posts and fields)
  • 2012 (zipped file: all posts and fields)
  • 2013 (zipped file: all posts and fields)
  • 2014 (zipped file: all posts and fields)
  • 2015 (zipped file: all posts and fields)
  • 2016- (zipped file: all posts and fields, from 2016 and posts with future dates)

Update frequency: monthly.

 

Last updated: 2017-01-02