Skip to content

A simple python script to grab publication lists from cristin.no

Notifications You must be signed in to change notification settings

Roberthaf/Pub-Grab

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

50 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Pub-Grab

pubgrab.py will generate an html publication list for given authors, retrieving records from the CRISTIN database of Norwegian scientific publications.

Sample output: Centre for Integrative Genetics' publication list.

Prerequisites

Installation

git clone https://github.com/Roberthaf/Pub-Grab.git
cd Pub-Grab
python pubgrab.py --help

To process non-ascii characters, please read the help text below closely. If your console code page and python input/output encoding are set correctly, the example near the end should show "Våge" with an "å".

Usage instructions

usage: pubgrab.py [-h] [-d] [--fra FRA] [--til TIL] [--hovedkategori HKAT]
				  [--clear]
				  [authors [authors ...]]

Compile HTML bibliography from CRISTIN for list of authors.

If no authors are given, read from stdin.

positional arguments:
  authors               list of authors, e.g. 'Jane Doe' 'John Deere'

optional arguments:
  -h, --help            show this help message and exit
  -d, --debug           log debug messages
  --fra FRA             from year
  --til TIL             to year
  --hovedkategori HKAT  Hovedkategori, see http://www.cristin.no/cristin/super
						brukeropplaering/ws-
						dokumentasjon.html#hovedkategorier2011
  --clear               clear cache

To work with non-ascii author names, set the console code page and Python i/o encoding to utf-8.
In a Windows command shell:

> CHCP 65001
> SET PYTHONIOENCODING=UTF-8

Then run e.g.

> python pubgrab.py 'Dag Inge Våge'
> python pubgrab.py < people.txt > publications.html

Cristin API resources

About

A simple python script to grab publication lists from cristin.no

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published