Skip to content
This repository has been archived by the owner on May 27, 2019. It is now read-only.

bibliocoll/csv2solr

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

# AUTHOR
Daniel Zimmel 2013 <[email protected]>
MPI Collective Goods Bonn, Library

# REQUIREMENTS
(bash) Perl, Awk, Java installed

# USAGE
csv2solr.sh [inputfile.csv]

# DESCRIPTION
csv2solr.sh converts a CSV input file to XML in Solr Schema.
For example, you might want to feed it to VuFind like this:

curl http://localhost:28083/solr/biblio/update -H 'Content-type:text/xml; charset=utf-8' --data-binary @update.solr.saxon.xml

1. discard duplicates
csv2solr.sh compares the inputfile.csv with a list of ISBNs (example: isbn.csv) and creates two files (duplicates only & inputfile.csv without duplicates)
Format: One line = one ISBN (for awk).
2. csv to xml conversion with csv2xml.pl
3. xml to xml (solr schema) with saxon. You might want to modify the example XSLT (vub-pda.xsl) to match your Solr schema.xml or input file.

# TODO
Improve csv2xml conversion speed. 
Make it more flexible (add option to not check for duplicates, etc.)

# REFERENCE
https://devtools.mpdl.mpg.de/projects/library2.0/wiki/projects/vufind/VUFNonMarcData	




About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published