Skip to content
/ Ldtools Public

ldtools - A lightweight "ORM" to handle Linked Data Resources and their Origins

License

Notifications You must be signed in to change notification settings

dmr/Ldtools

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Ldtools

Ldtools is a simple library to handle RDF data in a convenient way. It can be used as a simple ORM for Linked Data Resources and their Origins. A resource "http://dbpedia.org/resource/Karlsruhe" might be mentioned by different origins. Ldtools helps to keep track of verified statements (authoritative) about resources and provides an API to query more information about resources.

Different rdf triple storage backends are provided for the retrieved data: RestBackend, FileBackend or MemoryBackend

The CLI ldtools can be used to retrieve information about linked data resources.

Installation

Just use pip:

pip install Ldtools

This will install all dependencies (argparse, rdflib) needed and provide the command line utility "ldtools".

Alternatively, do a git clone and execute python setup.py install/develop.

How to use it?

Via the commandline, all information within a Linked Data resource can be retrieved by executing:

ldtools http://dbpedia.org/resource/Karlsruhe

Further options can be utilised to influence whether the URIs that are discovered within the origin should be followed and how deep. Try:

ldtools --help

for more usage information.

Alternatively, the python console can be used:

First, we create an Origin object:

import pprint, rdflib
from ldtools.resource import Resource
from ldtools.origin import Origin

uri = "http://dbpedia.org/resource/Karlsruhe"
origin, created = Origin.objects.get_or_create(uri)

Process it, hopefully discovering more Origins in there (rdfs:seeAlso, owl:sameAs...):

origin.GET(only_follow_uris=[rdflib.OWL.sameAs,rdflib.RDFS.seeAlso])

If everything went well, there now is a Resource object for our uri:

resource, created = Resource.objects.get_or_create(uri, origin=origin)
pprint.pprint(resource.__dict__)

Process all the other Origins we know about:

Origin.objects.GET_all()

Result: 5 URIs crawled and 500 Resources discovered and processed.

Why?

  • The Semantic Web is out there and there is really not enough tools yet to work with Linked Data
  • SPARQL is not needed to get the RAW data from resources, this library demonstrates that. Just the basic Linked Data Stack: URIs, Content Negotiation, RDF needed
  • ldtools intends to make it easy to handle the data you get from an URI and to follow links you discover
  • Based on that, you can modify your objects and PUT them back to their origin

Tests

To run the tests, install spec and/or nose and run nose:

pip install spec coverage
nosetests --with-coverage --cover-package=ldtools
nosetests --with-specplugin

Build Status status_image

Contributions/Credits

Feel free to submit ideas and bugs to http://github.com/dmr/Ldtools/issues, I'll be happy to accept pull requests for new features.

Thank you Travis CI for running the tests :)

Thanks to Django, Flask, peewee and sentry for inspiration regarding model structure!

About

ldtools - A lightweight "ORM" to handle Linked Data Resources and their Origins

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages