Express middleware which validates request bodies using SHACL Shapes
Uses rdf-validate-shacl to create the validation report.
If validation fails, returns an RFC 7807 Problem Details document extended to be valid JSON-LD resource.
The package exports a factory to create an express middleware handler. One parameter is required:
import express from 'express'
import { shaclMiddleware } from 'express-middleware-shacl'
import { rdf } from '@tpluscode/rdf-ns-builders'
import { NamedNode, DatasetCore } from 'rdf-js'
import { loadResources } from './store'
const app = express()
app.use(shaclMiddleware({
// Load the shapes to populate the Shapes Graph.
async loadShapes(req): Promise<DatasetCore> {
// Should at least use the payload resource types but might also select more shapes,
// such as any shape annotated on the existing resource in database
const types = req.shacl.dataGraph.out(rdf.type).terms
// Possible implementation could be a DESCRIBE query
/*
DESCRIBE ?shape1 ?shape2 ... ?shapeN
*/
},
// (Optional) Load rdf:type quads of the given resources
loadTypes(resources: NamedNode[]): Promise<DatasetCore> {
// For example, could be implemented as a SPARQL query
/*
CONSTRUCT { ?resource a ?type }
WHERE { ?resource a ?type }
*/
}
}))
The middleware works in four stages. Check the sections below for more explanations
- Create
req.shacl
structure onRequest
object - Load SHACL Shapes needed to validate
- Load instance types, if necessary
- Run validation
Downstream handlers will have access to req.shacl
, which is defined as
import type { AnyPointer } from 'clownface'
interface Shacl {
/**
* The requested RDF resources, parsed from body
*/
dataGraph: AnyPointer
/**
* The loaded Shapes
*/
shapesGraph: AnyPointer
}
By default, express-rdf-resource will be used to parse the incoming request. This can be changed by adding a custom middleware before SHACL.
import express from 'express'
import clownface from 'clownface'
const app = express()
app.use((req, res, next) => {
req.resource = async function() {
// an async function which should return a Graph Pointer
let dataset
return clownface({ dataset })
}
next()
})
app.use(shaclMiddleware({
// ...
}))
Consider a resource with links to another:
prefix schema: <http://schema.org/>
base <http://example.com/>
<Leonard> a <vocab/Person> ;
schema:spouse <Penny> ;
schema:knows <Howard>, <Sheldon> ;
.
The type <vocab/Person>
may be a shape which requires the object of schema:spouse
and
schema:knows
to also be an instances of the same type:
prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#>
prefix sh: <http://www.w3.org/ns/shacl#>
prefix schema: <http://schema.org/>
base <http://example.com/>
<vocab/Person>
a rdfs:Class, sh:NodeShape ;
sh:property [
sh:path schema:spouse ;
sh:class <vocab/Person> ;
] , [
sh:path schema:knows ;
sh:class <vocab/Person> ;
] ;
.
As-is, validation of the resource <Leonard>
will fail because a SHACL validation library will not
be able to verify that the linked resources also have the correct RDF type.
Given such shapes, it will be necessary to provide an implementation of the loadTypes
function. As
mentioned above, it could be implemented as SPARQL Construct. In a simplest case, when the store's
default graph would be queried fo all data, that query could be as simple as:
base <http://example.com/>
CONSTRUCT { ?instance a ?type }
WHERE {
VALUES ?instance {
# iterate the array passed to loadTypes
<Penny> <Howard> <Sheldon>
}
?instance a ?type .
}
If the req.shacl.shapesGraph
dataset is empty, the next middleware will be called.
If there are shapes in req.shacl.shapesGraph
but none of them have a target matching the requested
resources, Bad Request
will be the response. This is to prevent false-positives, in case when the
payload would be completely unrelated to the shapes, thus resuling in no validation actually happening.
Considered are:
- implicit class targets
sh:targetClass
,sh:targetNode
sh:targetSubjectsOf
.
If the validation succeeds (sh:conforms true
), the next middleware will be called.