You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Since we're moving to a model where the webapp is the component that interprets our curation rules and not Sinequa, we need to write an interpreter module.
Implementation Considerations
Main module:
Pattern interpreter: Given a pattern, it returns the URLs in the system that match this pattern. This already exists.
Types of rules:
Exclude rules
For these we can simply ignore the URLs being excluded from being added to the final JSON output. We shouldn't delete them off the webapp but rather store them with include=False like we already do.
Include rules
These need to go on the final JSON output. You can add patterns here.
Title rules
Rewrite the old title with a new title
Use the {title} variable to reference the old title and add something to the beginning or end of it.
XML component: Use xpaths to select something on the webpage that should be used in the above (don't implement yet.)
Document Type rules
Replace sourcestr56 for certain URLs.
Deliverable
A working module that can interpret rules as outlined above, and integration with existing pipelines.
The text was updated successfully, but these errors were encountered:
Description
Since we're moving to a model where the webapp is the component that interprets our curation rules and not Sinequa, we need to write an interpreter module.
Implementation Considerations
Main module:
Pattern interpreter: Given a pattern, it returns the URLs in the system that match this pattern. This already exists.
Types of rules:
Exclude rules
For these we can simply ignore the URLs being excluded from being added to the final JSON output. We shouldn't delete them off the webapp but rather store them with include=False like we already do.
Include rules
These need to go on the final JSON output. You can add patterns here.
Title rules
XML component: Use xpaths to select something on the webpage that should be used in the above (don't implement yet.)Replace sourcestr56 for certain URLs.
Deliverable
A working module that can interpret rules as outlined above, and integration with existing pipelines.
The text was updated successfully, but these errors were encountered: