You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The ecosystem of Python-based NLP tools is much greater than what's available on the JVM - we want to look into changing DSTLR to be written in Python using PySpark and having our own extractors (e.g., BERT based NER, entity linking, relation extraction, etc.).
Let's start some initial exploration into this and see how it works? One of the biggest issues is that BERT is slow, but what about spaCy? It recently added entity linking in version 2.2 (unreleased) and has NER already. Can we look into training/adding a relational extractor?
from @r-clancy
The ecosystem of Python-based NLP tools is much greater than what's available on the JVM - we want to look into changing DSTLR to be written in Python using PySpark and having our own extractors (e.g., BERT based NER, entity linking, relation extraction, etc.).
Let's start some initial exploration into this and see how it works? One of the biggest issues is that BERT is slow, but what about spaCy? It recently added entity linking in version 2.2 (unreleased) and has NER already. Can we look into training/adding a relational extractor?
https://github.com/explosion/spaCy
The text was updated successfully, but these errors were encountered: