You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Oct 4, 2022. It is now read-only.
A TextContainer object should have a getTree method that returns the tree based on the text. This tree should be generated by a linguistic parser that knows how to split a text into sentences and words. The getTree method should return a tree with Sentence and Word objects.
The Sentence object should contain the content of the sentence and the relative indexes within the text container.
The Word object should contain the content of the word and the relative indexes within the text container.
Explanation
A
TextContainer
object should have agetTree
method that returns the tree based on the text. This tree should be generated by a linguistic parser that knows how to split a text into sentences and words. ThegetTree
method should return a tree withSentence
andWord
objects.Sentence
object should contain the content of the sentence and the relative indexes within the text container.Word
object should contain the content of the word and the relative indexes within the text container.Better suggestions for the name of the linguistic parser are welcome. The linguistic parser should use the code we already have available in the current code. So the sentence parser can be reused. I've created an issue to track the removal of the HTML specific code from the sentence parser
Tasks
Technical decisions
Feedback?
The text was updated successfully, but these errors were encountered: