Processors, filters, and parsers for various data formats, e.g. text, HTML, XML, and JSON.
-
SensitiveKeywordsFilter
Filters or matches sensitive words using either a regular expression or a trie.
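
To illustrate the two matching strategies named above, here is a minimal Python sketch. The names TrieKeywordFilter, find, mask, and regex_mask are hypothetical and are not taken from this project; the longest-match, non-overlapping policy is also an assumption.

```python
import re


class TrieKeywordFilter:
    """Hypothetical trie-based keyword matcher (not this project's API)."""

    _END = object()  # sentinel key marking the end of a stored keyword

    def __init__(self, words):
        self.root = {}
        for word in words:
            node = self.root
            for ch in word:
                node = node.setdefault(ch, {})
            node[self._END] = True

    def find(self, text):
        """Return (start, keyword) pairs, longest match first, non-overlapping."""
        matches = []
        i = 0
        while i < len(text):
            node = self.root
            longest = 0
            j = i
            # Walk the trie as far as the text allows, remembering the
            # longest complete keyword seen on the way.
            while j < len(text) and text[j] in node:
                node = node[text[j]]
                j += 1
                if self._END in node:
                    longest = j - i
            if longest:
                matches.append((i, text[i:i + longest]))
                i += longest
            else:
                i += 1
        return matches

    def mask(self, text, mask_char="*"):
        """Replace every matched keyword with mask_char, keeping the length."""
        out = list(text)
        for start, word in self.find(text):
            out[start:start + len(word)] = mask_char * len(word)
        return "".join(out)


def regex_mask(words, text, mask_char="*"):
    """Regex variant: alternation of escaped keywords, longest first."""
    ordered = sorted(words, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(w) for w in ordered))
    return pattern.sub(lambda m: mask_char * len(m.group()), text)


keyword_filter = TrieKeywordFilter(["bad", "badge", "evil"])
print(keyword_filter.find("a badge, evil"))  # [(2, 'badge'), (9, 'evil')]
print(keyword_filter.mask("a badge, evil"))  # a *****, ****
```

A trie tends to scale better when the keyword list is large, while the regex variant is shorter to write for a handful of words.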
-
PlainTextParser
Extracts semantic information for target tags in plain text.
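
The description does not say what a "tag" looks like in plain text. As a rough sketch only, the Python below assumes a tag is a label followed by a colon at the start of a line (for example "Title: ..."). The function name extract_tagged_values and this "label: value" format are assumptions, not this project's behavior.

```python
import re


def extract_tagged_values(text, tags):
    """Hypothetical helper: map each tag to the value after 'tag:' on a line.

    Matching is case-insensitive; tags that do not occur are omitted.
    """
    result = {}
    for tag in tags:
        pattern = re.compile(
            rf"^[ \t]*{re.escape(tag)}[ \t]*:[ \t]*(.+?)[ \t]*$",
            re.MULTILINE | re.IGNORECASE,
        )
        match = pattern.search(text)
        if match:
            result[tag] = match.group(1)
    return result


sample = "Title: Data tools\nAuthor : Jane Doe\nNote: n/a"
print(extract_tagged_values(sample, ["title", "author", "missing"]))
# {'title': 'Data tools', 'author': 'Jane Doe'}
```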
-
HTMLTextProcessor
Filters and decorates HTML content, then extracts the target value.
Please refer to the test cases or source code for specific usage.
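
For orientation, the filter-then-extract idea behind HTMLTextProcessor can be sketched with Python's standard html.parser module. The class TargetTextExtractor and the function extract_text are hypothetical names, not this project's API. The sketch assumes "filter" means dropping script and style content and "target value" means the text of a given tag; the decorate step is not covered beyond whitespace normalization.

```python
from html.parser import HTMLParser


class TargetTextExtractor(HTMLParser):
    """Hypothetical sketch: collect the text of every <target_tag> element."""

    SKIP = {"script", "style"}  # elements whose content is filtered out

    def __init__(self, target_tag):
        super().__init__()
        self.target_tag = target_tag
        self.values = []
        self._depth = 0  # nesting depth inside target elements
        self._skip = 0   # nesting depth inside filtered elements
        self._buf = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip += 1
        elif tag == self.target_tag:
            self._depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip:
            self._skip -= 1
        elif tag == self.target_tag and self._depth:
            self._depth -= 1
            if self._depth == 0:
                # Collapse runs of whitespace before storing the value.
                self.values.append(" ".join("".join(self._buf).split()))
                self._buf = []

    def handle_data(self, data):
        if self._depth and not self._skip:
            self._buf.append(data)


def extract_text(html, target_tag):
    parser = TargetTextExtractor(target_tag)
    parser.feed(html)
    parser.close()
    return parser.values


page = (
    "<div><h1> Hello <b>big</b>\n world</h1><p>skip</p>"
    "<h1>Second<script>var x = 1;</script></h1></div>"
)
print(extract_text(page, "h1"))  # ['Hello big world', 'Second']
```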