Gerapy Auto Extractor Changelog
- Fix issues of similarity function
- Fix issues of dependencies
- Fix bug of get best cluster when extract result is None
- Fix extraction of title from
h
tag
- Add support for distinguishing list page between detail page
- Add test cases for classification
- Add
content
method support to get html content by file path
- Move
jsonify
to helper
module
- Fix missed extraction of list extractor
- Removed unnecessary logs unless set
APP_DEBUG
to true
- Fix extraction of content from
<footer>
tag
- Add support for
base_url
arg of extract_list
method
- Add test cases in
tests
folder
- Add more samples in
samples
folder
- Add
jsonify
method for converting json format
- Remove blank lines from result of
extract_content
method
- Add property
nth
in Element Class
- Add
nth
suffix of alias
property in Element Class