You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
So I was trying to parse content from multiple document formats and turns out it works for other document formats pdf, doc etc. but not for html files somehow
below is the minimal example with sample html
main.go
package main
import (
"fmt""log""code.sajari.com/docconv"
)
funcmain() {
// Attempt to read filetxt, err:=docconv.ConvertPath("test.html")
iferr!=nil {
log.Fatal(err)
}
fmt.Println(txt.Body)
}
test.html
<!DOCTYPE html><html><body><h1>This is heading 1</h1><h2>This is heading 2</h2><h3>This is heading 3</h3><h4>This is heading 4</h4><h5>This is heading 5</h5><h6>This is heading 6</h6></body></html>
As of now output is blank
also I noticed that there's no release from 2019 feb so code.sajari.com might be sending older library is there any way to maybe pre-release? version or configure CI to do that
The text was updated successfully, but these errors were encountered:
So I was trying to parse content from multiple document formats and turns out it works for other document formats
pdf
,doc
etc. but not for html files somehowbelow is the minimal example with sample html
main.go
test.html
As of now output is blank
also I noticed that there's no release from 2019 feb so
code.sajari.com
might be sending older library is there any way to maybe pre-release? version or configure CI to do thatThe text was updated successfully, but these errors were encountered: