Skip to content
forked from CeON/SegmEdit

Editor of training sets for page segmentation and zone classification of scholarly PDFs

License

Notifications You must be signed in to change notification settings

dmehar13/SegmEdit

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

About SegmEdit

SegmEdit allows to browse and edit XML files (in TrueViz format) containing information about structure of PDF documents (words, lines, zones) and about zones' classification (title, author, abstract, etc.) One of the components of the solution is a server responsible for distribution of documents to be processed.

SegmEdit was created in order to create a test suite for page segmentation and zone classification algorithms, which are part of a metadata extraction framework developed at CeON.

It is an open source software, wrote in Python using wxWidgets library and ImageMagick software suite.

Authors

Documentation

See files:

About

Editor of training sets for page segmentation and zone classification of scholarly PDFs

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 99.4%
  • Smarty 0.6%