Tesserae

A large-scale feature extraction tool for text-based machine learning.

Building from Source

  1. Make sure you have installed the dependencies:

    • A recent version of g++ or clang
    • GNU make
    • CMake 3.0 or later
    • git
  2. Clone the source with git:

    $ git clone https://github.com/rmit-ir/tesserae.git
    $ cd tesserae
  3. Build and install:

    $ git submodule update --init --recursive --progress
    $ mkdir build
    $ cd build
    $ cmake ..
    $ make
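
Before running the build steps above, it can help to confirm that the listed dependencies are actually on your PATH. The following is a minimal sketch and not part of Tesserae: the helper name `check_tool` and the `missing` variable are hypothetical, and the script assumes that the C++ compiler requirement is met by either `g++` or `clang++` (the C++ driver for clang). It does not verify the CMake version; it only prints it so you can compare it against 3.0 yourself.

```shell
#!/bin/sh
# Hypothetical pre-build check (not shipped with Tesserae).
# Collects the names of any required tools that are not on PATH.

missing=""

# check_tool NAME: return 0 if NAME is an available command,
# otherwise record it in $missing and return 1.
check_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        return 0
    fi
    missing="$missing $1"
    return 1
}

# Assumption: either g++ or clang++ satisfies the compiler requirement.
if ! command -v g++ >/dev/null 2>&1 && ! command -v clang++ >/dev/null 2>&1; then
    missing="$missing g++-or-clang++"
fi

# The remaining tools from the dependency list.
for tool in make cmake git; do
    check_tool "$tool" || true
done

# Print the installed CMake version, if any, for manual comparison with 3.0.
if command -v cmake >/dev/null 2>&1; then
    cmake --version | head -n 1
fi

if [ -n "$missing" ]; then
    echo "Missing build dependencies:$missing" >&2
else
    echo "All build dependencies found."
fi
```

Save it under any name you like (for example `check-deps.sh`, a hypothetical filename) and run it with `sh check-deps.sh` before step 3. If it reports missing tools, install them through your system's package manager and run it again.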

Features

The toolkit offers a large number of text-based features that can be configured for extraction. See the feature list for the feature types and descriptions.

Documentation

For a quick tour, see the quick start guide. For more detail on specific topics, refer to the main documentation.

License

Tesserae is distributed under the terms of the MIT license.

See LICENSE for details.
