Tesserae

A large-scale feature extraction tool for text-based machine learning.

Building from Source

  1. Make sure you have installed the dependencies:

    • A recent version of g++ or clang
    • GNU make
    • cmake 3.0 or later
    • git
  2. Clone the source with git:

    git clone https://github.com/rmit-ir/tesserae.git
    cd tesserae
  3. Build and install:

    git submodule update --init --recursive --progress
    mkdir build
    cd build
    cmake ..
    make
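
Before starting the build, it can help to confirm that the dependencies listed in step 1 are actually on your `PATH`. The following is a minimal sketch, not a script shipped with Tesserae: the helper names `have` and `missing_tools` are hypothetical, and it assumes the clang C++ driver is installed as `clang++`. It only checks that the tools exist. It does not verify that cmake is 3.0 or later, so the cmake version is printed for you to check by eye.

```shell
#!/bin/sh
# Hypothetical helper, not part of Tesserae: report missing build tools.

# Succeeds if the named command is available on PATH.
have() {
    command -v "$1" >/dev/null 2>&1
}

# Prints the required tools that are not on PATH, one per line.
missing_tools() {
    # Either g++ or clang++ is enough (assumes clang's C++ driver is clang++).
    if ! have g++ && ! have clang++; then
        echo "g++ or clang++"
    fi
    for tool in make cmake git; do
        have "$tool" || echo "$tool"
    done
}

missing=$(missing_tools)
if [ -n "$missing" ]; then
    echo "Missing dependencies:"
    echo "$missing"
else
    echo "All build dependencies found."
    # Print the version line so it can be compared against the 3.0 minimum.
    cmake --version | head -n 1
fi
```

If the script reports missing tools, install them with your system's package manager before running the build steps above.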

Features

The toolkit offers a large number of text-based features that can be configured for extraction. See the feature list for the feature types and descriptions.

Documentation

For a quick tour, see the quick start guide. For more detail on specific topics, refer to the main documentation.

License

Tesserae is distributed under the terms of the MIT license.

See LICENSE for details.