Skip to content

oudalab/sqlite-fts-python

 
 

Repository files navigation

codeship build status appveyor build status


sqlitefts-python provides binding for tokenizer of SQLite Full-Text search(FTS3/4). It allows you to write tokenizers in Python.

SQLite has Full-Text search feature FTS3/FTS4 along with some predefined tokenizers. It is easy to use and has enough functionality. Python has a built-in SQLite module, so that it is easy to use and deploy. You don't need anything else to full-text search.

But... the predefined tokenizers are not enough for some languages including Japanese. Also it is not easy to write own tokenizers. This module provides ability to write your own tokenizers using Python without C compiler by CFFI.

It also has ranking functions based on peewee.

NOTE: all connections using this modules should be explicitly closed. due to GC behavior, it can be crashed if a connection is left open when a program terminated.


  • Python 2.7, Python 3.3+, and PyPy2.7, PyPy3.2+
  • CFFI
  • FTS3/4 enabled SQLite3 or APSW (for Windows, you may need to download and replace sqlite3.dll) * SQLite 3.11.x have to be compiled with -DSQLITE_ENABLE_FTS3_TOKENIZER to enable 2-arg fts3_tokenizer * SQLite 3.10.2 and older versions do not have extra requirements. 2-arg fts3_tokenizer is always avaiable. * SQLite 3.12.0 and later vesrions do not have extra requirements. 2-arg fts3_tokenizer can be enabled dynamically.

This software is released under the MIT License, see LICENSE.


About

A Python binding of SQLite Full Text Search Tokenizer

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 60.4%
  • Jupyter Notebook 38.9%
  • Shell 0.7%