GitHub - pooyaEst/WiktionaryParser: A Python Wiktionary Parser

Wiktionary Parser

A Python project that downloads words from English Wiktionary (en.wiktionary.org) and parses each article's content into an easy-to-use JSON format. It currently parses etymologies, definitions, pronunciations, examples, audio links and related words.

JSON structure

[{
    "pronunciations": {
        "text": ["pronunciation text"],
        "audio": ["pronunciation audio"]
    },
    "definitions": [{
        "relatedWords": [{
            "relationshipType": "word relationship type",
            "words": ["list of related words"]
        }],
        "text": ["list of definitions"],
        "partOfSpeech": "part of speech",
        "examples": ["list of examples"]
    }],
    "etymology": "etymology text"
}]
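As a minimal sketch of how a result with this shape might be consumed, the snippet below walks the list of entries and collects each definition alongside its part of speech. The `sample` data and the `list_definitions` helper are hypothetical and only illustrate the structure above; they are not part of the library.

```python
import json

# Hypothetical placeholder data shaped like the structure above
# (not real Wiktionary output).
sample = json.loads("""
[{
    "pronunciations": {"text": ["pronunciation text"], "audio": []},
    "definitions": [{
        "relatedWords": [{"relationshipType": "synonyms", "words": ["word a"]}],
        "text": ["first definition", "second definition"],
        "partOfSpeech": "noun",
        "examples": []
    }],
    "etymology": "etymology text"
}]
""")


def list_definitions(entries):
    """Return (part_of_speech, definition_text) pairs from parsed entries."""
    pairs = []
    for entry in entries:
        for definition in entry.get("definitions", []):
            for text in definition.get("text", []):
                pairs.append((definition.get("partOfSpeech"), text))
    return pairs


for pos, text in list_definitions(sample):
    print(f"{pos}: {text}")
```

Using `.get` with a default keeps the walk tolerant of entries where a key is absent, which is an assumption about real output rather than documented behaviour.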

Installation

Using pip

run pip install wiktionaryparser

From Source

Clone the repo or download and extract the zip
cd into the project folder
Run pip install -r requirements.txt

Usage

Import the WiktionaryParser class.
Initialize an object and use the fetch("word", "language") method.
The default language is English; change it with the set_default_language method.
Include or exclude parts of speech to be parsed using include_part_of_speech(part_of_speech) and exclude_part_of_speech(part_of_speech).
Include or exclude relations to be parsed using include_relation(relation) and exclude_relation(relation).

Examples

>>> from wiktionaryparser import WiktionaryParser
>>> parser = WiktionaryParser()
>>> word = parser.fetch('test')
>>> another_word = parser.fetch('test', 'french')
>>> parser.set_default_language('french')
>>> parser.exclude_part_of_speech('noun')
>>> parser.include_relation('alternative forms')

Requirements

requests==2.20.0
beautifulsoup4==4.4.0

Contributions

If you want to add features or improvements, or fix an issue you have found, feel free to send a pull request!

License

Wiktionary Parser is licensed under MIT.
