Thank you for your interest in contributing!
Please take a moment to review this document in order to make the contribution process easy and effective for everyone involved.
Following these guidelines helps to communicate that you respect the time of the developers managing and developing this open-source project. In return, and in accordance with this project's code of conduct, other contributors will reciprocate that respect in addressing your issue or assessing changes and features.
If you have questions or would like to communicate with the team, please join us in our public Matrix chat rooms. We'd be happy to hear from you!
- First steps as a contributor
- Learning the tech stack
- Development environment
- Testing
- Issues and projects
- Bug reports
- Feature requests
- Pull requests
- Data edits
- Documentation
First steps as a contributor ⇧
Thank you for your interest in contributing to Scribe-Data! We look forward to welcoming you to the community and working with you to build tools for language learners to communicate effectively :) The following are some suggested steps for people interested in joining our community:
- Please join the public Matrix chat to connect with the community
- Read through this contributing guide for all the information you need to contribute
- Look into issues marked
good first issue
and the Projects board to get a better understanding of what you can work on - Check out our public designs on Figma to understand Scribes's goals and direction
- Consider joining our bi-weekly developer sync!
Note
Those new to Python or wanting to work on their Python skills are more than welcome to contribute! The team would be happy to help you on your development journey :)
Learning the tech stack ⇧
Scribe is very open to contributions from people in the early stages of their coding journey! The following is a select list of documentation pages to help you understand the technologies we use.
Docs for those new to programming
- Mozilla Developer Network Learning Area
- Doing MDN sections for HTML, CSS and JavaScript is the best ways to get into web development!
- Open Source Guides
- Guides from GitHub about open-source software including how to start and much more!
Development environment ⇧
Important
Suggested IDE extensions
VS Code
The development environment for Scribe-Data can be installed via the following steps:
- Fork the Scribe-Data repo, clone your fork, and configure the remotes:
Note
Consider using SSH
Alternatively to using HTTPS as in the instructions below, consider SSH to interact with GitHub from the terminal. SSH allows you to connect without a user-pass authentication flow.
To run git commands with SSH, remember then to substitute the HTTPS URL, https://github.com/...
, with the SSH one, [email protected]:...
.
- e.g. Cloning now becomes
git clone [email protected]:<your-username>/Scribe-Data.git
GitHub also has their documentation on how to Generate a new SSH key 🔑
# Clone your fork of the repo into the current directory.
git clone https://github.com/<your-username>/Scribe-Data.git
# Navigate to the newly cloned directory.
cd Scribe-Data
# Assign the original repo to a remote called "upstream".
git remote add upstream https://github.com/scribe-org/Scribe-Data.git
- Now, if you run
git remote -v
you should see two remote repositories named:origin
(forked repository)upstream
(Scribe-Data repository)
- Use Python venv to create the local development environment within your Scribe-Data directory:
-
On Unix or MacOS, run:
python3 -m venv venv # make an environment named venv source venv/bin/activate # activate the environment
-
On Windows (using Command Prompt), run:
python -m venv venv venv\Scripts\activate.bat
-
On Windows (using PowerShell), run:
python -m venv venv venv\Scripts\activate.ps1
After activating the virtual environment, install the required dependencies and set up pre-commit by running:
pip install --upgrade pip # make sure that pip is at the latest version
pip install -r requirements.txt # install dependencies
pip install -e . # install the local version of Scribe-Data
pre-commit install # install pre-commit hooks
# pre-commit run --all-files # lint and fix common problems in the codebase
If you face any issues, consider reinstalling Scribe-data by running the following:
# Install the new version of Scribe-Data:
pip uninstall scribe-data
pip install . # or pip install scribe-data
# Update the entry_points and console_scripts:
python setup.py egg_info
Note that you may need to run this command every time you make any change to the code to have them be reflected in the development Scribe-Data:
pip install -e .
Note
Feel free to contact the team in the Data room on Matrix if you're having problems getting your environment setup!
Testing ⇧
In addition to the pre-commit hooks that are set up during the development environment section, Scribe-Data also includes a testing suite that should be ran before all pull requests and subsequent commits. Please run the following in the project root:
pytest
Issues and projects ⇧
The issue tracker for Scribe-Data is the preferred channel for bug reports, features requests and submitting pull requests. Scribe also organizes related issues into projects.
Note
Just because an issue is assigned on GitHub doesn't mean that the team isn't interested in your contribution! Feel free to write in the issues and we can potentially reassign it to you.
Be sure to check the -next release-
and -priority-
labels in the issues for those that are most important, as well as those marked good first issue
that are tailored for first time contributors.
Bug reports ⇧
A bug is a demonstrable problem that is caused by the code in the repository. Good bug reports are extremely helpful - thank you!
Guidelines for bug reports:
-
Use the GitHub issue search to check if the issue has already been reported.
-
Check if the issue has been fixed by trying to reproduce it using the latest
main
or development branch in the repository. -
Isolate the problem to make sure that the code in the repository is definitely responsible for the issue.
Great Bug Reports tend to have:
- A quick summary
- Steps to reproduce
- What you expected would happen
- What actually happens
- Notes (why this might be happening, things tried that didn't work, etc)
To make the above steps easier, the Scribe team asks that contributors report bugs using the bug report template, with these issues further being marked with the bug
label.
Again, thank you for your time in reporting issues!
Feature requests ⇧
Feature requests are more than welcome! Please take a moment to find out whether your idea fits with the scope and aims of the project. When making a suggestion, provide as much detail and context as possible, and further make clear the degree to which you would like to contribute in its development. Feature requests are marked with the feature
label, and can be made using the feature request template.
Pull requests ⇧
Good pull requests - patches, improvements and new features - are the foundation of our community making Scribe-Data. They should remain focused in scope and avoid containing unrelated commits. Note that all contributions to this project will be made under the specified license and should follow the coding indentation and style standards (contact us if unsure).
Please ask first before embarking on any significant pull request (implementing features, refactoring code, etc), otherwise you risk spending a lot of time working on something that the developers might not want to merge into the project. With that being said, major additions are very appreciated!
When making a contribution, adhering to the GitHub flow process is the best way to get your work merged:
-
If you cloned a while ago, get the latest changes from upstream:
git checkout <dev-branch> git pull upstream <dev-branch>
-
Create a new topic branch (off the main project development branch) to contain your feature, change, or fix:
git checkout -b <topic-branch-name>
-
Commit your changes in logical chunks, and please try to adhere to Conventional Commits.
Note
The following are tools and methods to help you write good commit messages ✨
- commitlint helps write Conventional Commits
- Git's interactive rebase cleans up commits
-
Locally merge (or rebase) the upstream development branch into your topic branch:
git pull --rebase upstream <dev-branch>
-
Push your topic branch up to your fork:
git push origin <topic-branch-name>
-
Open a Pull Request with a clear title and description.
Thank you in advance for your contributions!
Data edits ⇧
Note
Please see the Wikidata and Scribe Guide for an overview of Wikidata and how Scribe uses it.
Scribe does not accept direct edits to the grammar JSON files as they are sourced from Wikidata. Edits can be discussed and the Scribe-Data queries will be changed and ran before an update. If there is a problem with one of the files, then the fix should be made on Wikidata and not on Scribe. Feel free to let us know that edits have been made by opening an issue and we'll be happy to integrate them!
Documentation ⇧
The documentation for Scribe-Data can be found at scribe-data.readthedocs.io. Documentation is an invaluable way to contribute to coding projects as it allows others to more easily understand the project structure and contribute. Issues related to documentation are marked with the documentation
label.
Scribe-Data generally follows numpydoc conventions for documenting functions and Python code in general. Function docstrings should have the following format:
def example_function(argument: argument_type) -> return_type:
"""
An example docstring for a function so others understand your work.
Parameters
----------
argument : argument_type
Description of your argument.
Returns
-------
return_value : return_type
Description of your return value.
Raises
------
ErrorType
Description of the error and the condition that raises it.
"""
...
return return_value
Use the following commands to build the documentation locally:
cd docs
make html
You can then open index.html
within docs/build/html
to check the local version of the documentation.