Thank you for your interest in contributing to EduKTM! Before you begin writing code, it is important that you share your intention to contribute with the team, based on the type of contribution:
- You want to propose a new feature and implement it.
- Post about your intended feature in an issue, and we shall discuss the design and implementation. Once we agree that the plan looks good, go ahead and implement it.
- You want to implement a feature or bug-fix for an outstanding issue.
- Search for your issue in the EduKTM issue list.
- Pick an issue and comment that you'd like to work on the feature or bug-fix.
- If you need more context on a particular issue, please ask and we shall provide.
Once you implement and test your feature or bug-fix, please submit a Pull Request to EduKTM:
- Fork this repository to your branch.
- Modify the code. Note that we strongly recommend that you comply with our commit format specifications.
- Pass code tests and make the test coverage reach 100%. An example.
- Submit a Pull Request to EduKTM. Note that we provide a standard template of Pull Request here. Please fill in the information carefully.
The followings are some helpful guidelines for different types contribution:
If you want to add the data analysis or a new dataset, please submit a Pull Request to EduData.
The newly implemented KTM model requires:
- Dataset processing.
- Inherit the
class KTM
inEduKTM/meta.py
and implement four methods in it. - Write the corresponding example code for the model (This refers to a demo that can be tested by others). It should include at least notebook and script. An example.
- Write the corresponding test code for the model and make sure that the test coverage is 100%. An example.
As for the dataset preprocessing, we suggest:
- Write a script, and make sure that:
- Processing and converting of the raw dataset.
- Partitioning Training/validation/test dataset.
- Provide or use KTBD dataset (which is already divided into training/validation/test datasets).
All modules are inherited from Class KTM
, it will raise NotImplementedError
if the functions are not implemented.
Note that we do not constrain your neural network or algorithms (for example, the network construction, optimizer, loss function definitions, etc.).
- Train module
This module is a training module, which is used to train model.
def train(self, *args, **kwargs) -> ...:
raise NotImplementedError
- Eval module
This module is a evaluation module, which is used to verify and test the model.
def eval(self, *args, **kwargs) -> ...:
raise NotImplementedError
- Save module
This module is a model saving module, which is used to save the trained model.
def save(self, *args, **kwargs) -> ...:
raise NotImplementedError
- Load module
This module is a model loading module, which is used to load the saved model.
def load(self, *args, **kwargs) -> ...:
raise NotImplementedError
Make sure you make a demo for your model. An example.
Numpy docs format is used:
function
Parameters
----------
Variable 1: type <int, float>, optional or not
description
Variable 2: type <int, float>, optional or not
description
...
Returns
-------
Variable: type <int, float>
description
See Also (Optional)
--------
Similar to function():
Examples (Optional)
--------
>>> For example:
...
[<type>](<scope>) <subject>
feat
:New feature。fix/to
:Fix bugs, either found in Q&A or found in your own use.fix
:Generating diff and fixes the problem automatically. Suitable for one submit to fix the problem directly.to
:Generating only diff but does not automatically fix the problem. Suitable for multiple submissions. Usefix
when the final fix problem is committed.
docs
:Documentation.style
:Format (do not affect code execution).refactor
:Refactoring (not new features or bug fix).perf
:Optimize related issues, such as code performance, user experience.test
:Add test unit.chore
:Build process or auxiliary tools change.revert
:Roll back to the previous version.merge
:Code merge.sync
:Synchronizing the bug of main or branch。arch
: Engineering documents or tools change.
Scope is used to describe the impact of the commit, such as the data layer, the control layer, the view layer, and so on, depending on the project.
For example, in Angular, it can be location, browser, compile, compile, rootScope, ngHref, ngClick, ngView, and so on. If your changes affect more than one scope, you can use *
instead.
A subject is a short description of the purpose of the commit, not more than 50 characters.
There is no period or other punctuation at the end.
- [docs] update the README.md
git commit -m "[docs] update the README.md"
Q: I have carefully test the code in my local system (all testing passed) but still failed in online CI?
A: There are two possible reasons:
- the online CI system is different from your local system;
- there are some network error causing the downloading test failed, which you can find in the CI log.
For the second reason, all you need to do is to retry the test.