We at Weights & Biases ❤️ open source and welcome contributions from the community!
This guide discusses the development workflow of the wandb
library.
-
Browse the existing Issues on GitHub to see if the feature/bug you are willing to add/fix has already been requested/reported.
- If not, please create a new issue. This will help the project keep track of feature requests and bug reports and make sure effort is not duplicated.
-
If you are a first-time contributor, please go to
https://github.com/wandb/wandb
and click the "Fork" button in the top-right corner of the page. This will create your personal copy of the repository that you will use for development.-
Set up SSH authentication with GitHub.
-
Clone the forked project to your machine and add the upstream repository that will point to the main
wandb
project:git clone https://github.com/<your-username>/wandb.git cd wandb git remote add upstream https://github.com/wandb/wandb.git
-
-
Develop your contribution.
- Make sure your fork is in sync with the main repository:
git checkout main git pull upstream main
- Create a
git
branch where you will develop your contribution. Use a sensible name for the branch, for example:
git checkout -b <username>/<short-dash-seperated-feature-description>
- Hack! As you make progress, commit your changes locally, e.g.:
git add changed-file.py tests/test-changed-file.py git commit -m "feat(integrations): Add integration with the `awesomepyml` library"
- Test and lint your code! Please see below for a detailed discussion.
- Ensure compliance with conventional commits, see below. This is enforced by the CI and will prevent your PR from being merged if not followed.
-
Proposed changes are contributed through GitHub Pull Requests.
-
When your contribution is ready and the tests all pass, push your branch to GitHub:
git push origin <username>/<short-dash-seperated-feature-description>
-
Once the branch is uploaded,
GitHub
will print a URL for submitting your contribution as a pull request. Open that URL in your browser, write an informative title and a detailed description for your pull request, and submit it. -
Please link the relevant issue (either the existing one or the one you created) to your PR. See the right column on the PR page. Alternatively, in the PR description, mention that it "Fixes link-to-the-issue" - GitHub will do the linking automatically.
-
The team will review your contribution and provide feedback. To incorporate changes recommended by the reviewers, commit edits to your branch, and push to the branch again (there is no need to re-create the pull request, it will automatically track modifications to your branch), e.g.:
git add tests/test-changed-file.py git commit -m "test(sdk): Add a test case to address reviewer feedback" git push origin <username>/<short-dash-seperated-feature-description>
-
Once your pull request is approved by the reviewers, it will be merged into the main branch in the repository.
-
At Weights & Biases, we ask that all PR titles conform to the Conventional Commits specification. Conventional Commits is a lightweight convention on top of commit messages.
Structure
The commit message should be structured as follows:
<type>(<scope>): <description>
Only certain types are permitted.
⭐ User-facing notes such as `fix` and `feat` should be written so that a user can clearly understand the changes. If the feature or fix does not directly impact users, consider using a different type. Examples can be found in the section below.Type | Name | Description | User-facing? |
---|---|---|---|
feat | ✨ Feature | A pull request that adds new functionality that directly impacts users | Yes |
fix | 🐛 Fix | A pull request that fixes a bug | Yes |
docs | 📚 Documentation | Documentation changes only | Maybe |
style | 💎 Style | Changes that do not affect the meaning of the code (e.g. linting or adding type annotations) | No |
refactor | 📦 Code Refactor | A code change that neither fixes a bug nor adds a feature | No |
perf | 🚀 Performance Improvements | A code change that improves performance | No |
test | 🚨 Tests | Adding new or missing tests or correcting existing tests | No |
build | 🛠 Builds | Changes that affect the build system (e.g. protobuf) or external dependencies | Maybe |
ci | ⚙️ Continuous Integrations | Changes to our CI configuration files and scripts | No |
chore | ♻️ Chores | Other changes that don't modify source code files. | No |
revert | 🗑 Reverts | Reverts a previous commit | Maybe |
security | 🔒 Security | Security fix/feature | Maybe |
Which part of the codebase does this change impact? Only certain scopes are permitted.
Scope | Name | Description |
---|---|---|
sdk | Software Development Kit | Generic SDK changes or if can't define a narrower scope |
cli | Command-Line Interface | Generic CLI changes |
public-api | Public API | Public API changes |
integrations | Integrations | Changes related to third-party integrations |
artifacts | Artifacts | Changes related to Artifacts |
media | Media Types | Changes related to Media types |
sweeps | Sweeps | Changes related to Sweeps |
launch | Launch | Changes related to Launch |
Sometimes a change may span multiple scopes. In this case, please choose the scope that would be most relevant to the user.
Write a short, imperative tense description of the change.
User-facing notes (ones with type fix
and feat
) should be written so that a user can understand what has changed.
If the feature or fix does not directly impact users, consider using a different type.
✅ Good Examples
-
feat(media): add support for RDKit Molecules
It is clear to the user what the change introduces to our product.
-
fix(sdk): fix a hang caused by keyboard interrupt on Windows
This bug fix addressed an issue that caused the sdk to hang when hitting Ctrl-C on Windows.
❌ Bad Examples
-
fix(launch): fix an issue where patch is None
It is unclear what is referenced here.
-
feat(sdk): Adds new query to the the internal api getting the state of the run
It is unclear what is of importance to the user here, what do they do with that information. A better type would be
chore
or the title should indicate how it translates into a user-facing feature.
The W&B SDK is implemented in Python and Go.
You can use your favorite python
version management tool, such as pyenv
. To install it, follow these instructions.
Optionally set up a tool to manage multiple virtual environements, for example pyenv-virtualenv
.
Install nox
and uv
into your environment:
pip install -U nox uv
Install Go version 1.22.5
following the instructions here or using your package manager, for example:
brew install [email protected]
We recommend installing the wandb
package in the editable mode with either pip
or uv
:
uv pip install -e .
If you are modifying Go code, you should rerun the command to rebuild and reinstall the package.
Alternatively, you can install wandb-core
(the Go backend of the SDK) in development mode, by running the following command:
./core/scripts/setup-core-path.sh
This script will also allow you to unset the wandb-core
path if you no longer want to use
the development version of wandb-core
.
We are using pre-commit hooks to manage our linters and other auto-generated code.
To install pre-commit
run the following:
pip install -U pre-commit
To install all of our pre-commit hooks run:
./core/scripts/code-checks.sh update
pre-commit install
If you just want to run a specific hook, for example formating your code, you could run the following:
pre-commit run ruff-format --all-files --hook-stage pre-push
We use protocol buffers to communicate
from the user process to the wandb
backend process.
If you update any of the .proto
files in wandb/proto
, you'll need to run the
proto nox command to build the protocol buffer files:
nox -t proto
Note: you only need to do that if you change any of our protocol buffer files.
- Add a new type-annotated
SettingsData
class attribute. - Add the new field to
wandb/proto/wandb_settings.proto
following the existing pattern.- Run
nox -t proto
to re-generate the python stubs.
- Run
- If the setting comes with a default value/preprocessor/additional validators/runtime hooks, add them to
the template dictionary that the
Settings._default_props
method returns, using the same key name as the corresponding class variable.- For any setting that is only computed (from other settings) and need/should not be set/updated
(and so does not require any validation etc.), define a hook (which does not have to depend on the setting's value)
and use
"auto_hook": True
in the template dictionary (see e.g. thewandb_dir
setting).
- For any setting that is only computed (from other settings) and need/should not be set/updated
(and so does not require any validation etc.), define a hook (which does not have to depend on the setting's value)
and use
- Add tests for the new setting to
tests/wandb_settings_test.py
. - Note that individual settings may depend on other settings through validator methods and runtime hooks,
but the resulting directed dependency graph must be acyclic. You should re-generate the topologically-sorted
modification order list with
nox -s codegen
-- it will also automatically detect cyclic dependencies and throw an exception.
All URLs displayed to the user should be added to wandb/sdk/lib/wburls.py
. This will better
ensure that URLs do not lead to broken links.
Once you add the URL to that file you will need to run:
nox -s codegen
Starting with version 1.0.0, wandb
will be using Semantic Versioning.
The major version of the library will be incremented for all backwards-incompatible changes,
including dropping support for older Python versions.
Features currently marked as deprecated will be removed in the next major version (1.0.0).
To mark a feature as deprecated (and to be removed in the next major release), please follow these steps:
- Add a new field to the
Deprecated
message definition inwandb/proto/wandb_telemetry.proto
, which will be used to track the to-be-deprecated feature usage. - Rebuild protocol buffers and re-generate
wandb/proto/wandb_deprecated.py
by runningnox -t proto
. - Finally, to mark a feature as deprecated, call
wand.sdk.lib.deprecate
in your code:
from wandb.sdk.lib import deprecate
deprecate.deprecate(
field_name=deprecate.Deprecated.deprecated_field_name, # new_field_name from step 1
warning_message="This feature is deprecated and will be removed in a future release.",
)
If there is a schema change on the Server side that affects your GraphQL API,
update core/api/graphql/schemas/schema-latest.graphql
and run
nox -s graphql-codegen-schema-change
If there is no schema change and you are e.g. just adding a new query or mutation against the schema that already supports it, DO NOT USE this nox session. Our pre-commit hook will auto-generate the required code for you.
We use the pytest
framework. Tests can be found in tests/
.
All test dependencies should be in requirements_dev.txt
so you could just run:
`pip install -r requirements_dev.txt`
After that you can run your test using the standard pytest
commands. For example:
pytest -s -vv tests/path-to-tests/test_file.py