Contributing to Pangraph

This guide describes how to setup developer environment, how to build Pangraph, contribute to the codebase and maintain the project.

Setup developer environment

This guide assumes Ubuntu 24.04 operating system, but will likely work similarly to any other Linux and Unix-like machine.

Pangraph is written in Rust. The usual rustup & cargo workflow can be used:

# Install required dependencies
# These particular commands are specific for Ubuntu Linux and will work on some other Debian-based Linux distros.
# Refer to documentation of your operating system to find how to install these dependencies.
sudo apt-get update
sudo apt-get install \
  bash \
  build-essential \
  clang \
  curl \
  gcc \
  git \
  libbz2-dev \
  libclang-dev \
  liblzma-dev \
  libssl-dev \
  libzstd-dev \
  make \
  pkg-config \
  zlib1g-dev \

# Install Rustup, the Rust version manager (https://www.rust-lang.org/tools/install)
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | bash -s -- -y

# Add Rust tools to the $PATH. You might add this line to your .bashrc or .zshrc so that the $PATH is adjusted automatically when a new terminal session is opened.
export PATH="$PATH:$HOME/.cargo/bin"

Obtain source code

Pangraph is an open-source project and its source code is available on GitHub under MIT license.

To obtain source code use git to clone GitHub repository:

git clone https://github.com/neherlab/pangraph

If you are a team member, use SSH url to be able to push securely and without password prompt (More details: https://docs.github.com/en/authentication/connecting-to-github-with-ssh)

git clone git@github.com:neherlab/pangraph.git

If you are not a team member, but want to contribute, make a fork, and clone your forked repository instead. You can then submit changes to pangraph as a Pull Request . Pangraph maintainers will then review your changes and will consider merging them into the main project.

Build and run

Build and run executables

# Go to the cloned directory
cd pangraph

# (optional) checkout a branch (different from default)
git checkout <branch_name>

# Build and run in debug mode (convenient for development, fast to build, slow to run, has more information in stack traces and when running under a debugger)
cargo run --bin=pangraph

# Run with additional arguments passed to the executable
cargo run --bin=pangraph -- --help

# Instead of `--bin=pangraph` you can also run any other executable from `packages/pangraph/src/bin/`. Just substitute its filename.
# This is a `cargo` convention: everything in `src/bin/` that has a `main()` function in it becomes an executable. This way you can add more executables.
cargo run --bin=my_executable

# Run Pangraph in release mode (slow to build, fast to run, very little information in stack traces and during debugging)
cargo run --release --bin=pangraph

# Alternatively, build and run separately. The compiled binaries will be in `target/` directory by default.
cargo run --release --bin=pangraph
./target/release/pangraph

Note, on first build of a particular project, cargo will search for one of the possible toolchain config files and will automatically install Rust version required by the project. This may cause first build to take longer than usual.

Testing

Install requirements

We run tests using cargo-nextest (https://nexte.st/). You can install it from GitHub Releases or build and install from source with

cargo install cargo-nextest --locked

All tests

Run all tests with:

cargo nextest run

Add --no-fail-fast flag to keep going even if there are failures.

A subset of tests can be ran by providing a regex matching full test name. For example to run tests test_foo and test_foo_bar

cargo nextest run foo

You may experiment with command line arguments and the prettytest script for the most useful and pleasant output:

cargo -q nextest run --success-output=immediate --workspace --cargo-quiet --no-fail-fast --hide-progress-bar --color=always | "./dev/prettytest"

Arguments of cargo test (and by extension cargo nextest) are somewhat confusing, so you could setup some personal shell scripts or aliases to simplify routine work on Rust projects.

Unit tests

If you want to run only unit tests:

cargo nextest run --lib
cargo nextest run --lib foo

Integration tests

If you want to run only integration tests:

cargo nextest run --test='*'
cargo nextest run --test='*' foo
cargo nextest run --test='foo'

Linting (static analysis)

Rust code is linted by running Clippy:

cargo clippy --all-targets --all

Clippy is configured in clippy.toml and .cargo/config.toml.

Formatting (code style)

Code formatting is done using rustfmt:

cargo fmt --all

Rustfmt is configured in rustfmt.toml.

Maintenance

Upgrading Rust

Rust version is defined in rust-toolchain.toml. When using cargo, the version defined in this file gets installed automatically.

Upgrading Rust dependencies

Dependencies for subprojects are defined in packages/**/Cargo.toml and in Cargo.lock. They are periodically upgraded by a dedicated maintainer, manually using cargo-upgrade from cargo-edit package.

cargo upgrade --workspace

or, for a more radical upgrade:

cargo upgrade --pinned --incompatible --verbose --recursive

Then remove Cargo.lock and rebuild the project to apply the upgrades across dependency tree.

Documentation

End-user and developer documentation is in docs/ subdirectory. For details, see README.md file there.

Versioning and releases

There are multiple release targets and they are published by updating where the corresponding git branch is pointing, which triggers a corresponding CI workflow.

Branch	CI workflow	Target
`release`	cli.yml	Releases Pangraph CLI to GitHub Releases and DockerHub
`release-pypangraph`	pypangraph.yml	Releases PyPangraph to PyPI
`release-docs`	docs.yml	Releases documentation website to https://docs.pangraph.org

Releasing Pangraph CLI

Check out master branch. Make sure that all the changes that you want to release are on master branch. Make sure that the last GitHub Action on master branch succeeded (use branch filter dropdown). If not, make sure to fix the failures before trying to release.
Prepare changelog document for the release: open CHANGELOG.md in the root directory of the project, add ## Unreleased section at the top of the file, spelled exactly like this - important for automation. Under this section, describe all changes in the coming release. This is a user-facing document, so use simple words, avoid internal and dev jargon. If ## Unreleased section already exists, then extend it - do not add multiple of these sections.
Perform pre-release checks, bump versions, commit using the helper ./dev/release script.

Read comments in the script on how to install currently required dependencies.

This step is local to your working copy of the project, it does not push or otherwise publishes anything yet.
```
./dev/release ${bump_type}
```
most likely you want "bump type" to be one of :
- patch - for bug fix releases, e.g. 1.1.0 -> 1.1.1
- minor - for new feature releases, e.g. 1.1.0 -> 1.2.0
- major - for releases with breaking changes, e.g. 1.1.0 -> 2.0.0
Having hard times to decide? Read Semantic Versioning.
Follow instructions printed by the script. Resolve errors, if any. If finished successfully, follow instructions on how to fast-forward and push the changes to the release branch.
Optionally, you can combine the releases of CLI, docs and PyPangraph. Just add more commits to master branch and then fast-forward and push them to corresponding branches all together.
The push to release-cli triggers a GitHub Action. This is non-reversible step. Do the push. Watch for any malfunctions in GitHub Actions.
If GitHub Action succeeds, double check that release is up on GitHub Releases and Docker Hub.

Releasing PyPangraph

TODO: automate this

Check out master branch. Make sure that all the changes and no other changes that you want to release are on master branch.
Prepare changelog for the release: open packages/pypangraph/CHANGELOG.md and add ## ${version_number} section, describing released changes.
Bump version to the same version number in packages/pypangraph/pyproject.toml
Commit to master with message: chore: release pypangraph ${version_number} and push
Fast forward release-pypangraph branch to master:
The push to branch release-pypangraph triggers a GitHub Action. This is non-reversible step. Do the push. Watch for any malfunctions in GitHub Actions.
If GitHub Action succeeds, double check that PyPI release is up.

Releasing user documentation

See README.md in the docs/ directory.

Setup developer environment​

Obtain source code​

Build and run​

Build and run executables​

Testing​

Install requirements​

All tests​

Unit tests​

Integration tests​

Linting (static analysis)​

Formatting (code style)​

Maintenance​

Upgrading Rust​

Upgrading Rust dependencies​

Documentation​

Versioning and releases​

Releasing Pangraph CLI​

Releasing PyPangraph​

Releasing user documentation​