Contributing

This file is home to contribution documentation for the Colorado Center for Personalized Medicine - Informatics Operations (CCPM I/O) at the University of Colorado Anschutz Medical Campus.

Code of Conduct

We follow a code of conduct which may be found here: CODE_OF_CONDUCT.md.

Security

We follow security procedures which may be found here: SECURITY.md

Source Code, Data, and Reproducibility

Pride

We expect team members to sign their code, which means that source code contributions are attributable to an individual's account on GitHub. To quote from The Pragmatic Programmer:

[Craftspeople] of an earlier age were proud to sign their work. You should be, too… People should see your name on a piece of code and expect it to be solid, well written, tested, and documented.

While some code will be proof-of-concept code, it should be of a form that inspires confidence.

Programming Languages

We most often write code for our analyses in Python or R. This allows everyone in the organization to know two languages and understand analytical code.

Version Control Services

Our primary version control service is GitHub. We expect repository management and code maintenance to occur through our CCPM-IO GitHub organization. We discourage direct commits to the default branch (for example, main or master) for all repositories. Changes to default repository branches occur through "feature branches" and GitHub pull requests with related code review and merges after approval.

Development

Linting

A linter is a code analysis tool designed to analyze code and flag for programming errors, bugs and stylistic errors. These tools allow one to drastically improve their productivity when writing code for their research projects as stylistic errors and programming errors will automatically be covered through this setup.

Pre-commit

We recommend the use of pre-commit with our repositories. Using pre-commit allows you to setup a "git hook" which can check for errors prior to committing changes with git. See pre-commit installation documentation and individual repositories for their unique configurations and potential linting expectations.

Updates to Code

We practice code review on all changes to repositories. Code review is handled through GitHub pull requests. The process is described briefly below. Feel free to ask for guidance if you are uncomfortable with the process.

⚠️ We will revoke write access for failing to adhere to these rules.

Make changes to your code and commit them in a non-default branch.
Create a pull request into the repository owned by CCPM I/O.
Select potential reviewers for your pull request.
Once at least one organization member has approved your pull request, you or a reviewer may merge your pull request.
- There are sometimes exceptions to this policy where, in addition to the above rules, an IO director must also approve the pull request (for example, with the .github repository).

Composition of Pull Requests

Each pull request may contain one or more changes. In keeping with good source control practice, each pull request should include all commits necessary to complete a particular fix or update. In addition, each pull request should relate to no more than one functional area in the code base you are updating. Keeping the pull request focused to one area makes it easier for your reviewers to provide thoughtful feedback. We recommend keeping pull requests as small as feasible to address fixes or new capabilities. Larger pull requests involving many lines changed and/or very complex changes may be a sign that further granularity (and smaller changes) would be beneficial.

Reviewing Pull Requests

We expect that team members will participate in requested review of pull requests. See the checklist below on how to facilitate code review. As a reviewer, you are responsible for making sure that all checklist guidelines are followed.

Code Review Checklist

Pride: We expect team members to sign their code via commits attributable to a user. Each commit must be attributed to a recognized user.
Licensing: A LICENSE file is in the root of the repository.
Using Other Code: Code taken from elsewhere is properly acknowledged and compatible with the license.
Style Guide: Python code follows PEP 8. R code follows Google's R Style Guide. We provide instructions on how to automate the linting process here
Variable and Function Names: Variable names are descriptive and interpretable to someone looking at this code for the first time (e.g. not a, b, x, etc.).
File Commenting: Each file has a comment at the top to broadly describe its function and how it is expected to be used (e.g. imported, run from command line, both).
Function Comments: Each function has a docstring which reports the computation that it intends to implement, its arguments, and its return value(s).
In-line Commenting: At least 2 spaces are placed between in-line comments (#) and source code.
Imports: All trivial imports are at the top of the file.
Column Length: The code should be readable in a text editor with a reasonable format as well as the GitHub interface. This means that there are no excessively long lines. We strongly recommend that repository maintainers select a maximum line length for code of 80 or 100 characters and that this be specified in a contributors document for the repository. Plain text, markdown, and other text-based formats can alternatively be broken at sentences. This rule is already covered well in PEP 8 but called out here to clarify that we apply it to more than Python code. One reason for this is to aid in readability of diff output when performing code reviews.
Whitespace: There is no unnecessary whitespace.
Code with constants Any constants are specified at the beginning of the file.
Code that uses a random seed [special case of constants] Code that uses a random seed is reproducible. This means that the seed can be set and a default value is specified.
API error handling APIs should catch and handle anticipated errors (e.g. key doesn't exist, type mismatch in lookup) by identifying the source of the error (e.g. lookup failed with PK=XYZ) to the caller with as much precision as possible.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CONTRIBUTING.md

CONTRIBUTING.md

Contributing

Code of Conduct

Security

Source Code, Data, and Reproducibility

Pride

Programming Languages

Version Control Services

Development

Linting

Pre-commit

Updates to Code

Composition of Pull Requests

Reviewing Pull Requests

Code Review Checklist

Files

CONTRIBUTING.md

Latest commit

History

CONTRIBUTING.md

File metadata and controls

Contributing

Code of Conduct

Security

Source Code, Data, and Reproducibility

Pride

Programming Languages

Version Control Services

Development

Linting

Pre-commit

Updates to Code

Composition of Pull Requests

Reviewing Pull Requests

Code Review Checklist