Skip to content

add miniPile dataset for pretraining, 1M entries (solves the 'out of data' at 40 iters issue) #120

add miniPile dataset for pretraining, 1M entries (solves the 'out of data' at 40 iters issue)

add miniPile dataset for pretraining, 1M entries (solves the 'out of data' at 40 iters issue) #120

Workflow file for this run

name: Lint
on:
pull_request:
concurrency:
group: lint-${{ github.workflow }}-${{ github.ref == 'refs/heads/main' && github.run_number || github.ref }}
cancel-in-progress: true
defaults:
run:
shell: bash -l -eo pipefail {0}
jobs:
lint:
runs-on: ubuntu-latest
strategy:
matrix:
python-version: ['3.10']
steps:
- name: Check out repo
uses: actions/checkout@v3
- name: Setup python
uses: actions/setup-python@v4
with:
python-version: ${{ matrix.python-version }}
- name: Update pip
run: python -m pip install --upgrade pip
- name: Install lint utilities
run: |
python -m pip install pre-commit
pre-commit install-hooks
- name: Get changed files
id: changed-files
uses: tj-actions/[email protected]
- name: Lint modified files
run: pre-commit run --files ${{ steps.changed-files.outputs.all_changed_files }}