Skip to content

Commit

Permalink
confluence
Browse files Browse the repository at this point in the history
Signed-off-by: Costa Shulyupin <[email protected]>
  • Loading branch information
makelinux committed May 25, 2024
1 parent 74f70a5 commit d0ba5f9
Showing 1 changed file with 40 additions and 0 deletions.
40 changes: 40 additions & 0 deletions docs/confluence-doc-source.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,40 @@
# Confluence document source

Fetching information from Confluence is an essential
feature for fine-tuning models on internal documentation.

## Interfaces

qna.yaml file, `document` section:

- Confluence Host: The base URL of the Confluence instance.
- Space: The Confluence space key where the documents reside.
- Page titles: The titles of the Confluence pages to fetch.
- Version: The version of the Confluence page (optional).

The qna.yaml file can define single host and multiple spaces and pages,
each with an optional version.

Confluence credentials in config.yaml:
- Username

Check failure on line 19 in docs/confluence-doc-source.md

View workflow job for this annotation

GitHub Actions / markdown-lint

Lists should be surrounded by blank lines

docs/confluence-doc-source.md:19 MD032/blanks-around-lists Lists should be surrounded by blank lines [Context: "- Username"] https://github.com/DavidAnson/markdownlint/blob/v0.34.0/doc/md032.md
- [Token](https://support.atlassian.com/atlassian-account/docs/manage-api-tokens-for-your-atlassian-account/)

## Changes across modules

- [Configuration module](https://github.com/instructlab/instructlab/blob/main/src/instructlab/config.py)
defines the structure and validation rules for
the config.yaml file.
- [Schema module](https://github.com/instructlab/schema) defines the structure and validation rules for
the qna.yaml file.
- [ilab utilities module](https://github.com/instructlab/instructlab/blob/main/src/instructlab/utils.py)
fetches documents
- [unit test](https://github.com/instructlab/instructlab/tree/main/tests)

## Additional External Packages

The implementation relies on the following external packages:

- [atlassian-python-api](https://atlassian-python-api.readthedocs.io/)
A Python library to interact with Atlassian products, including Confluence.
- [markdownify](https://pypi.org/project/markdownify/)
A library to convert HTML content to Markdown for processing Confluence page content.

0 comments on commit d0ba5f9

Please sign in to comment.