- Install from source (recommended):

  ```bash
  cd PolarisRAG
  pip install -e .
  ```

- Install from PyPI:

  ```bash
  pip install polarisrag
  ```
- All the code can be found in the `examples` directory.
- Set the ZhipuAI API key in your environment if you are using ZhipuAI models:

  ```bash
  export ZHIPUAI_API_KEY="537..."
  ```
- Download the demo text "A Christmas Carol" by Charles Dickens:

  ```bash
  curl https://raw.githubusercontent.com/gusye1234/nano-graphrag/main/tests/mock_data.txt > ./book.txt
  ```
- Alternatively, you can load environment variables from a `.env` file. Create a new `.env` file:

  ```bash
  ZHIPUAI_API_KEY="537..."
  OPENAI_API_KEY="sk-..."
  OPENAI_BASE_URL="https://api..."
  QWEN2_API_KEY="sk-no-key-required"
  QWEN2_BASE_URL="http://127.0.0.1:8080/v1"
  ```

  Then load it in Python before initializing PolarisRAG (a sketch that ties these setup steps together follows this list):

  ```python
  from dotenv import load_dotenv, find_dotenv

  load_dotenv(find_dotenv(), override=True)
  ```
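The sketch below ties the setup steps above together: it reads `ZHIPUAI_API_KEY` from the `.env` file, indexes the downloaded `book.txt`, and asks a question about it. This is a minimal sketch assuming the default ZhipuAI-backed configuration; the question itself is only an illustration.

```python
from dotenv import load_dotenv, find_dotenv
from polarisrag import PolarisRAG

# Read ZHIPUAI_API_KEY (and any other keys) from the .env file created above
load_dotenv(find_dotenv(), override=True)

rag = PolarisRAG(working_dir="documents")
rag.init_rag()

# Index the demo text downloaded earlier
with open("book.txt", "r") as f:
    rag.insert(f.read())

print(rag.chat("Who is Ebenezer Scrooge?"))
```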
Use the Python snippet below to initialize PolarisRAG and run queries:
```python
import os

os.environ["ZHIPUAI_API_KEY"] = ""

from polarisrag import PolarisRAG

# Define the working directory
WORKING_DIR = "documents"

rag = PolarisRAG(working_dir=WORKING_DIR)
# Initialize the RAG pipeline: load the embedding model, vector store, and LLM
rag.init_rag()

# Insert data
with open("documents/test.txt", 'r') as f:
    rag.insert(f.read())

print(rag.chat("What is BERT?"))
print(rag.chat("How do I download the BERT-base-chinese pretrained model?"))
```
Use PolarisRAG by loading the documents in your working directory
```python
import os

os.environ["ZHIPUAI_API_KEY"] = ""

from polarisrag import PolarisRAG

# Define the working directory
WORKING_DIR = "documents"

rag = PolarisRAG(working_dir=WORKING_DIR)
# Initialize the RAG pipeline: load the embedding model, vector store, and LLM
rag.init_rag()

# Load the documents in the working directory into the vector database
rag.load_document()

print(rag.chat("What is BERT?"))
print(rag.chat("How do I download the BERT-base-chinese pretrained model?"))
```
Use PolarisRAG through a dictionary configuration
```python
from polarisrag import PolarisRAG

WORKING_DIR = "documents"

# Embedding model configuration
embedding_conf = {
    "class_name": "HFEmbedding",
    "class_param": {
        "pretrain_dir": '/root/.cache/modelscope/hub/BAAI/bge-large-zh-v1___5'
    }
}

# Vector storage configuration
vector_conf = {
    "class_name": "MilvusDB",
    "class_param": {}
}

# LLM configuration
llm_model_conf = {
    "class_name": "ZhipuLLM",
    "class_param": {
        "model": "glm-4-flash",
        "is_memory": "True"
    }
}

rag = PolarisRAG(working_dir=WORKING_DIR,
                 embedding_model=embedding_conf,
                 vector_storage=vector_conf,
                 llm_model=llm_model_conf)
rag.init_rag()

with open("documents/test.txt", 'r') as f:
    rag.insert(f.read())

result = rag.chat("What is BERT?")
print(result)
result = rag.chat("How do I download the BERT-base-chinese pretrained model?")
print(result)
```
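Since the configuration is made of plain dictionaries, it can also be kept in a standalone file. The following is a minimal sketch assuming a hypothetical `config.json` that mirrors the three dictionaries above; PolarisRAG itself is used exactly as shown.

```python
import json

from polarisrag import PolarisRAG

# Hypothetical config.json, e.g.:
# {"embedding": {...}, "vector": {...}, "llm": {...}}
with open("config.json", "r") as f:
    conf = json.load(f)

rag = PolarisRAG(working_dir="documents",
                 embedding_model=conf["embedding"],
                 vector_storage=conf["vector"],
                 llm_model=conf["llm"])
rag.init_rag()
# From here, rag.insert(...) and rag.chat(...) work exactly as in the example above.
```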
Use PolarisRAG by assembling its components
```python
import os

from polarisrag import PolarisRAG
from polarisrag.embedding import ZhipuEmbedding
from polarisrag.vector_database import MilvusDB
from polarisrag.llm import ZhipuLLM
from polarisrag.utils import FolderLoader

# API key (can also be read from the ZHIPUAI_API_KEY environment variable)
ZHIPUAI_API_KEY = ""
# Working directory
WORKING_DIR = "documents"

# Build the components individually
embedding_model = ZhipuEmbedding(api_key=ZHIPUAI_API_KEY)
llm_model = ZhipuLLM(api_key=os.getenv("ZHIPUAI_API_KEY"))

# Load and chunk every document in the working directory
loader = FolderLoader(folder_path=WORKING_DIR)
docs = loader.get_all_chunk_content()

vector_db = MilvusDB({"embedding_model": embedding_model})
# Create a collection
vector_db.create_collection("default")
vector_db.insert(docs=docs)

rag = PolarisRAG(
    working_dir=WORKING_DIR,
    embedding_model=embedding_model,
    vector_storage=vector_db,
    llm_model=llm_model,
    is_memory=True
)

print(rag.chat("What is BERT?"))
```
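Because `is_memory=True` is passed here, follow-up questions can presumably refer back to earlier turns of the same session. A minimal sketch continuing the example above; the follow-up questions are only illustrations:

```python
# With is_memory=True, the earlier exchange is assumed to stay in the conversation
# context, so follow-up questions can refer back to it.
print(rag.chat("Summarize that answer in one sentence."))
print(rag.chat("How does it differ from GPT-style models?"))
```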
Use PolarisRAG with an existing vector database
```python
import os

os.environ["ZHIPUAI_API_KEY"] = ""

from polarisrag import PolarisRAG

# Define the working directory
WORKING_DIR = "documents"

# Vector database configuration
vector_conf = {
    "class_name": "MilvusDB",
    "class_param": {
        # Database file name; the default collection_name is "default"
        "db_file": "milvus_data.db",
    }
}

rag = PolarisRAG(working_dir=WORKING_DIR,
                 vector_storage=vector_conf)
# Initialize the RAG pipeline: load the embedding model, vector store, and LLM
rag.init_rag()

print(rag.chat("What is BERT?"))
print(rag.chat("How do I download the BERT-base-chinese pretrained model?"))
```
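This example assumes `milvus_data.db` already contains indexed documents. Below is a minimal sketch of an earlier run that could have produced it, reusing only the calls shown above and assuming that `load_document()` persists the vectors into the configured `db_file`.

```python
import os

os.environ["ZHIPUAI_API_KEY"] = ""

from polarisrag import PolarisRAG

# Point MilvusDB at the same database file so that a later run can reopen it
vector_conf = {
    "class_name": "MilvusDB",
    "class_param": {"db_file": "milvus_data.db"},
}

rag = PolarisRAG(working_dir="documents", vector_storage=vector_conf)
rag.init_rag()

# Index every document in the working directory; the vectors are stored in milvus_data.db
rag.load_document()
```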
If you find PolarisRAG useful, please cite:

```bibtex
@article{guo2024polarisrag,
  title={PolarisRAG: Fast and Easy Retrieval-Augmented Generation},
  author={Runke Zhong},
  year={2024}
}
```
Stay passionate, and head for the sea of stars!

There are only two things in this world that can truly shake our souls: the brilliant starry sky above us, and the lofty moral law within us.