[Question]: Knowledge Graph is blank cant't display #4176

lxljw1314 · 2024-12-23T03:39:29Z

Describe your problem

I have deployed Ragflow in a private environment and used the knowledge graph parsing method for analysis. However, after the parsing is completed, the knowledge graph is blank, while the mind map has results.
I tried several embedding models, but the result was the same for all of them; however, the graph displays normally when using the official demo environment provided. please help！

the server log :

2024-12-23 11:22:57,922 INFO 33 172.22.0.6 - - [23/Dec/2024 11:22:57] "GET /v1/document/list?kb_id=7174a526c07911ef9ec30242ac160006&keywords=&page_size=10&page=1 HTTP/1.1" 200 - 2024-12-23 11:22:57,980 INFO 35 From minio(0.014944203197956085) 测试笔录.txt/测试笔录.txt 2024-12-23 11:22:58,003 INFO 35 set_progress(34d74b00c0dd11efa8cb0242ac160006), progress: 0.1, progress_msg: Page(1~100000001): Start to parse. 2024-12-23 11:22:58,026 INFO 35 set_progress(34d74b00c0dd11efa8cb0242ac160006), progress: 0.8, progress_msg: Page(1~100000001): Finish parsing. 2024-12-23 11:22:58,104 INFO 35 set_progress(34d74b00c0dd11efa8cb0242ac160006), progress: 0.5, progress_msg: Page(1~100000001): Extracting entities. 2024-12-23 11:23:10,064 INFO 35 HTTP Request: POST http://10.12.4.31:11434/api/chat "HTTP/1.1 200 OK" 2024-12-23 11:23:25,845 INFO 35 task_consumer_0 reported heartbeat: {"name": "task_consumer_0", "now": "2024-12-23T11:23:25.843712", "boot_at": "2024-12-23T11:13:55.255815", "pending": 1, "lag": 0, "done": 0, "failed": 0, "current": {"id": "34d74b00c0dd11efa8cb0242ac160006", "doc_id": "6c07af94c08211efbc000242ac160006", "from_page": 0, "to_page": 100000000, "retry_count": 0, "kb_id": "7174a526c07911ef9ec30242ac160006", "parser_id": "knowledge_graph", "parser_config": {"entity_types": ["organization", "person", "location", "event", "time"], "chunk_token_num": 8192, "delimiter": "\\n!?;\u3002\uff1b\uff01\uff1f"}, "name": "\u6d4b\u8bd5\u7b14\u5f55.txt", "type": "doc", "location": "\u6d4b\u8bd5\u7b14\u5f55.txt", "size": 3633, "tenant_id": "fed540f0c03411ef87ab0242ac160006", "language": "Chinese", "embd_id": "quentinz/bge-large-zh-v1.5@Ollama", "pagerank": 0, "img2txt_id": "llava@Ollama", "asr_id": "", "llm_id": "qwen2@Ollama", "update_time": 1734924177869}} 2024-12-23 11:23:52,021 INFO 35 HTTP Request: POST http://10.12.4.31:11434/api/chat "HTTP/1.1 200 OK" 2024-12-23 11:23:52,029 INFO 35 set_progress(34d74b00c0dd11efa8cb0242ac160006), progress: None, progress_msg: Page(1~100000001): 1/1, elapsed: 53.93099748343229s, used tokens: 3371 2024-12-23 11:23:52,042 INFO 35 set_progress(34d74b00c0dd11efa8cb0242ac160006), progress: 0.5, progress_msg: Page(1~100000001): Entities extraction progress ... 1/1 2024-12-23 11:23:52,053 INFO 35 set_progress(34d74b00c0dd11efa8cb0242ac160006), progress: 0.6, progress_msg: Page(1~100000001): Extracting community reports. 2024-12-23 11:23:52,068 WARNING 35 /ragflow/.venv/lib/python3.10/site-packages/networkx/readwrite/json_graph/node_link.py:142: FutureWarning: The default value will be edges="edges" in NetworkX 3.6.

To make this warning go away, explicitly set the edges kwarg, e.g.:

nx.node_link_data(G, edges="links") to preserve current behavior, or
nx.node_link_data(G, edges="edges") for forward compatibility.
warnings.warn(

2024-12-23 11:23:52,072 INFO 2024-12-23 11:23:55,878 INFO 2024-12-23 11:24:25,911 INFO 2024-12-23 11:24:55,944 INFO 2024-12-23 11:25:04,763 INFO 2024-12-23 11:25:04,821 INFO 2024-12-23 11:25:04,822 INFO 2024-12-23 11:25:04,822 INFO 2024-12-23 11:25:04,835 INFO 2024-12-23 11:25:06,132 INFO 2024-12-23 11:25:06,295 INFO 2024-12-23 11:25:06,332 INFO 2024-12-23 11:25:06,343 INFO 2024-12-23 11:25:06,496 INFO 2024-12-23 11:25:07,006 INFO 2024-12-23 11:25:07,675 INFO 2024-12-23 11:25:07,686 INFO 2024-12-23 11:25:07,689 INFO 2024-12-23 11:25:07,692 INFO 2024-12-23 11:25:07,707 INFO 2024-12-23 11:25:07,818 INFO 2024-12-23 11:25:07,823 INFO 2024-12-23 11:25:07,832 INFO 2024-12-23 11:25:07,842 INFO 2024-12-23 11:25:07,849 INFO 2024-12-23 11:25:07,851 INFO 2024-12-23 11:25:25,975 INFO 35 set_progress(34d74b00c0dd11efa8cb0242ac160006), progress: 0.75, progress_msg: Page(1100000001): Extracting mind graph.
35 task_consumer_0 reported heartbeat: {"name": "task_consumer_0", "now": "2024-12-23T11:23:55.876698", "boot_at": "2024-12-23T11:13:55.255815", "pending": 1, "lag": 0, "done": 0, "failed": 0, "current": {"id": "34d74b00c0dd11efa8cb0242ac160006", "doc_id": "6c07af94c08211efbc000242ac160006", "from_page": 0, "to_page": 100000000, "retry_count": 0, "kb_id": "7174a526c07911ef9ec30242ac160006", "parser_id": "knowledge_graph", "parser_config": {"entity_types": ["organization", "person", "location", "event", "time"], "chunk_token_num": 8192, "delimiter": "\n!?;\u3002\uff1b\uff01\uff1f"}, "name": "\u6d4b\u8bd5\u7b14\u5f55.txt", "type": "doc", "location": "\u6d4b\u8bd5\u7b14\u5f55.txt", "size": 3633, "tenant_id": "fed540f0c03411ef87ab0242ac160006", "language": "Chinese", "embd_id": "quentinz/bge-large-zh-v1.5@Ollama", "pagerank": 0, "img2txt_id": "llava@Ollama", "asr_id": "", "llm_id": "qwen2@Ollama", "update_time": 1734924177869}}
35 task_consumer_0 reported heartbeat: {"name": "task_consumer_0", "now": "2024-12-23T11:24:25.909271", "boot_at": "2024-12-23T11:13:55.255815", "pending": 1, "lag": 0, "done": 0, "failed": 0, "current": {"id": "34d74b00c0dd11efa8cb0242ac160006", "doc_id": "6c07af94c08211efbc000242ac160006", "from_page": 0, "to_page": 100000000, "retry_count": 0, "kb_id": "7174a526c07911ef9ec30242ac160006", "parser_id": "knowledge_graph", "parser_config": {"entity_types": ["organization", "person", "location", "event", "time"], "chunk_token_num": 8192, "delimiter": "\n!?;\u3002\uff1b\uff01\uff1f"}, "name": "\u6d4b\u8bd5\u7b14\u5f55.txt", "type": "doc", "location": "\u6d4b\u8bd5\u7b14\u5f55.txt", "size": 3633, "tenant_id": "fed540f0c03411ef87ab0242ac160006", "language": "Chinese", "embd_id": "quentinz/bge-large-zh-v1.5@Ollama", "pagerank": 0, "img2txt_id": "llava@Ollama", "asr_id": "", "llm_id": "qwen2@Ollama", "update_time": 1734924177869}}
35 task_consumer_0 reported heartbeat: {"name": "task_consumer_0", "now": "2024-12-23T11:24:55.942688", "boot_at": "2024-12-23T11:13:55.255815", "pending": 1, "lag": 0, "done": 0, "failed": 0, "current": {"id": "34d74b00c0dd11efa8cb0242ac160006", "doc_id": "6c07af94c08211efbc000242ac160006", "from_page": 0, "to_page": 100000000, "retry_count": 0, "kb_id": "7174a526c07911ef9ec30242ac160006", "parser_id": "knowledge_graph", "parser_config": {"entity_types": ["organization", "person", "location", "event", "time"], "chunk_token_num": 8192, "delimiter": "\n!?;\u3002\uff1b\uff01\uff1f"}, "name": "\u6d4b\u8bd5\u7b14\u5f55.txt", "type": "doc", "location": "\u6d4b\u8bd5\u7b14\u5f55.txt", "size": 3633, "tenant_id": "fed540f0c03411ef87ab0242ac160006", "language": "Chinese", "embd_id": "quentinz/bge-large-zh-v1.5@Ollama", "pagerank": 0, "img2txt_id": "llava@Ollama", "asr_id": "", "llm_id": "qwen2@Ollama", "update_time": 1734924177869}}
35 HTTP Request: POST http://10.12.4.31:11434/api/chat "HTTP/1.1 200 OK"
35 Chunking(126.85584738105536) 测试笔录.txt/测试笔录.txt done
35 MINIO PUT(测试笔录.txt):0
35 Build document 测试笔录.txt: 126.86s
35 set_progress(34d74b00c0dd11efa8cb0242ac160006), progress: None, progress_msg: Page(1100000001): Generate 3 chunks
35 HTTP Request: POST http://10.12.4.31:11434/api/embeddings "HTTP/1.1 200 OK"
35 HTTP Request: POST http://10.12.4.31:11434/api/embeddings "HTTP/1.1 200 OK"
35 HTTP Request: POST http://10.12.4.31:11434/api/embeddings "HTTP/1.1 200 OK"
35 set_progress(34d74b00c0dd11efa8cb0242ac160006), progress: 0.6333333333333333, progress_msg:
35 HTTP Request: POST http://10.12.4.31:11434/api/embeddings "HTTP/1.1 200 OK"
35 HTTP Request: POST http://10.12.4.31:11434/api/embeddings "HTTP/1.1 200 OK"
35 HTTP Request: POST http://10.12.4.31:11434/api/embeddings "HTTP/1.1 200 OK"
35 set_progress(34d74b00c0dd11efa8cb0242ac160006), progress: 0.7666666666666666, progress_msg:
35 Embedding chunks (2.85s)
35 set_progress(34d74b00c0dd11efa8cb0242ac160006), progress: None, progress_msg: Page(1100000001): Embedding chunks (2.85s)
35 HEAD http://es01:9200/ragflow_fed540f0c03411ef87ab0242ac160006 [status:200 duration:0.011s]
35 PUT http://es01:9200/ragflow_fed540f0c03411ef87ab0242ac160006/_bulk?refresh=false&timeout=60s [status:200 duration:0.105s]
35 set_progress(34d74b00c0dd11efa8cb0242ac160006), progress: 0.8333333333333334, progress_msg:
35 Indexing doc(测试笔录.txt), page(0-100000000), chunks(3), elapsed: 0.12
35 set_progress(34d74b00c0dd11efa8cb0242ac160006), progress: 1.0, progress_msg: Page(1100000001): Done (0.13s)
35 Chunk doc(测试笔录.txt), page(0-100000000), chunks(3), token(768), elapsed:0.13
35 handle_task done for task {"id": "34d74b00c0dd11efa8cb0242ac160006", "doc_id": "6c07af94c08211efbc000242ac160006", "from_page": 0, "to_page": 100000000, "retry_count": 0, "kb_id": "7174a526c07911ef9ec30242ac160006", "parser_id": "knowledge_graph", "parser_config": {"entity_types": ["organization", "person", "location", "event", "time"], "chunk_token_num": 8192, "delimiter": "\n!?;\u3002\uff1b\uff01\uff1f", "layout_recognize": true}, "name": "\u6d4b\u8bd5\u7b14\u5f55.txt", "type": "doc", "location": "\u6d4b\u8bd5\u7b14\u5f55.txt", "size": 3633, "tenant_id": "fed540f0c03411ef87ab0242ac160006", "language": "Chinese", "embd_id": "quentinz/bge-large-zh-v1.5@Ollama", "pagerank": 0, "img2txt_id": "llava@Ollama", "asr_id": "", "llm_id": "qwen2@Ollama", "update_time": 1734924177869}
35 task_consumer_0 reported heartbeat: {"name": "task_consumer_0", "now": "2024-12-23T11:25:25.974290", "boot_at": "2024-12-23T11:13:55.255815", "pending": 0, "lag": 0, "done": 1, "failed": 0, "current": null}`

The text was updated successfully, but these errors were encountered:

lxljw1314 · 2024-12-23T03:44:24Z

and the right result in demo env is like this , Why is the execution result different from what I deployed locally?

KevinHuSh · 2024-12-23T08:29:49Z

The graph extraction primarily depends on LLM.
So, which LLM did you choose?

dromeuf · 2024-12-23T08:51:54Z

The graph extraction primarily depends on LLM. So, which LLM did you choose?

Hi Kevin, I have the same problem on last v15.0 with Knowledge Graph. I use snowflake-arctic-embed2 for embedding and qwen2.5:14b for LLM system model settings. The result graph is blank not appears for Know & for Mind. I am testing embedding & small local LLM... result. I have to say that I'm dubious about the results so far.

I'd like to take this opportunity to ask you which LLM RAPTOR depends on for Paper/Book/Gen chunking method (I assume the one defined by default in system Model providers -> System model settings?).

Kind regards and thanks for your great work.

KevinHuSh · 2024-12-24T09:06:25Z

RAPTOR uses the llm set in system Model providers -> System model settings

lxljw1314 added the question Further information is requested label Dec 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question]: Knowledge Graph is blank cant't display #4176

[Question]: Knowledge Graph is blank cant't display #4176

lxljw1314 commented Dec 23, 2024

lxljw1314 commented Dec 23, 2024

KevinHuSh commented Dec 23, 2024

dromeuf commented Dec 23, 2024

KevinHuSh commented Dec 24, 2024

[Question]: Knowledge Graph is blank cant't display #4176

[Question]: Knowledge Graph is blank cant't display #4176

Comments

lxljw1314 commented Dec 23, 2024

Describe your problem

lxljw1314 commented Dec 23, 2024

KevinHuSh commented Dec 23, 2024

dromeuf commented Dec 23, 2024

KevinHuSh commented Dec 24, 2024