[Website] [Arxiv Paper] [Open Database] [Achievements]
A Large-scale Hierarchical Dynamic Financial Knowledge Base
HiDy is a hierarchical, dynamic, robust, diverse, and large-scale financial benchmark KB that aims to provide various valuable financial knowledge as critical benchmarking data for fair model testing in different financial tasks. Specifically, HiDy currently contains 34 relation types, more than 505,800 relations, 17 entity types, and more than 51,000 entities. The scale of HiDy is steadily growing due to its continuous updates. To make HiDy easily accessible and retrieved, HiDy is organized in a well-formed financial hierarchy with four branches, Macro, Meso, Micro, and Others.
With HiDy, users can apply more in-depth, professional, logical, and interpreted knowledge to many common financial tasks, such as stock movement prediction (SMP), financial fraud detection (FFD), supply chain management (SCM), loan default risk prediction (LDRP) and financial event prediction (FEP).
🎉 NEWS:
- We now have updated to 1.10 version of the hierachical dynamic financial knowlegde base
HiDy
in Zenedo. - We have open-sourced the
Benchmarking Models of SMP and FFD Tasks
along with the test data. - We have open-sourced the
Stock Movement Prediction and Backtesting Demo Website
equipped with HiDy's knowledge. - We have open-sourced the
Pre-trained Extraction Models
. - We have open-sourced the
Knowledge Extraction Implementation
. - We have published the 1.0 version of the hierachical dynamic financial knowlegde base
HiDy
in Zenedo.
To install the cutting edge version of Knowledge Extraction Implementation
from the main branch of this repo, run:
git clone https://github.com/K-Quant/HiDy.git
cd HiDy
pip install -r requirements.txt
Our databases is open-access and available in Zenedo! The data description is shown in the following table:
- HiDy owns the metadata and releases as CC BY-NC 4.0.
- HiDy owns the copyright of the extracted data denoted by their relation type: mention, support_industry, tax_cut, tax_cut_subject, validity_period, supply / be_supplied, upstream / downstream, superior / subordinate, same_industry, increase_holding / be_increased_holding, reduce_holding / be_reduce_holding, invest / be_invested, cooperate, compete, rise, fall, dispute, positive, negative as CC BY-NC 4.0.
- HiDy does not own the copyright of the collected data by querying the Internet denoted by their relation type: SW_belongs_to, industry_belongs_to, hold, managed_by, has_invest_type_of, company_belong_to_exchange, exchange_belong_to_market, company_locate_in_city, city_locate_in_region, produce.
We present multiple benchmarking results for SMP and FFD tasks to foster community involvement in HiDy paper.
The Website provides users with various stock movement prediction models with backtesting. Among them, HIST and NRSR are equipped with HiDy dataset.
Although the validation results are promising, HiDy is
The codebase (this repo) is licensed under a Apache 2.0 License.
The HiDy dataset is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.