-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #55 from jakobdanel/results/methods
Write subsection about analysis of distributions
- Loading branch information
Showing
7 changed files
with
222 additions
and
123 deletions.
There are no files selected for viewing
Large diffs are not rendered by default.
Oops, something went wrong.
Large diffs are not rendered by default.
Oops, something went wrong.
Binary file not shown.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,20 @@ | ||
## Analysis of different distributions | ||
|
||
Analysis of data distributions is a critical aspect of our research, with a focus on comparing two or more distributions. Our objective extends beyond evaluating the disparities between species; we also aim to assess differences within a species. To gain a comprehensive understanding of the data, we employ various visualization techniques, including histograms, density functions, and box plots. | ||
|
||
In tandem with visualizations, descriptive statistics, such as means, standard errors, and quantiles, are leveraged to provide key insights into the central tendency and variability of the data. | ||
|
||
For a more quantitative analysis of distribution dissimilarity, statistical tests are employed. The Kullback-Leibler (KL) difference serves as a measure to compare the similarity of a set of distributions. This involves converting distributions into their density functions, with the standard error serving as the bandwidth. The KL difference is calculated for each pair of distributions, as it is asymmetric. For the two distributions the KL difference is defined as following [@kullback1951kullback]: | ||
|
||
$$ | ||
D_{KL}(P \, \| \, Q) = \sum_i P(i) \log\left(\frac{P(i)}{Q(i)}\right) | ||
$$ | ||
|
||
To obtain a symmetric score, the Jensen-Shannon Divergence (JSD) is utilized [@grosse2002analysis], expressed by the formula: | ||
|
||
$$ | ||
JS(P || Q) = \frac{1}{2} * KL(P || M) + \frac{1}{2} * KL(Q || M) | ||
$$ | ||
Here, $M = \frac{1}{2} * (P + Q)$. The JSD provides a balanced measure of dissimilarity between distributions [@Brownlee2019Calculate]. For comparing the different scores to each other, we will use averages. | ||
|
||
Additionally, the Kolmogorov-Smirnov Test is implemented to assess whether two distributions significantly differ from each other. This statistical test offers a formal evaluation of the dissimilarity between empirical distribution functions. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -9,7 +9,21 @@ toc-title: Contents | |
number-sections: true | ||
number-depth: 3 | ||
date: today | ||
author: Jakob Danel and Frederick Bruch | ||
author: | ||
- name: Jakob Danel | ||
email: [email protected] | ||
url: https://github.com/jakobdanel | ||
affiliations: | ||
- name: Universität Münster | ||
city: Münster | ||
country: Germany | ||
- name: Federick Bruch | ||
email: [email protected] | ||
url: https://www.uni-muenster.de/Geoinformatics/institute/staff/index.php/351/Frederick_Bruch | ||
affiliations: | ||
- name: Universität Münster | ||
city: Münster | ||
country: Germany | ||
bibliography: references.bib | ||
execute-dir: .. | ||
prefer-html: true | ||
|
@@ -23,7 +37,7 @@ This report documents the analysis of forest data for different tree species. | |
|
||
{{< include methods/data-aquisition.qmd >}} | ||
{{< include methods/preprocessing.qmd >}} | ||
|
||
{{< include methods/distribution-analysis.qmd >}} | ||
# Results | ||
{{< include results/researched-areas.qmd >}} | ||
|
||
|
@@ -43,6 +57,8 @@ This report documents the analysis of forest data for different tree species. | |
|spruce |oberhundem | 0.0162678| | ||
|spruce |osterwald | 0.0129892| | ||
|
||
|
||
|
||
# References | ||
|
||
::: {#refs} | ||
|