Name		Name	Last commit message	Last commit date
parent directory ..
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
algorithm.r		algorithm.r
manifest.json		manifest.json

README.md

Probabilistic Suffix Tree (PST)


Citekey	SunEtAl2006Mining
Source code	http://r-forge.r-project.org/projects/pst
Learning type	unsupervised
Input Dimensionality	univariate

Dependencies

System dependencies (apt)
- build-essential (make, gcc, ...)
- r-base
R-packages
- jsonlite
- PST
- TraMineR
- arules
- pkgcond
- BBmisc

Notes

In the paper Mining for Outliers in Sequential Databases using a PST for anomaly detection is only proposed for discrete data. Since we want to evaluate this algorithm also using continuous data, we have added a discretization step before the actual algorithm. This discretization step discretizes the input time-series into breaks number of buckets by frequency (breaks is a custom parameter which has to be given to the algorithm).

PST computes anomaly scores for sequences. However, this algorithm already converts those anomaly scores for sequences into anomaly scores for points by computing the average anomaly score for each point over all sequences it is included.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pst

pst

README.md

Probabilistic Suffix Tree (PST)

Dependencies

Notes

Files

pst

Directory actions

More options

Directory actions

More options

Latest commit

History

pst

Folders and files

parent directory

README.md

Probabilistic Suffix Tree (PST)

Dependencies

Notes