Skip to content

Releases: abstractqqq/polars_ds_extension

v0.4.5

23 May 01:58
359c747
Compare
Choose a tag to compare

Breaking Changes

Previously, if you want to compare Edit distance between one column and a single string, you would do

df.select(
    pds.str_leven(pl.col("c"), "word")
)

But now you have to do

df.select(
    pds.str_leven(pl.col("c"), pl.lit("word"))
)

The previous behavior will now look for a column named "word" instead of using the word "word."

What's Changed

New Contributors

Full Changelog: v0.4.4...v0.4.5

v0.4.4

12 May 02:18
b61b89a
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.4.3...v0.4.4

v0.4.3

28 Apr 23:43
940beb5
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.4.2...v0.4.3

v0.4.2

19 Apr 02:51
ed1ca70
Compare
Choose a tag to compare

Highlights

In Diagnosis, you can now generate 2d principal component graphs. This can help you visualize higher dimensional data.

dia = DIA(df)
dia.plot_pc2(pl.all().exclude("species"), by = "species")

In addition, the following PCA related queries are available:

df.select(
    pds.query_pca("a", "b") # singular values and weight vectors
).unnest("a")

df.select(
    pds.query_singular_values("a", "b", center = True, as_explained_var=True)
)

The Xi - Correlation is also implemented:

df.select(
    pds.xi_corr("x", "y")
)

What's Changed

Full Changelog: v0.4.1-release...v0.4.2

v0.4.1-release

11 Apr 13:17
d486f5e
Compare
Choose a tag to compare

Original release note

What's Changed

Full Changelog: v0.4.1-fix-release...v0.4.1-release

v0.4.1-fix-release

09 Apr 16:50
448a095
Compare
Choose a tag to compare

v0.4.1

09 Apr 14:30
e58aa0f
Compare
Choose a tag to compare

What's Changed

  • updated packages, OLS now falls back to LU decomp when X^TX is not strictly positive definite. by @abstractqqq in #122

v0.4.0 was yanked due to incomplete upload

v0.4.0 Breaking Changes

For almost all old functions that were invoked like

pl.col("a").name_space.method_call(, ...)

You can now call them via

import polars_ds as pds

pds.method_call(...)

Linters now should recognize the package and all its methods.

v0.4.0 What's Changed

New Contributors

Full Changelog: v0.3.5...v0.4.0
Full Changelog: v0.4.0...v0.4.1

v0.4.1-test-new-ci

09 Apr 21:34
Compare
Choose a tag to compare
v0.4.1-test-new-ci Pre-release
Pre-release
add glob pattern

v0.4.0

06 Apr 21:51
594d8cc
Compare
Choose a tag to compare

Breaking Changes

For almost all old functions that were invoked like

pl.col("a").name_space.method_call(, ...)

You can now call them via

import polars_ds as pds

pds.method_call(...)

Linters now should recognize the package and all its methods.

What's Changed

New Contributors

Full Changelog: v0.3.5...v0.4.0

v0.3.5

25 Mar 16:00
3de1684
Compare
Choose a tag to compare

Breaking Changes:

  1. Now there are two types of random column generation methods: 1. With a column reference. These methods are renamed from sample_xxx to rand_xxx. They behave the same as the old way and must be called with a reference column. New methods generate random columns without any reference, therefore new methods won't respect null and won't use reference's statistics. But new methods are easier to use. Using new methods to get a random df.
import polars as pl
import polars_ds as pds

df = pds.random_data(size=100_000, n_cols = 1).select(
    pds.random(0.0, 12.0).alias("uniform_1"),
    pds.random(0.0, 1.0).alias("uniform_2"),
    pds.random_exp(0.5).alias("exp"),
    pds.random_normal(0.0, 1.0).alias("normal"),
    pds.random_normal(0.0, 1000.0).alias("fat_normal"),
)
df.head()

What's Changed

New Contributors

Full Changelog: v0.3.4-fix-release...v0.3.5