I am facing issues while running the Kolmogorov-Smirnov (K-S) tests for continuous numerical feature data drift detection on the Iris dataset

I am facing issues while running the Kolmogorov-Smirnov (K-S) tests for continuous numerical feature data drift detection on the Iris dataset. For reference, I have attached the code and the Iris dataset; the GitHub link is: https://github.com/RitikaKulshresth/Iris_Drift_Detection

I am getting this error: ValueError: The internally computed table of expected frequencies has a zero element at (0, 0).

Issue Analytics

  • State: closed
  • Created: 2 years ago
  • Comments: 5 (1 by maintainers)

Top GitHub Comments

1 reaction
RitikaKulshresth commented, Apr 16, 2021

Thanks for your quick help. You can close this issue.

0 reactions
arnaudvl commented, Apr 15, 2021

@yanhong-zhao-ef's comment is correct. categories_per_feature = {0: None, 1: None, 2: None, 3: None} indeed means that features 0, 1, 2 and 3 are categorical, and that you just want their categories inferred from the reference data. Keeping categories_per_feature=None (the default) treats every feature as numerical, which should fix your issue.
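
To make the distinction concrete, here is a minimal stdlib-only sketch. It is not the library's code: choose_test is a hypothetical helper that only mirrors the rule described in the comment above (a feature index present as a key is treated as categorical, otherwise as continuous), and ks_statistic is a plain textbook two-sample K-S statistic. The sample values are made up for illustration.

```python
from bisect import bisect_right


def choose_test(feature_idx, categories_per_feature=None):
    """Hypothetical helper mirroring the rule from the comment above:
    a feature listed as a key is categorical, otherwise continuous."""
    if categories_per_feature and feature_idx in categories_per_feature:
        return "chi-squared"  # categorical feature
    return "K-S"  # continuous numerical feature


def ks_statistic(ref, test):
    """Two-sample K-S statistic: the largest absolute gap between
    the empirical CDFs of the two samples."""
    ref_sorted, test_sorted = sorted(ref), sorted(test)
    n, m = len(ref_sorted), len(test_sorted)
    gap = 0.0
    # The maximum gap is always attained at one of the observed values.
    for v in set(ref_sorted) | set(test_sorted):
        cdf_ref = bisect_right(ref_sorted, v) / n
        cdf_test = bisect_right(test_sorted, v) / m
        gap = max(gap, abs(cdf_ref - cdf_test))
    return gap


# The mapping from the issue marks all four features as categorical ...
as_reported = {0: None, 1: None, 2: None, 3: None}
tests_reported = [choose_test(i, as_reported) for i in range(4)]

# ... while the default (None) leaves all four features continuous.
tests_default = [choose_test(i) for i in range(4)]

# Made-up measurements for one continuous feature (illustration only).
ref = [5.1, 4.9, 4.7, 5.0]
shifted = [6.1, 5.9, 5.7, 6.0]
no_drift = ks_statistic(ref, ref)
full_drift = ks_statistic(ref, shifted)
```

With the mapping from the issue, every feature is routed to the chi-squared test, which is where the expected-frequencies error is raised; with the default, every feature gets the K-S test that continuous measurements such as the Iris columns call for.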
