Error when running Kolmogorov-Smirnov (K-S) tests for continuous numerical feature drift detection on the Iris dataset

I am running into an error when running the Kolmogorov-Smirnov (K-S) tests for data drift detection on the continuous numerical features of the Iris dataset. For reference, the code and the Iris dataset are available at this GitHub link: https://github.com/RitikaKulshresth/Iris_Drift_Detection

The error I get is: ValueError: The internally computed table of expected frequencies has a zero element at (0, 0).

Issue Analytics

State:
Created 2 years ago
Comments: 5 (1 by maintainers)

Top GitHub Comments

1 reaction

RitikaKulshresth commented, Apr 16, 2021

Thanks for your quick help. You can close this issue.

0 reactions

arnaudvl commented, Apr 15, 2021

@yanhong-zhao-ef's comment is correct. Setting categories_per_feature = {0: None, 1: None, 2: None, 3: None} tells the detector that features 0, 1, 2 and 3 are categorical and that their categories should be inferred from the reference data. Since your features are continuous, keep categories_per_feature=None (the default); that should fix your issue.
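To illustrate the difference between the two kinds of test, here is a minimal sketch that uses SciPy directly rather than the drift detector from the issue. It assumes the error originates in scipy.stats.chi2_contingency, which emits a message with this wording. The data is synthetic (not the actual Iris measurements) and the count table is hypothetical, chosen only so that one row sums to zero.

```python
import numpy as np
from scipy.stats import chi2_contingency, ks_2samp

rng = np.random.default_rng(0)

# Synthetic stand-ins for one continuous feature (NOT the real Iris data).
x_ref = rng.normal(loc=5.8, scale=0.8, size=100)
x_test = rng.normal(loc=5.8, scale=0.8, size=50)

# Continuous feature: the two-sample K-S test compares the empirical
# distributions of the raw values, so no category counts are needed.
ks_result = ks_2samp(x_ref, x_test)
print(f"K-S statistic={ks_result.statistic:.3f}, p-value={ks_result.pvalue:.3f}")

# Categorical feature: the chi-squared test works on a table of category
# counts (rows: reference, test). Hypothetical table where no reference
# value matched any category, so the first row is all zeros and every
# expected frequency in that row is zero.
counts = np.array([[0, 0, 0],
                   [20, 15, 15]])
try:
    chi2_contingency(counts)
    chi2_error = None
except ValueError as err:
    chi2_error = str(err)
print(chi2_error)
```

A zero row or column total makes the corresponding expected frequencies zero, which the chi-squared test rejects. Treating continuous measurements as categories makes this kind of sparse table much more likely, which is consistent with the advice above to leave categories_per_feature at its default.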

Top Results From Across the Web

Understanding Kolmogorov-Smirnov (KS) Tests for Data Drift ...

The first step to address those issues is to be able to detect and monitor for data drift. There are multiple approaches to...

Concept Drift Detection Based on Kolmogorov–Smirnov Test

This study aims to automate pipeline design for online learning while ... continuously adapting to data drift. For this ...

Practical Data Drift - NannyML

We call this 'data drift', and for continuous variables the starting point for identifying data drift is the Kolmogorov–Smirnov test.

kolmogorov-smirnov goodness-of-fit test - WorldWideScience

The Kolmogorov-Smirnov (K-S) test is a statistical method often used for comparing two distributions. In high-throughput screening (HTS) studies, ...

Drift detection - Medium

The first two statistical tests KS test & PSI are for unidimensional data (one feature column at a time). Model based test allow...
