FracdiffStat enormously slow
See original GitHub issueHi! I was trying to differentiate an Ethereum price dataset (attached below) but it is taking more than 1h and also uses more than 10 GB of memory. I don’t mind the high memory usage, but it’s taking too much time
from fracdiff import FracdiffStat
import pandas as pd
df = pd.read_csv('ETHUSDT.csv.gz')
FracdiffStat().fit_transform(df.values)
Issue Analytics
- State:
- Created 2 years ago
- Comments:9 (9 by maintainers)
Top Results From Across the Web
FracdiffStat - fracdiff documentation - GitHub Pages
A scikit-learn transformer to compute fractional differentiation, where the order is chosen as the minumum order that makes fracdiff stationary. Parameters:.
Read more >Fracdiff: Super-fast Fractional Differentiation - GitHub
FracdiffStat : Automatically fracdiff which makes a set of time-series stationary while preserving their maximum memory. Compatible with scikit-learn API.
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Thank you! There is in fact a gpu implementation of KPSS https://github.com/rietmann-nv/cuml/pull/1/files
Thanks for letting me know!
I think the ADF test (
adfuller
in statsmodels) is the bottleneck. As you point out, it performs linear regression which needs matrix multiplication and inversion.Fractional differentiation is the linear order of n_samples (since it is essentially convolution) and order is binary-searched. These operations are optimized and unlikely to be the bottleneck.