SDFLoader doesn't handle zipped/gzipped files
~The dc.utils.save.load_sdf_files function operates entirely in memory. This means that loading a large .sdf file with SDFLoader
also happens entirely in memory and isn’t scalable to very large SDF files.~ SDFLoader
also doesn’t have a good way to handle zipped/gzipped files.
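As a rough illustration of what zipped/gzipped support could involve, here is a minimal sketch that uses only the Python standard library. The helper names `open_sdf_text` and `iter_sdf_records` are hypothetical and are not part of DeepChem. The sketch assumes the compression type can be inferred from the file extension and that a zip archive contains at least one `.sdf` member.

```python
import contextlib
import gzip
import io
import zipfile


@contextlib.contextmanager
def open_sdf_text(path):
    """Yield a text handle for a plain, gzipped, or zipped SDF file.

    Hypothetical helper for illustration; not a DeepChem function.
    """
    if path.endswith(".gz"):
        # gzip.open in "rt" mode decompresses and decodes on the fly.
        with gzip.open(path, "rt", encoding="utf-8") as handle:
            yield handle
    elif path.endswith(".zip"):
        with zipfile.ZipFile(path) as archive:
            # Assumption: use the first member whose name ends in .sdf.
            members = [n for n in archive.namelist() if n.lower().endswith(".sdf")]
            if not members:
                raise ValueError(f"no .sdf member found in {path}")
            with archive.open(members[0]) as raw:
                with io.TextIOWrapper(raw, encoding="utf-8") as handle:
                    yield handle
    else:
        with open(path, "r", encoding="utf-8") as handle:
            yield handle


def iter_sdf_records(path):
    """Yield one SDF record at a time as a string.

    Records are split on the '$$$$' delimiter line. Any trailing text
    after the last delimiter is ignored.
    """
    with open_sdf_text(path) as handle:
        lines = []
        for line in handle:
            if line.strip() == "$$$$":
                yield "".join(lines)
                lines = []
            else:
                lines.append(line)
```

Each yielded record could then be handed to whatever molecule parser the loader already uses. Reading record by record also avoids decompressing the whole archive to disk first.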
Issue Analytics
- State:
- Created 3 years ago
- Comments: 5 (3 by maintainers)
Top GitHub Comments
@BusinessFawn Thanks for volunteering here 😃. I’m thinking just a gzipped/zipped .sdf file would be the desired input to support. One of the sample SDF test files used by SDFLoader, but just gzipped/zipped, should be a good test case.

Hi, @rbharath. I have submitted a PR. Kindly have a look if you have time for it.