Packageify fastMRI
This post is intended to both introduce the issue and be modified to keep track of progress.
The fastMRI repository was originally designed for wholly self-contained experiments. This was good for readability as it kept all experimental parameters, models, and training scripts in a single file. At the time, the only deep learning model in the repository was a U-Net, and so having modules or reusable components wasn’t as much of a priority.
Since then we’ve introduced a number of new components to the repository, including the end-to-end variational network and new mask sampling, and we’ve adopted PyTorch Lightning. As a result, modular components could be more useful now, but we’ve generally kept the old structure. Moving towards a more modular structure would compartmentalize things a bit more, making it easier for users to mix our modules and samplers into their own projects. It would also let us clearly separate the experiment configurations used for leaderboard submissions, with their exact hyperparameters, and keep those experiments frozen in time.
Toward this end, I’d like to propose that we “packageify” fastMRI. This will largely be a code refactor with a few new areas. Essentially, I’m thinking of the following structure:
.
├── fastmri
│   ├── data
│   │   ├── dataset.py
│   │   ├── subsample.py
│   │   ├── transforms.py
│   │   └── volume_sampler.py
│   ├── models
│   │   ├── unet.py
│   │   └── varnet.py
│   ├── evaluate.py
│   ├── recon.py
│   └── mri_module.py
├── experimental
│   ├── cs
│   │   └── run_bart.py
│   ├── unet
│   │   ├── train_unet_demo.py
│   │   ├── unet_brain_challenge_submission_YYYY-MM-DD.py
│   │   └── unet_module.py
│   ├── varnet
│   │   ├── train_varnet_demo.py
│   │   ├── varnet_brain_challenge_submission_YYYY-MM-DD.py
│   │   └── varnet_module.py
│   ├── zero_filled
│   │   └── run_zero_filled.py
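With a layout like this, reusable pieces could be imported directly into downstream projects. A rough sketch of what that usage might look like (the class names below are assumptions based on the proposed module paths, not a committed API):

```python
# Hypothetical usage of the proposed package layout; RandomMaskFunc,
# UnetDataTransform, and Unet are assumed names, not a final interface.
from fastmri.data.subsample import RandomMaskFunc
from fastmri.data.transforms import UnetDataTransform
from fastmri.models.unet import Unet

# Mask sampler and data transform for 4x-accelerated training.
mask_func = RandomMaskFunc(center_fractions=[0.08], accelerations=[4])
transform = UnetDataTransform(which_challenge="singlecoil", mask_func=mask_func)

# The model is just another importable component.
model = Unet(in_chans=1, out_chans=1, chans=32, num_pool_layers=4)
```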
The files in `experimental` would import `training_module`, update it with their own model and training parameters, and these scripts would be committed to GitHub to make reproducibility more obvious.
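For example (an illustrative sketch only; `MriModule`, `UnetModule`, and the hyperparameters shown are placeholders rather than a committed interface), an experimental U-Net script might look roughly like this:

```python
# Illustrative sketch: MriModule, UnetModule, and the hyperparameters below
# are placeholders for whatever the refactor actually defines.
import pytorch_lightning as pl

from fastmri.mri_module import MriModule  # shared metrics/logging/training logic


class UnetModule(MriModule):
    """Experiment-specific module: chooses the model and its hyperparameters."""

    def __init__(self, chans=32, num_pool_layers=4, lr=1e-3, **kwargs):
        super().__init__(**kwargs)
        self.save_hyperparameters()  # record exact settings for reproducibility
        # model construction (e.g. from fastmri.models.unet) would go here


if __name__ == "__main__":
    # Committed submission scripts would pin these values so the exact
    # leaderboard configuration stays frozen in the repository.
    model = UnetModule(chans=32, num_pool_layers=4, lr=1e-3)
    trainer = pl.Trainer(max_epochs=50)
    # trainer.fit(model, ...)  # data loading wiring omitted
```

The point of the pattern is that the package carries the reusable logic while each committed script carries only the experiment-specific configuration.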
Task list for completing this issue:
- Refactor basic fastMRI models (`unet`, `varnet`, `zero_fill`, `cs`) into the framework outlined in this issue.
- Run new experiments and test leaderboard submissions.
- Update all PyTorch Lightning training modules to the current version. PTL often breaks backwards compatibility, but hopefully this will be less of a problem going forward.
- Formatting: apply `black` and Google comment style formatting consistently throughout the package.
- Update `requirements.txt` to more up-to-date packages and use `>=` rather than `==`.
- (Optional) Refactor all tests to a `tests` directory at the top level.
- (Optional) Distribute on PyPI.
Making these changes should not require modifications to existing working code. Once they’re done we can decide about deprecating and removing old folders.
Top GitHub Comments
We can move the banding removal code under `experimental`, perhaps? I am hesitant to try to combine it with the rest of the codebase, as even a minor change may break reproducibility.
At this point, the core components of the refactor have been completed from the former code, and new models have been submitted to the leaderboard. The code for generating the leaderboard models is now in `experimental/varnet/varnet_brain_leaderboard_submission_2020-08-21.py` and `experimental/unet/unet_brain_leaderboard_submission_2020-08-18.py`.

PyPI distribution is still undecided; we may need to discuss this further. It is also undecided whether to move the PyTorch Lightning training modules into the main package folder for distribution or leave them in `experimental`.