[KED-2131] Incomplete documentation about the Spaceflights tutorial
See original GitHub issueDescription
The docs regarding the Spaceflights tutorial are incomplete, which makes it harder to successfully finish it.
This has been partially discussed in https://github.com/quantumblacklabs/kedro-examples/issues/58 (including the issue reproducibility). Therefore, I will discuss it in a complementary way.
Context
There is apparently an ongoing internal issue about improving the organization and sync between the repos kedro-examples, kedro-training, and kedro-starter-spaceflights (https://github.com/quantumblacklabs/kedro-training/pull/1).
As I understood, spaceflight full-repo is moving from kedro-examples/kedro-tutorial to kedro-training/kedro/exercises/spaceflight.
I’m not sure about what is being tracked internally, so I will list what I’ve found related to Spaceflights’ requirements.txt:
- Spaceflights tutorial requires
kedro[pandas.CSVDataSet,pandas.ExcelDataSet]in requirements. Still, it is not specified in the docs tutorial. This results in the missing requirementpandasandxlrdwhen trying to load the datasets in Set up the data. kedro-examples/kedro-tutorialis also missing this requirement in itssrc/requirements.txt, which yielded https://github.com/quantumblacklabs/kedro-examples/issues/58.- Both
kedro-trainingandkedro-starter-spaceflightsalready have this requirement in theirsrc/requirements.txt.
Actionable
kedro-examples/kedro-tutorial’srequirements.txtshould be updated to containkedro[pandas.CSVDataSet,pandas.ExcelDataSet].- Docs should be improved to show that
kedro[pandas.CSVDataSet,pandas.ExcelDataSet]is required.Set up the spaceflights project#Install project dependenciesis probably the right place. - This also applies to kedro-training docs at
Create a new project#kedro install. - Latest and stable Kedro docs points to
kedro-examplesas the full source to the spaceflights project. I’m not sure, but I guess that this will/should be eventually changed tokedro-training/kedro-exercises/spaceflight.
I could work on 1., 2., and 3. if it makes sense (note that they are at 3 different repos).
PS: Sorry for the cross-repos references everywhere. I considered that here was the best place to report it.
Issue Analytics
- State:
- Created 3 years ago
- Reactions:1
- Comments:6 (5 by maintainers)

Top Related StackOverflow Question
Thanks for addressing it, @921kiyo.
If there is still time, I’d like to add three more notes regarding the
docsandkedro-training:scikit-learndependency tosrc/requirements.txtand then runskedro install. Shouldn’t it besrc/requirements.in+kedro build-deps && kedro installinstead?kedro-trainingalso usesscikit-learn, and specifies it in the full projectrequirements.txt, but there is no instruction stating thatsklearnshould be installed when followingkedro-training docs.kedro_tutorial.io.xls_local.ExcelLocalDataSet(well, it is actually defined if you follow kedro-training):Closing this as resolved through linked PRs/issues, as well as https://github.com/quantumblacklabs/kedro/commit/589d6a7a329f453ac91662814ec044dc41d1063c and https://github.com/quantumblacklabs/kedro/commit/0fd6b623bdaa2f69623ad4fdacb022a5862fb590 . Please feel free to open a new issue if there are other observations!