Create a JupyterHub "exchange" service to replace the exchange directory
See original GitHub issueI would eventually like to replace the nbgrader exchange directory with a more robust solution, namely, a JupyterHub “exchange” service that manages released assignments, submissions, and feedback. This has the drawback that people won’t be able to use nbgrader’s file management capabilities unless they are using JupyterHub. In practice, I don’t think anyone is using nbgrader’s file management capabilities without JupyterHub anyway, though (but if I am wrong about this, someone should correct me!). Here’s how I will imagine this working.
@lgpage @minrk @ellisonbg @willingc @dsblank I would appreciate any feedback you have on this proposal!
Permissions
All authentication will be handled by JupyterHub, which will tell the service who the current user is and what group(s) they are a part of. The exchange service will handle any number of courses on the same machine, and for each course, will require that there are two groups specified: one for instructors (who are allowed to release, fetch, submit, and collect assignments and return feedback) and one for students (who are allowed to fetch and submit assignments and download feedback). This would be configured something like this:
c.ExchangeApp.groups = {
'course1': dict(instructors='instructors_course1', students='students_course1'),
'course2': dict(instructors='instructors_course2', students='students_course2'),
...
}
API
The exchange service will define a REST api that the nbgrader commands (release, fetch, submit, collect, etc.) can access.
/api/assignments
GET /api/assignments/<course_id>
– list all assignments for a course (students+instructors)
/api/assignment
GET /api/assignment/<course_id>/<assignment_id>
– download a copy of an assignment (students+instructors)POST /api/assignment/<course_id>/<assignment_id>
– release an assignment (instructors only)
/api/submissions
GET /api/submissions/<course_id>/<assignment_id>
– list all submissions for an assignment from all students (instructors only)GET /api/submissions/<course_id>/<assignment_id>/<student_id>
– list all submissions for an assignment from a particular student (instructors+students, though students are restricted to only viewing their own submissions)
/api/submission
POST /api/submission/<course_id>/<assignment_id>/<student_id>
– submit a copy of an assignment (students+instructors)GET /api/submission/<course_id>/<assignment_id>/<student_id>
– download a student’s submitted assignment (instructors only)
/api/feedback
POST /api/feedback/<course_id>/<assignment_id>/<student_id>
– upload feedback on a student’s assignment (instructors only)GET /api/feedback/<course_id>/<assignment_id>/<student_id>
– download feedback on a student’s assignment (instructors+students, though students are restricted to only viewing their own feedback)
Exchange implementation
Under the hood, the exchange service will continue to store files directly on the filesystem, but they will all have the same permissions (read and write only for the user running the exchange service). I think this is a better option that doing it with a database because we don’t really need any fancy relational features here and this also makes it easier for instructors to inspect files in the exchange manually. If someone feels strongly that a database should be used then I might be able to be convinced otherwise, though.
Regardless, I do want to implement some form of checksumming, though, because I have noticed at least in the current implementation that sometimes if the system is under heavy load that the submissions are occasionally incomplete or corrupted (e.g. missing timestamp.txt or something).
Existing nbgrader apps
The existing nbgrader apps will be reworked to make requests to the exchange API rather than copying to and from the exchange directory.
One thing I am not quite sure of is how the command line apps get properly authenticated, because the authentication is normally happening in the browser, not the command line. I see two possible solutions:
- One solution to this is to say that these commands can only be used through the server extension, and then have that extension pass the authentication information to the command line apps. This is probably the easiest but then it means you can’t just run the commands from the command line anymore.
- The other solution is to require some how that users re-authenticate from the command line. I am not really sure how to this in a general way that handles all the forms of authentication that JupyterHub uses. Maybe @minrk can weigh in on the feasibility of this, but from what I know about how this works it doesn’t seem like a particularly feasible option to me?
Issue Analytics
- State:
- Created 7 years ago
- Reactions:3
- Comments:26 (25 by maintainers)
Top GitHub Comments
A page to get a token would be helpful!
On Mon, Feb 6, 2017 at 2:33 AM, Min RK notifications@github.com wrote:
– Brian E. Granger Associate Professor of Physics and Data Science Cal Poly State University, San Luis Obispo @ellisonbg on Twitter and GitHub bgranger@calpoly.edu and ellisonbg@gmail.com
I believe we can close this.
In response to:
We have the following solution:
Each course gets it’s own database in a central database server, and a directory on a central FileStore server.
When an instructor starts their notebook server in our system, the database URL is calculated & set for that course, and the directory in the central FileStore is mounted. We also set
c.CourseDirectory.root
to a path specific for that course.Thus all instructors have access to the same database, and the same course files:
source
,release
,submitted
,autograded
,feedback
.… Oh, and we found it useful to set
c.CourseDirectory.directory_structure = '{nbgrader_step}/{assignment_id}/{student_id}'
- but that’s just us…[How 10 markers manage the 200 submissions is not in the solution: they all see the same dataset]