question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Serializable Fonduer model

See original GitHub issue

Is your feature request related to a problem? Please describe.

I develop a Fonduer-based app locally on my laptop. Once it’s done, I’d like to package the whole Fonduer pipeline (parsing, extraction, featurization, and classification) and deploy it to a remote place to serve. However, a Fonduer-based app is not easy to package hence not easy to deploy.

Describe the solution you’d like

Add a Fonduer model class that is

  1. Serializable (e.g., a class with save and load member methods like below)
class FonduerModel:
    def save(path_to_save):
    def load(path_to_load):
  1. Capable of executing any phase of the Fonduer pipeline
  2. (Hopefully) Manageable by MLflow

Describe alternatives you’ve considered

I can create one or more of python scripts that do all the phase, package them, and deploy it. This is cumbersome because the python script has to include many things (matchers, mention_classes, mention_spaces, candidate_classes, etc.) and it is not obvious what should be included for serving.

Additional context

I’d like to make Fonduer more deployable and servable. I’ve been testing MLflow to package a Fonduer-based app and found it was difficult to do so when there is no serializable Fonduer model.

Issue Analytics

  • State:closed
  • Created 4 years ago
  • Comments:8 (4 by maintainers)

github_iconTop GitHub Comments

1reaction
HiromuHotacommented, Apr 24, 2020

I think fonduer-mlflow became in good shape and ready for to be submitted as a PR against fonduer. Let me create a PR and submit it.

1reaction
HiromuHotacommented, Aug 28, 2019

@SenWu @trungtv I’ve created a new repository (https://github.com/HiromuHota/fonduer-mlflow) for this custom MLflow model for Fonduer. @SenWu I’d like this custom MLflow model for Fonduer (fonduer_model.py) to be merged to the Fonduer repository in the future. So please take a look at the repository and get familiar with it. Let me know if you have any question, suggestion, etc.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Machine Learning Model Serialization - Christopher Flynn, PhD
The most common method is to serialize the model using some particular format after training, and deserialize that model in the production ...
Read more >
Make a Python Class JSON Serializable - PYnative
In this article, I will show you how to serialize arbitrary Python objects to JSON so that you can convert any custom Python...
Read more >
A Complete Serialization Example | Domain Modeling Made ...
To demonstrate the practice of serializing and deserializing a domain object to and from JSON, let's build a small example.
Read more >
Real transactions are serializable - Cockroach Labs
Ben is the co-founder and Chief Architect of Cockroach Labs where he worked on the distributed consensus protocols that underpin CockroachDB's ...
Read more >
How to use Serializers in the Django Python web framework
Model serializers. When serializing data, you often need to do it from a database, therefore, from your models. A ModelSerializer, like a ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found