Different data owners
See original GitHub issueHow can we use this library for “actual” different data owners?
For example, in the “simple-average” example: https://github.com/tf-encrypted/tf-encrypted/blob/master/examples/simple-average/run.py
It works fine because each owner’s data is randomly generated in runtime:
def provide_input() -> tf.Tensor:
return tf.random_normal(shape=(10,))
Hence each machine will generate its own data during graph execution.
But what if the data should come from external sources, like a .csv file? For example, ‘inputter-0’ has “data0.csv” on its machine 0, ‘inputter-1’ has “data1.csv” on its machine 1, etc… NOTE that these files are NOT on the machine that executes the script “run.py”.
Issue Analytics
- State:
- Created 4 years ago
- Comments:8 (5 by maintainers)
Top Results From Across the Web
The Datamasters: Data Owners vs. Data Stewards vs ... - Blog
A Data Owner is accountable for Data Governance outcomes, whereas a Data Steward is responsible for the Data Governance tasks required to ...
Read more >Assigning Data Ownership - The Data Governance Institute
Assigning Data Ownership · – A person (staff member, contractor, partner, supplier, etc.) · – A business resource (the Data Owner, the person's...
Read more >Assigning Data Owners for GDPR Compliance - Talend
Data owners are either individuals or teams who make decisions such as who has the right to access and edit data and how...
Read more >Data Owners vs. Data Stewards vs. Data Custodians
A data owner is a person who is generally in a senior company position, responsible for the categorization, protection, usage, and quality of ......
Read more >Data Ownership - The Office of Research Integrity
Creator – The party that creates or generate data · Consumer – The party that uses the data owns the data · Compiler...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Hi @mortendahl , thanks for the response!
I tried some simple code with tf.data, indeed, it solves my problem 👍
From what I can tell, TensorFlow IO aims to produce a
tf.data.Dataset
object from a variety of file system backends. Since we don’t make any assumptions about how this object is created, just that it exists, it should work without any adjustments to TFE (in theory!). Would love to see an example/POC around this.