Lack of documentation about consistency.
Page: https://www.alluxio.io/blog/4-different-ways-to-write-to-alluxio/
Summary
A main difference between HDFS and S3 is consistency. The blog mentions it, but details are lacking. I hope the summary table could include a Consistency column, just like Write Speed and Fault Tolerance.
What I especially want to know is what consistency Alluxio provides when the under storage is S3 and the Write Type is one of MUST_CACHE, THROUGH, CACHE_THROUGH, or ASYNC_THROUGH, and what happens when a worker crashes.
Issue Analytics
- State:
- Created 3 years ago
- Comments: 8 (5 by maintainers)
Top GitHub Comments
No, the data will only be written to the UFS once
Sure thing, I can clarify that part.
If you are writing with ASYNC_THROUGH and the worker crashes before the data is persisted, you will incur data loss. However, we do provide a feature we call "durable writes", which replicates the write across one or more workers synchronously during the write. This protects against data loss when one or more machines fail. You can enable durable writes by setting the property alluxio.user.file.replication.durable. By default it is 1, but if you want to ensure the availability of data in the case of a node loss, you may increase this to 2, 3, or more to produce replicas synchronously during the write.
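As a rough illustration of the setting described above, a client-side configuration might look like the fragment below. It is a sketch, not something taken from this thread: it assumes the settings live in conf/alluxio-site.properties and that the default write type is set through alluxio.user.file.writetype.default. Only alluxio.user.file.replication.durable is named in the comment above, and the value 2 is just an example.

```properties
# Assumed location: conf/alluxio-site.properties (not stated in the thread).

# Assumption: write to Alluxio first and persist to the UFS asynchronously.
alluxio.user.file.writetype.default=ASYNC_THROUGH

# From the comment above: number of replicas written synchronously
# before the data is persisted. The default is 1; 2 is an example value.
alluxio.user.file.replication.durable=2
```

With a value of 2, the file should still be available if a single worker crashes before the asynchronous persist completes, at the cost of writing each block to two workers.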