question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Metrics "Short-circuit Read" and "Cache Hit Rate" for Alluxio Local does not update.

See original GitHub issue

Alluxio Version: Alluxio-2.0

Describe the bug Metrics like short circuit read and Cache Hit Rate for Alluxio Local does not update or more precisely it always gives zero on running ./alluxio fsadmin report metrics.

So even if it’s caching or not I don’t know because it’s updating other metrics like Under Filesystem Read and Cache Hit Rate for Alluxio Remote. But my Spark-UI shows PROCESS_LEVEL and NODE_LEVEL of data locality.

Note :- flag alluxio.user.metrics.collection.enabled=true is already there in alluxio-site.conf

To Reproduce

val df = spark.read.parquet("alluxio:///path/to/parquet/file")
df.agg(sum("XYZ")).show

Expected behavior If my spark UI shows the data locality level at process-local and node-local then Alluxio metric must show some short circuit read overall.

Additional context To make sure of data-locality I’m running my setup with a single worker node and spark and alluxio-worker both running on that same node.

I also needed one clarification -

spark.read.parquet("alluxio:///path/to/parquet/file") I think by the time spark does this transformation, it gathers node information where it’s going to deploy executors according to data locality. Is it so?

If not then when and how spark get to know the partition location on worker nodes?

Issue Analytics

  • State:closed
  • Created 4 years ago
  • Comments:8 (5 by maintainers)

github_iconTop GitHub Comments

2reactions
jinhangcommented, Mar 11, 2020

I also encountered this problem, did you solve it? @bf8086 Total IO: Short-circuit Read 0B Short-circuit Read (Domain Socket) 0B From Remote Instances 787.84GB Under Filesystem Read 617.89GB Alluxio Write 32.60GB Alluxio Write (Domain Socket) 0B Under Filesystem Write 0B Total IO Throughput (Last Minute): Short-circuit Read 0B Short-circuit Read (Domain Socket) 0B From Remote Instances 7.37KB Under Filesystem Read 0B Alluxio Write 2623B Alluxio Write (Domain Socket) 0B Under Filesystem Write 0B Cache Hit Rate (Percentage): Alluxio Local 0.00 Alluxio Remote 100 Miss 0.00

0reactions
yuzhucommented, Nov 3, 2021

fixed in 2.7.0

Read more comments on GitHub >

github_iconTop Results From Across the Web

Performance Tuning - Alluxio v2.9.0 (stable) Documentation
In Alluxio versions after 2.6, we included an additional metric Cluster.CacheHitRate , which indicates the cache hit ratio. Here the cache hit ratio...
Read more >
List of Metrics - Alluxio v2.9.0 (stable) Documentation
Name Type Description Master.AbsentCacheHits GAUGE Number of cache hits on the absent cache Master.AbsentCacheMisses GAUGE Number of cache misses on the absent cache Master.AbsentCacheSize GAUGE Size...
Read more >
Scalability Tuning - Alluxio v2.9.0 (stable) Documentation
The Alluxio worker's network bandwidth to UFS determines the rate at which it can read data to serve or populate the cache from...
Read more >
List of Configuration Properties - Alluxio v2.9.0 (stable ...
Property Name Default Description alluxio.conf.dynamic.update.enabled false Whether to support dynamic update pro... alluxio.fuse.fs.name alluxio‑fuse The FUSE file system name. alluxio.fuse.special.command.enabled false
Read more >
Architecture - Alluxio v2.9.0 (stable) Documentation
Local Cache Hit ; Remote Cache Hit; Cache Miss; Cache Skip ... Data is only read once from the under storage system when...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found