question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Odd iSCSI read performance, at or around 1MB block size (TrueNAS Core)

See original GitHub issue

After figuring out the issue on #238 it seems there is an odd performance issue on iSCSI LUNs presented to OpenShift; at least on a Windows 10 VM running in OpenShift Virtualization.

The environment for the performance test is: Dell R730xd OpenShift worker node (Bare Metal), 2x 40Gbps ConnectX-3 Pro NICs in a bond (LACP)
Dell R730xd TrueNAS-13.0-U2 (Bare Metal) with 24x SAMSUNG MZILS1T9HEJH0D3 SAS SSDs in RAID10 (With no tunables set)
Arista 7050QX switch iSCSI presented to OpenShift with democratic-csi helm 0.13.5

I have tried a number of combinations of storage class configuration with inconclusive results, but always read performance degradation somewhere above 512KB block size, with “Disk Active Time” pegged at 100% and very high disk average response time (Sometimes > 6000ms), CPUs are relatively idle during this time with 4vCPUs provisioned… I have not tried MPIO with multiple targets as I know it is best practice with iSCSI links but the LACP policy is set to layer 2 hash, and is primarily there for redundancy and ease of use as TrueNAS hosts NFS as well. Additionally the MTU on the network is 1500, not 9K.

csi-perf-issue-ext4-lz4-16k-false-512

There IS THIS that looks suspicious and is already merged into OpenZFS 2.1.6:
https://github.com/openzfs/zfs/discussions/13448?sort=new
https://github.com/openzfs/zfs/pull/13452

But current TrueNAS Core (13.0-U2) currently runs (And rightfully so, as 2.1.6 just dropped 15 days ago):

truenas# zpool -V
zfs-2.1.5-1
zfs-kmod-v2022081800-zfs_27f9f911a

Adjusting zfetch_array_rd_sz seems to have no effect as PR author points out.

Then again, I have no idea is the issue is in TrueNAS, or something OpenShift does while leveraging the CSI to present storage (ext4/xfs/block) and potentially a filesystem cache that might exist in some intermediary layer… though issue persists with volumeMode: Block. VirtIO drivers are loaded in guest VM:

image

Issue Analytics

  • State:open
  • Created a year ago
  • Comments:44 (12 by maintainers)

github_iconTop GitHub Comments

3reactions
travisghansencommented, Oct 20, 2022

This on the iX radar. I think 13.0-U3 should be out relatively soon and maybe the included 2.1.6 will help.

1reaction
travisghansencommented, Dec 5, 2022

I’d like to chat about spdk with you if you’re willing…

Read more comments on GitHub >

github_iconTop Results From Across the Web

The path to success for block storage | TrueNAS Community
While there is no one-size-fits-all rule, doing anything more than trite VM storage seems to go poorly with less than 64GB RAM.
Read more >
80% max utilization - myth or reality? | TrueNAS Community
I've seen this discussed elsewhere and it's heavily hinted at in TrueNAS that exceeding 80% utilization of a pool is a Very Bad...
Read more >
iSCSI performance, slow read speed. | TrueNAS Community
Hi all. Help solve the problem with low sequential read speed. It is 2 times lower than the write speed. Installed TrueNAS Scale:...
Read more >
Zfs layout - best write performance? | TrueNAS Community
ZFS can guess that there's a good chance that, once you've read the first 1MB block, you might read the second 1MB block,...
Read more >
A hilly iSCSI write performance (extreme COW) - TrueNAS
TrueNAS by default recommends minimal reasonable ZVOL block size, which for your 10-wide RAIDZ should be about 64KB. It may be not good...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found