caseSensitiveFS=false (the default) has a scan performance penalty that scales up, and it can be extreme

For any scenario where local reads (directory listings and similar metadata operations, not necessarily raw storage bandwidth) take a little longer, scanning can get 100x, possibly 1000x slower or more: from tens of seconds or minutes to days or weeks. Real-life examples: macOS (an example from the forum, although I’m not familiar with the mechanism that slows it down there), GlusterFS (that one is clear), and modern Android (clear too). I presume that with a regular block device with full access, cached by the OS, everything goes quickly and no slowdown is noticeable. The extra ReadDir() mentioned in one of the threads below, which is done for each file, most likely grows linearly with the number of files, so scanning time grows with the square of the number of files rather than linearly, which can be very bad.
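Roughly, in code form, the effect described above (a simplified sketch with hypothetical names, not Syncthing’s actual implementation) looks like this:

package sketch

import (
    "fmt"
    "os"
    "strings"
)

// realCaseLookup stands in for the per-file case check described above
// (hypothetical and simplified, not Syncthing's actual code): to verify the
// on-disk casing of one name, it lists the whole parent directory.
func realCaseLookup(dir, name string) (string, error) {
    entries, err := os.ReadDir(dir) // one full directory listing per file
    if err != nil {
        return "", err
    }
    for _, e := range entries {
        if strings.EqualFold(e.Name(), name) {
            return e.Name(), nil
        }
    }
    return "", os.ErrNotExist
}

// scan walks one flat directory with N files. Every file triggers another
// ReadDir of the same directory, so the scanner touches on the order of N*N
// entries in total: double the file count and the work roughly quadruples.
func scan(dir string) error {
    entries, err := os.ReadDir(dir)
    if err != nil {
        return err
    }
    for _, e := range entries {
        if _, err := realCaseLookup(dir, e.Name()); err != nil {
            return fmt.Errorf("%s: %w", e.Name(), err)
        }
    }
    return nil
}

When the listing itself is fast and OS-cached the N*N entry touches are cheap; when each listing takes noticeable time, the quadratic growth dominates.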

I think there should be some mechanism that makes it clear what’s going on, not some cryptic advanced configuration with a VERY misleading description in the documentation: “will provide a small improvement in performance”.

Examples:

Any of the threads from various places about being “stuck at scanning” an Android WhatsApp backup/media directory (there is nothing wrong with syncing that folder, and it’s not an issue of the files being updated all the time or kept busy; it’s just this).


Yeah. I’ve proposed using a much longer-lived cache at some point; someone needs to investigate.

The last time I had a look, the cache timestamp was created before obtaining the files from the scanner.

So with a slow directory listing you’d always end up with instantly outdated entries.
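The ordering problem looks roughly like this (a sketch with hypothetical names, not the actual cache code):

package sketch

import (
    "os"
    "time"
)

// Hypothetical TTL; a sketch of the ordering problem described above, not
// Syncthing's actual cache code.
const cacheTTL = time.Second

type cacheEntry struct {
    names   []string
    created time.Time
}

// valid reports whether the entry is still inside its TTL.
func (e cacheEntry) valid(now time.Time) bool {
    return now.Sub(e.created) < cacheTTL
}

// load timestamps the entry *before* the possibly slow listing. If ReadDir
// takes longer than cacheTTL, the entry is already expired by the time it is
// stored, and the next lookup lists the directory all over again.
func load(dir string) (cacheEntry, error) {
    created := time.Now()           // timestamp taken first...
    entries, err := os.ReadDir(dir) // ...then the slow directory listing
    if err != nil {
        return cacheEntry{}, err
    }
    names := make([]string, 0, len(entries))
    for _, e := range entries {
        names = append(names, e.Name())
    }
    return cacheEntry{names: names, created: created}, nil
}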

While finding a way to speed this up would of course be preferable (if it can be done both safely and efficiently at all, given there is no way to just get a consistent atomic snapshot of a directory from the OS), I’d say we shouldn’t let perfect be the enemy of good.

Just general awareness would go a long way here: ideally a mention in the documentation instead of the laughable “will provide a small improvement in performance”, and maybe even a runtime warning if the scan takes more than some threshold (5 minutes? 1 h? 12 h?).
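Something along these lines, say (a sketch only; this is not an existing Syncthing feature and the threshold is a placeholder):

package sketch

import (
    "log"
    "time"
)

// scanWithWarning sketches the suggested runtime warning: if a scan runs
// longer than the threshold, log a hint pointing at caseSensitiveFS.
func scanWithWarning(folder string, threshold time.Duration, scan func() error) error {
    timer := time.AfterFunc(threshold, func() {
        log.Printf("folder %q: scan has been running for more than %v; "+
            "if the filesystem is known to be case-sensitive, setting "+
            "caseSensitiveFS=true may speed it up considerably", folder, threshold)
    })
    defer timer.Stop()
    return scan()
}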

The last use case I mentioned, the WhatsApp directory, comes up in several places (one example below), but NOWHERE have I found the simple fix of just flipping caseSensitiveFS. People go off on all the wrong tangents while investigating this, including recommending other tools. There’s nothing wrong with Syncthing here: it would work quickly, reliably and efficiently if only this extra safety check (completely unnecessary in this case, as in most) were disabled. This would also benefit people who think it works fine for them when the scan takes an hour (not knowing it could finish in 5 seconds), and who don’t realize that “works for me, doesn’t work for you” hits users for whom it should finish in 1 minute but instead takes a week (and it effectively never finishes if it gets restarted from time to time, as on phones).

Might also help. Speeding up normalization has the nice side effect of keeping the cache entries alive for a bit longer.

If I find the time to further tinker with it in the foreseeable future :sweat_smile:

As always, there’s the question of what’s representative. On my Mac laptop, which is one of those environments where this is deemed problematic, my Lightroom database folder is ~7000 files, ~8000 directories. That’s not enormous in any way at all, but it’s the same magnitude as the median user according to usage stats.

Scanning that folder with default settings takes 5.6 seconds. Scanning it in case-sensitive mode takes 3.7 seconds. For sure it’s a significant difference percentage-wise, but the absolute time is still insignificant in either case.

When writing docs, and code, I can usually only base it on what I see.

The problem is how it scales (it seems quadratic in the number of files, or even worse when most of the files sit in one large directory instead of being spread across a deeper structure).
Is there enough empirical information provided, and/or known from the inner workings of the software, to consider this thing “seen” now, or are more specific tests needed?

We have no data on how this performs in the wild, really, although we could add some timing statistics to the usage reporting data. I’m pretty sure there’s nothing quadratic about what it does, but I do think it scales with the width times depth of a given directory structure, though the cache is intended to handle the depth aspect (all the parent lookups). It’s quite possible the cache is ineffective when the lookups themselves become slow enough; I’ve suggested increasing the cache time.

Ideally, someone who actually has a problem with this could figure out what the problem is and see if tweaking the existing values has an effect. That’s harder to do when it’s working as intended on all the systems I see.

What we do have, though, is the metrics endpoint, which among other things shows the number of FS operations we do and the time spent doing them. If the problem is long listdirs, for example, that would show up there. Restarting Syncthing, letting it run one of those problematic scans (or as much of one as you have patience for), and pasting a copy of the metrics output would show what Syncthing sees when talking to the filesystem, to begin with.

% curl -H "Authorization: Bearer $apikey" http://localhost:8384/metrics

As for where to tweak the cache settings, that would be these:

both of which may be way too low for large setups (or slow setups, like some GlusterFS thing, etc.).

@calmh we could parallelize this section, as the expensive part is the Unicode normalization, which is 99% CPU/alloc bound.

Whether we want parallelization this deep down is another question, on the other hand :person_shrugging:
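For illustration, the parallelization idea in rough code form (a sketch under assumptions, not the actual proposed change) could look like this:

package sketch

import (
    "runtime"
    "sync"

    "golang.org/x/text/unicode/norm"
)

// normalizeAll NFC-normalizes a batch of names with one worker per CPU.
// This only pays off if the batch is large enough for the work to actually
// be CPU-bound; for a handful of names the goroutine overhead dominates.
func normalizeAll(names []string) []string {
    out := make([]string, len(names))
    workers := runtime.GOMAXPROCS(0)
    chunk := (len(names) + workers - 1) / workers
    var wg sync.WaitGroup
    for start := 0; start < len(names); start += chunk {
        end := start + chunk
        if end > len(names) {
            end = len(names)
        }
        wg.Add(1)
        go func(start, end int) {
            defer wg.Done()
            for i := start; i < end; i++ {
                out[i] = norm.NFC.String(names[i])
            }
        }(start, end)
    }
    wg.Wait()
    return out
}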


So you say, but my feeling is the CPU cost is negligible compared to the system call cost/latency. Listing a large directory can take seconds; normalising a string will be microseconds even on a Raspberry Pi…

I agree, the syscall is still orders of magnitude slower than everything we do afterwards, but it’s something we can’t optimize.

While readdir is quite costly, we’re using a lot of CPU time for the unicode stuff.


We can, in the sense of just not doing so many syscalls. It’s not that I want to prioritize speed over safety of the data, but at least it should be spelled out: a factor of 35x speedup as in the last example is nothing to sneeze at, versus an overcautious protection that might not even be applicable for that setup.
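To illustrate what “not doing so many syscalls” could mean in the abstract (a simplified sketch, not Syncthing’s code; strings.ToLower stands in for proper Unicode case folding):

package sketch

import (
    "os"
    "strings"
)

// resolveCasing lists a directory once and resolves the on-disk casing of all
// requested names from that single listing, instead of re-listing the
// directory for every file, keeping the total work roughly linear in the
// number of entries. The trade-off is that the listing can go stale while it
// is being reused.
func resolveCasing(dir string, names []string) (map[string]string, error) {
    entries, err := os.ReadDir(dir) // one listing per directory, not per file
    if err != nil {
        return nil, err
    }
    onDisk := make(map[string]string, len(entries))
    for _, e := range entries {
        onDisk[strings.ToLower(e.Name())] = e.Name()
    }
    resolved := make(map[string]string, len(names))
    for _, name := range names {
        if real, ok := onDisk[strings.ToLower(name)]; ok {
            resolved[name] = real
        }
    }
    return resolved, nil
}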

I am testing how this scales with the number of files when listdir (or something similar) is slow, and the only platform from the ones mentioned that I have easily available is Android. I think that’s for the best, as it’s popular (if not for Syncthing, then still billions of devices deployed), and Syncthing is one of the VERY rare programs that actually syncs both ways (most anything from Nextcloud to Google Drive is just a pimped-up web view of the server, not an actual sync with the local files). In any case the long runs take a while to clock, by definition; I’ll come back with the results.


Turning on caseSensitiveFS cut scan time from 45-60 minutes to under 10 on my system.

This should be called out in the documentation prominently and perhaps the switch should go into the main config interface for folders.

And here are the results. Each line shows the times in seconds for 3 consecutive rest/db/scan calls on the same files, with the number of files shown at the end (4 files, 8 files, and so on). The 2nd and 3rd scans should do the same thing (just walk the unchanged directory) and they do give the same results repeatedly (usually within 1%). While I took extreme care to get such repeatable results, the absolute numbers certainly won’t be relevant for anyone else. However, my initial hunch that the default scenario is quadratic while caseSensitiveFS=true is linear is, I hope, 100% confirmed. I’m not sure why that was disputed before: if we do a listdir per file, and the listdir time grows linearly with the number of files, and the number of files grows linearly with the number of files (doh), the result is quadratic.

Once we get “out of the noise”, doubling the number of files gives 4x larger scanning times with the default and 2x with the tweaked caseSensitiveFS=true.

I strongly hope this won’t be read as “who’s going to do 40 times the median number of files and then complain it’s slow”. The point is that taking 40x longer can be PERFECTLY acceptable (and achievable with a click, if only people knew about it), but taking 1600x longer isn’t fine (and a 40x speedup, a FACTOR that itself scales with the number of files, certainly isn’t “a small improvement in performance” as documented).

caseSensitiveFS=false (default)

0.37 0.37 0.36 4  
0.41 0.43 0.39 8  
0.48 0.46 0.45 16  
0.59 0.53 0.50 32  
0.86 0.66 0.69 64  
1.54 1.11 1.02 128  
2.81 1.72 1.71 256  
5.40 3.27 3.00 512  
11.12 6.44 6.29 1024  
25.39 13.61 13.65 2048  
65.69 33.00 33.61 4096  
172.38 89.56 90.38 8192  
574.08 286.18 291.38 16384  
2160.36 1038.34 1023.59 32768  
8421.55 3974.39 3974.03 65536  
37307.72 17700.22 17727.72 131072

caseSensitiveFS=true

1.16 1.17 1.16 4
1.26 1.20 1.25 8
1.24 1.27 1.22 16
1.41 1.40 1.28 32
1.74 1.54 1.51 64
2.16 1.79 1.80 128
3.12 2.40 2.32 256
5.04 3.58 3.47 512
8.90 5.86 5.93 1024
16.72 10.63 10.69 2048
32.19 19.87 19.96 4096
64.01 38.51 38.61 8192
130.92 75.80 75.34 16384
291.39 155.83 152.42 32768
595.60 312.69 316.36 65536
1247.23 660.10 651.00 131072
2594.75 1343.65 1353.86 262144 (this wasn't run for the slower scenario, as I expect it to take over 40 h for the first run alone, while here we're under 45 minutes)
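To put the scaling factors in concrete terms (a rough read of the last doubling step present in both tables, 65536 → 131072 listed files, first-run times):

37307.72 / 8421.55 ≈ 4.4x per doubling with caseSensitiveFS=false
1247.23 / 595.60 ≈ 2.1x per doubling with caseSensitiveFS=true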

@a24 could you test this PR with caseSensitiveFS=false? Note that it will only improve performance for ASCII-only paths, which might or might not be the case depending on your test data.
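The PR itself isn’t quoted here, but the general idea of an ASCII fast path can be sketched like this (a simplification, not the actual code in the PR): pure-ASCII names are already in NFC form, so normalization can be skipped for them.

package sketch

import "golang.org/x/text/unicode/norm"

// normalizeName skips Unicode normalization for pure-ASCII names, which are
// already in NFC form, and only pays the normalization cost for names that
// contain multi-byte runes.
func normalizeName(name string) string {
    if isASCII(name) {
        return name
    }
    return norm.NFC.String(name)
}

func isASCII(s string) bool {
    for i := 0; i < len(s); i++ {
        if s[i] >= 0x80 {
            return false
        }
    }
    return true
}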

I’ll be trying that, although it’s more complicated than it seems: the numbers above were for Android, and last time I tried to build something for Android it took some work, so for this patch I’m doing regular Linux tests for now, but I can’t get reproducible results yet even with a vanilla build. Here it just bogs down all over the place, and there can be a 20x jump in time when just doubling the number of files.
In contrast, nothing else was loaded during the Android runs, and everything ran like clockwork, probably paced precisely by whatever timing SAF (or whatever we’re using to access the files/listdir) allows.

I’m not sure whether I can’t edit my previous post or I just can’t find the button, but as I said, don’t take the numbers as absolute: there is a small error in the script, and the correct number of files for each line is the one on the line above it, i.e. all FILE NUMBERS should be halved. For example, where it says 2048 it should read 1024 (take the number from the line above, or just divide what you see by 2).

Dropping filesystem caches in between test runs might help to get more stable results:

echo 3 | sudo tee /proc/sys/vm/drop_caches