feat: add symlinks capabilities for S3-compatible systems, and fix multipart copy source-size handling by alexsavio · Pull Request #1 · dectris-cloud/geesefs

alexsavio · 2026-01-22T12:11:50Z

https://dectris.atlassian.net/wiki/spaces/DCSDN/pages/1423114306/ADR0015+Symlinks+support+for+GeeseFS+against+AWS+S3

…n save failure updateSymlinksFile() was mutating the shared symlinksCache in place before confirming the S3 save succeeded. If SaveSymlinksFileWithRetry failed, the in-memory cache would contain changes never persisted, causing phantom symlinks or missed deletions. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

loadSymlinksCache() and updateSymlinksFile() were holding parent.mu during S3 network calls, blocking all concurrent FUSE operations on the same directory. Follow the existing GeeseFS convention (loadListing, listObjectsFlat) of temporarily releasing the lock during I/O. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

loadSymlinksCache() was issuing a conditional GET on every call with no TTL guard. An ls -la on 100 files could trigger 100+ S3 requests for the same symlinks file. Now skips the network call if the cache was loaded within StatCacheTTL. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

The --symlink-attr flag was accidentally removed when adding the new --enable-symlinks-file flags. This left SymlinkAttr as "" for all CLI-launched mounts, breaking symlink detection via the old metadata path. Restore the flag and wire it back into PopulateFlags. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

isNotExist, isNotModified, and isPreconditionFailed were using strings.Contains on error messages (e.g., "404", "412"), which could false-match unrelated errors. Now check awserr.RequestFailure status codes first, with tight fallbacks for non-AWS backends. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

The previous commit changed error detection to use awserr.RequestFailure status codes, but existing tests only exercised the fallback paths via the mock backend. Add tests covering the real S3 paths (awserr typed errors) and verify no false positives on unrelated error messages. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Renaming a virtual symlink now properly removes the entry from the source directory's .geesefs_symlinks and adds it to the destination's, ensuring cross-mount visibility. Adds docker integration tests for both cases. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

… symlinks The previous check (userMetadata[SymlinkAttr] != nil) could not distinguish virtual symlinks (stored in .geesefs_symlinks, no S3 object) from S3-backed symlinks (created via the old metadata path). This could cause orphaned S3 objects on unlink or incorrect behavior on rename. Adds isVirtualSymlink bool to Inode, set at all virtual symlink creation sites and checked in Unlink, Rename, and listing cleanup paths.

Extract newVirtualSymlinkInode() and applyVirtualSymlinkAttrs() helpers to replace 9 duplicate virtual symlink creation/update blocks.

Prevents a race where another mount adds a symlink between our empty check and the delete. The If-Match conditional write fails with 412 if the file was modified, letting the retry logic merge correctly.

Avoids map lookups, mutex lock, and string formatting on every file lookup when debug logging is disabled.

Neither the struct nor the function were referenced anywhere.

Randomizes sleep duration in [backoff/2, backoff) to avoid thundering herd when multiple mounts contend on the same symlinks file.

Replace json.MarshalIndent with json.Marshal to reduce S3 storage and bandwidth for .geesefs_symlinks files.

…ability Use an explicit capability flag instead of hardcoded backend name matching to determine conditional write support.

Add tests for corrupted symlinks file loading, recovery after corruption, missing symlinks field, and forward version compatibility.

When a file grows beyond its original S3 size, copyUnmodifiedParts was clipping part ranges to Attributes.Size (the new local size) instead of knownSize (the source S3 object size). This caused S3 to return 400 errors for copy ranges exceeding the source object bounds.

Each symlink create/delete previously triggered an immediate S3 PUT to .geesefs_symlinks. Creating 10 symlinks = 10 PUTs. This batches them into a single PUT per directory using a configurable delay timer (--symlinks-batch-delay, default 100ms). The in-memory cache is updated eagerly so local readers see changes immediately, but the S3 PUT is deferred. Uses per-directory time.AfterFunc timers matching the existing ScheduleRetryFlush pattern. Flush is triggered by: timer expiry, sync/fsync, or unmount. On conflict, the merge function replays all batched changes. On failure, changes are re-queued for retry.

completeMultipart() was reading Attributes.Size after its goroutine re-acquired inode.mu, but concurrent writes could extend the file in the gap between goroutine launch and lock acquisition. This caused finalSize to include parts not yet uploaded, leading to knownSize being set larger than the actual S3 object after CompleteMultipartUpload (which skips nil parts). The wrong knownSize then triggered spurious conflict detection and EINVAL errors on subsequent flushes. Capture finalSize in the caller while the lock is held and canComplete has verified all dirty parts are flushed.

knownSize grows as parts are flushed via updateFromFlush(), so by the time copyUnmodifiedParts runs it no longer reflects the actual S3 source object size. Capture knownSize at MPU begin as mpuSourceSize and clip copy ranges to that instead, preventing HTTP 400 (InvalidArgument) when copying ranges beyond the source object.

The previous approach (mpuSourceSize captured from knownSize at MPU begin) fails when knownSize diverges from the actual S3 object size. This happens when updateFromFlush sets knownSize to the local file size after a completed MPU, but the resulting S3 object is smaller (e.g. due to clipped copies in a prior cycle). The next MPU then uses the wrong size for range clipping, causing S3 400 InvalidArgument. Instead, do a single HeadBlob at the start of copyUnmodifiedParts to get the ground-truth source object size from S3, and clip copy ranges to that. This adds one HEAD request per flush cycle that needs copies — negligible compared to the UploadPartCopy operations that follow.

Track only directories with pending symlink changes for sync/shutdown flushes and remove redundant fallocate extension zero-fill path. Co-Authored-By: Warp <agent@warp.dev>

Capture finalSize under lock, use HeadBlob source-size validation, and fail fast on impossible unmodified-copy ranges to avoid copy thrash. Co-Authored-By: Warp <agent@warp.dev>

Co-Authored-By: Warp <agent@warp.dev>

alexsavio force-pushed the symlinks branch 2 times, most recently from 92afd1c to ca440fb Compare January 27, 2026 10:20

alexsavio changed the base branch from master to dev February 19, 2026 13:22

alexsavio changed the title ~~Symlinks~~ feat: add symlinks capabilities for S3-compatible systems, and fix multipart copy source-size handling Feb 19, 2026

alexsavio force-pushed the dev branch from a67cd58 to 5c0b07c Compare February 23, 2026 12:12

alexsavio force-pushed the symlinks branch from f69b6f5 to 2622687 Compare February 23, 2026 12:17

alexsavio and others added 24 commits February 23, 2026 13:20

feat: add .symlinks file support for AWS S3 compatibility

defb5c0

fixes

2b79a9d

restrict symlinks to s3-compatible systems

439ebaf

rename .symlinks to .geesefs_symlinks

e3b3931

fixes

7780cda

docs and tests

a8fea61

fixes

dc145dc

hide symlink files

f623a78

fixes

eedded4

update doc

7f7e865

split tests

bcd330f

add test for direct symlink access

ed56850

refactor: extract virtual symlink helpers to reduce duplication

03d2732

Extract newVirtualSymlinkInode() and applyVirtualSymlinkAttrs() helpers to replace 9 duplicate virtual symlink creation/update blocks.

fix: use conditional write before deleting empty symlinks file

ab67ceb

Prevents a race where another mount adds a symlink between our empty check and the delete. The If-Match conditional write fails with 412 if the file was modified, letting the retry logic merge correctly.

perf: guard debug logging in LookUpCached with level check

621eeca

Avoids map lookups, mutex lock, and string formatting on every file lookup when debug logging is disabled.

cleanup: remove dead code (SymlinksFileCache, isSymlinkFromCache)

c43f554

Neither the struct nor the function were referenced anywhere.

alexsavio and others added 14 commits February 23, 2026 13:20

fix: add jitter to retry backoff in SaveSymlinksFileWithRetry

3965b0c

Randomizes sleep duration in [backoff/2, backoff) to avoid thundering herd when multiple mounts contend on the same symlinks file.

perf: use compact JSON for symlinks file serialization

ed4ee63

Replace json.MarshalIndent with json.Marshal to reduce S3 storage and bandwidth for .geesefs_symlinks files.

refactor: replace IsS3Compatible() with SupportsConditionalWrites cap…

83e7ec2

…ability Use an explicit capability flag instead of hardcoded backend name matching to determine conditional write support.

test: add cache corruption recovery and edge case tests

0071597

Add tests for corrupted symlinks file loading, recovery after corruption, missing symlinks field, and forward version compatibility.

fix: harden UploadPartCopy range handling

933da63

fix: zero-fill fallocate extensions

bfcfab4

perf: optimize symlink flush tracking and fallocate path

d1ae566

Track only directories with pending symlink changes for sync/shutdown flushes and remove redundant fallocate extension zero-fill path. Co-Authored-By: Warp <agent@warp.dev>

fix: harden multipart copy source-size handling

ad04a74

Capture finalSize under lock, use HeadBlob source-size validation, and fail fast on impossible unmodified-copy ranges to avoid copy thrash. Co-Authored-By: Warp <agent@warp.dev>

fix: restore fallocate extension zero-fill

cb9d086

Co-Authored-By: Warp <agent@warp.dev>

alexsavio force-pushed the symlinks branch from 2622687 to cb9d086 Compare February 23, 2026 12:22

chore: set version number for this release

ae9d669

alexsavio merged commit 3a92537 into dev Mar 2, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add symlinks capabilities for S3-compatible systems, and fix multipart copy source-size handling#1

feat: add symlinks capabilities for S3-compatible systems, and fix multipart copy source-size handling#1
alexsavio merged 39 commits into
devfrom
symlinks

alexsavio commented Jan 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

alexsavio commented Jan 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant