Go to file

sneak d479bfcd52 Adopt sneak.berlin/go/vaultik vanity import path, README overhaul

Module path changed from git.eeqj.de/sneak/vaultik to
sneak.berlin/go/vaultik (vanity redirect). All imports, ldflags,
Dockerfile, goreleaser config, and docs updated. App data/config
directories now use plain "vaultik" instead of the reverse-DNS name.

README:
- New copy-pasteable quickstart at top: go install, config init,
  age keypair, config set for key + file:// destination, home backup
- All command names in command details are code-quoted
- config set/get gained sequence index support (age_recipients.0)
  so lists are settable from the CLI
- Dockerfile build is CGO_ENABLED=0 to match the pure-Go build

2026-06-10 11:37:23 -07:00

.gitea/workflows

Add make check target and CI workflow (#42 )

2026-03-17 12:39:44 +01:00

cmd/vaultik

Adopt sneak.berlin/go/vaultik vanity import path, README overhaul

2026-06-10 11:37:23 -07:00

docs

Merge origin/main: resolve conflicts in CLI surface, --prune, helpers

2026-05-02 02:56:51 +02:00

internal

Adopt sneak.berlin/go/vaultik vanity import path, README overhaul

2026-06-10 11:37:23 -07:00

test

Remove daemon mode references and unused config fields

2026-05-01 06:19:50 +02:00

.dockerignore

Add make check target and CI workflow (#42 )

2026-03-17 12:39:44 +01:00

.gitignore

Fix --cron silence, add snapshot cleanup, fix .gitignore

2026-06-09 13:45:54 -04:00

.goreleaser.yaml

Adopt sneak.berlin/go/vaultik vanity import path, README overhaul

2026-06-10 11:37:23 -07:00

AGENTS.md

Refresh docs: remove PROCESS.md, fix snapshot ID format, document pre-1.0 migration policy

2026-05-01 07:07:18 +02:00

ARCHITECTURE.md

Consolidate docs: rewrite README as primary reference, remove TODO.md

2026-06-09 12:57:33 -04:00

CLAUDE.md

Fix FK constraint errors in batched file insertion

2025-12-19 19:48:48 +07:00

config.example.yml

Remove daemon mode references and unused config fields

2026-05-01 06:19:50 +02:00

Dockerfile

Adopt sneak.berlin/go/vaultik vanity import path, README overhaul

2026-06-10 11:37:23 -07:00

go.mod

Adopt sneak.berlin/go/vaultik vanity import path, README overhaul

2026-06-10 11:37:23 -07:00

go.sum

Fix CLI semantics: exit codes, --prune, dedup, deep-verify

2026-05-01 07:04:37 +02:00

LICENSE

Major refactoring: UUID-based storage, streaming architecture, and CLI improvements

2025-07-22 14:56:44 +02:00

Makefile

Adopt sneak.berlin/go/vaultik vanity import path, README overhaul

2026-06-10 11:37:23 -07:00

README.md

Adopt sneak.berlin/go/vaultik vanity import path, README overhaul

2026-06-10 11:37:23 -07:00

test-config.yml

Major refactoring: UUID-based storage, streaming architecture, and CLI improvements

2025-07-22 14:56:44 +02:00

README.md

vaultik (ваултик)

vaultik is an incremental backup tool written in Go. It encrypts data using an age public key and uploads each encrypted blob directly to a remote S3-compatible object store. It requires no private keys, secrets, or credentials (other than those required to PUT to encrypted object storage, such as S3 API keys) stored on the backed-up system.

quickstart

# install
go install sneak.berlin/go/vaultik/cmd/vaultik@latest

# create a default config file (prints the path it wrote to)
vaultik config init

# generate an age keypair; keep key.txt somewhere safe and offline —
# you need it to restore, and the backed-up machine does not need it
age-keygen -o key.txt
grep 'public key' key.txt

# configure the encryption key and backup destination
vaultik config set age_recipients.0 age1YOUR_PUBLIC_KEY_HERE
vaultik config set storage_url "file:///Volumes/usbstick/mybackup"

# back up your home directory (the default config includes a "home"
# snapshot of ~ with sensible excludes)
vaultik snapshot create

# see what you have
vaultik snapshot list

Features:

modern encryption (age, X25519 + XChaCha20-Poly1305)
content-defined chunking with deduplication (FastCDC)
incremental backups (only changed files are re-chunked)
multithreaded zstd compression at configurable levels
content-addressed immutable storage
local state tracking in SQLite (enables write-only incremental backups)
no mutable remote metadata
no plaintext file paths or metadata in remote storage
packs small files into large blobs (keeps S3 operation counts down)
backs up regular files, symlinks, empty directories, and file permissions
pluggable storage backends: S3, local filesystem, rclone (70+ providers)
pure Go (no CGO), cross-compiles to linux/darwin × amd64/arm64

why

Other backup tools like restic, borg, and duplicity are designed for environments where the source host can store secrets and has access to decryption keys. vaultik is for environments where you don't want to store backup decryption keys on your hosts — only public keys for encryption.

Requirements that no existing tool meets:

open source
no passphrases or private keys on the source host
incremental
compressed
encrypted
s3 compatible without an intermediate step or tool

daily use

# verify a snapshot (shallow: checks all blobs exist)
vaultik snapshot verify <snapshot-id>

# deep verify (downloads and cryptographically verifies every blob)
VAULTIK_AGE_SECRET_KEY='AGE-SECRET-KEY-...' vaultik snapshot verify --deep <snapshot-id>

# restore (requires the private key)
VAULTIK_AGE_SECRET_KEY='AGE-SECRET-KEY-...' vaultik restore <snapshot-id> /tmp/restored

# daily cron job: back up, keep a 4-week rolling window of snapshots
# 0 3 * * * vaultik snapshot create --cron --prune --keep-newer-than 4w

cli

commands

vaultik [--config <path>] config init
vaultik [--config <path>] config edit
vaultik [--config <path>] config get <key>
vaultik [--config <path>] config set <key> <value>
vaultik [--config <path>] snapshot create [snapshot-names...] [--cron] [--prune] [--keep-newer-than <duration>] [--skip-errors]
vaultik [--config <path>] snapshot list [--json]
vaultik [--config <path>] snapshot verify <snapshot-id> [--deep] [--json]
vaultik [--config <path>] snapshot purge [--keep-latest | --older-than <duration>] [--snapshot <name>...] [--force]
vaultik [--config <path>] snapshot remove <snapshot-id|--all> [--dry-run] [--force] [--remote] [--json]
vaultik [--config <path>] snapshot prune
vaultik [--config <path>] snapshot cleanup
vaultik [--config <path>] restore <snapshot-id> <target-dir> [paths...] [--verify]
vaultik [--config <path>] prune [--force] [--json]
vaultik [--config <path>] info
vaultik [--config <path>] remote info [--json]
vaultik [--config <path>] store info
vaultik [--config <path>] database purge [--force]
vaultik version

global flags

--config <path>: Path to config file (default: $VAULTIK_CONFIG, then platform config dir, then /etc/vaultik/config.yml)
--verbose, -v: Enable verbose output
--debug: Enable debug output
--quiet, -q: Suppress non-error output

environment variables

VAULTIK_AGE_SECRET_KEY: Age private key for decryption (required for restore and verify --deep)
VAULTIK_CONFIG: Path to config file (overridden by --config)
VAULTIK_INDEX_PATH: Override local SQLite index path

command details

config init: Write a default config file with commented explanations for every setting. Writes to the path from --config, $VAULTIK_CONFIG, or the platform config directory (~/Library/Application Support/vaultik/ on macOS, ~/.config/vaultik/ on Linux, /etc/vaultik/ as root). Refuses to overwrite an existing file. Created with mode 0600 since it will contain credentials.

config edit: Open the config file in $EDITOR (falls back to vi).

config get: Print a config value addressed by dotted YAML path (e.g. vaultik config get s3.bucket). Non-scalar values print as YAML.

config set: Set a scalar config value by dotted YAML path (e.g. vaultik config set compression_level 9). Comments and formatting in the file are preserved; intermediate maps are created as needed.

snapshot create: Perform incremental backup of configured snapshots.

Optional snapshot names argument to create specific snapshots (default: all)
--cron: Silent unless error (for crontab)
--prune: After backup, drop older snapshots of each backed-up name and remove orphaned blobs from remote storage. By default keeps only the latest snapshot per name; use --keep-newer-than for a rolling window.
--keep-newer-than <duration>: With --prune, keep snapshots newer than this duration instead of only the latest (e.g. 4w, 30d, 6mo, 1y)
--skip-errors: Skip file read errors (log them loudly but continue)

snapshot list: List all snapshots with their timestamps and sizes.

--json: Output in JSON format

snapshot verify: Verify snapshot integrity.

Default (shallow): checks that all blobs referenced in the manifest exist in storage
--deep: Downloads and decrypts each blob, verifies chunk hashes against the encrypted metadata database
--json: Output results as JSON

snapshot purge: Remove old snapshots based on criteria. Retention is per-snapshot-name (--keep-latest keeps the latest of each name, not the latest globally).

--keep-latest: Keep only the most recent snapshot of each name
--older-than <duration>: Remove snapshots older than duration (e.g. 30d, 6m, 1y)
--snapshot <name>: Restrict to specific snapshot names (repeat for multiple)
--force: Skip confirmation prompt

snapshot remove: Remove a specific snapshot from the local database.

--remote: Also remove snapshot metadata from remote storage
--all: Remove all snapshots (requires --force)
--dry-run: Show what would be deleted without deleting
--force: Skip confirmation prompt
--json: Output result as JSON

snapshot prune: Clean orphaned data from the local database (files, chunks, blobs not referenced by any snapshot).

snapshot cleanup: Remove stale local snapshot records that have no corresponding metadata in remote storage. These are typically left behind by incomplete or interrupted backups. Does not touch remote storage.

restore: Restore files from a backup snapshot.

Requires VAULTIK_AGE_SECRET_KEY environment variable
Optional path arguments to restore specific files/directories (default: all)
Preserves file permissions, timestamps, ownership (ownership requires root), symlinks, and empty directories
--verify: After restoring, verify every file's chunk hashes match

prune: Remove unreferenced blobs from remote storage.

Scans all snapshot manifests for referenced blobs, deletes any blob not referenced
--force: Skip confirmation prompt
--json: Output stats as JSON

info: Display system configuration, storage settings, encryption recipients, and local database statistics.

remote info: Show detailed remote storage information including per-snapshot metadata sizes, blob counts, and orphaned blob detection.

--json: Output as JSON

store info: Display storage backend type and statistics.

database purge: Delete the local SQLite state database entirely. Remote storage is unaffected; the next backup will do a full scan and re-deduplicate against existing remote blobs.

--force: Skip confirmation prompt

storage backends

vaultik supports three storage backends, selected via the storage_url config field:

S3 (s3://bucket/prefix?endpoint=host&region=us-east-1): Any S3-compatible object store. Credentials are read from s3.access_key_id and s3.secret_access_key in the config file.

Local filesystem (file:///path/to/backup): Stores blobs and metadata on a local or mounted filesystem. Useful for testing or backing up to a NAS.

Rclone (rclone://remote/path): Uses rclone's 70+ supported cloud providers. Requires rclone to be configured separately (rclone config).

Legacy S3 configuration via s3.* fields (endpoint, bucket, prefix, etc.) is still supported for backward compatibility. storage_url takes precedence if both are set.

architecture

remote storage layout

<bucket>/<prefix>/
├── blobs/
│   └── <aa>/<bb>/<full_blob_hash>
└── metadata/
    └── <snapshot_id>/
        ├── db.zst.age          # Encrypted binary SQLite database
        └── manifest.json.zst   # Unencrypted blob list (for pruning)

Blobs are two-level directory sharded using the first 4 hex chars of the blob hash
db.zst.age is a binary SQLite database (zstd compressed, age encrypted) containing all file metadata, chunk mappings, and relationships for the snapshot
manifest.json.zst is an unencrypted compressed JSON blob list, enabling pruning without the private key

Snapshot IDs follow the format <hostname>_<snapshot-name>_<RFC3339-timestamp> (e.g. server1_home_2025-06-01T12:00:00Z).

data flow

backup:

Open local SQLite index, load known files and chunks into memory
Walk source directories, compare mtime/size/mode against index
For changed/new files: chunk using content-defined chunking (FastCDC)
For symlinks and directories: record metadata (no chunking)
For each chunk: hash, check dedup, add to blob packer
When blob reaches size threshold: compress (zstd), encrypt (age), upload
Build snapshot metadata database, compress, encrypt, upload
Create unencrypted blob manifest for pruning support

restore:

Download and decrypt metadata/<snapshot_id>/db.zst.age
Open the binary SQLite database
Query files (optionally filtered by paths)
Download and decrypt required blobs
Extract chunks, reconstruct files
Restore permissions, timestamps, ownership, symlinks

prune:

List all snapshot manifests
Build set of all referenced blob hashes
List all blobs in storage
Delete any blob not in the referenced set

chunking and deduplication

Content-defined chunking using the FastCDC algorithm
Average chunk size: configurable (default 10MB)
Deduplication at file level (unchanged files skipped) and chunk level (identical chunks across files stored once)
Multiple chunks packed into blobs to reduce object count

encryption

Asymmetric encryption using age (X25519 + XChaCha20-Poly1305)
Only the public key is needed on the source host
Each blob and each metadata database is encrypted independently
Multiple recipients supported (encrypt to multiple keys)

compression

zstd compression at configurable level (1-19, default 3)
Applied before encryption at the blob level

configuration reference

Run vaultik config init to generate a fully commented config file. Key fields:

Field	Default	Description
`age_recipients`	(required)	Age public keys for encryption
`snapshots`	(required)	Named snapshot definitions with paths and excludes
`storage_url`		Storage backend URL (`s3://`, `file://`, `rclone://`)
`s3.*`		Legacy S3 configuration (endpoint, bucket, credentials)
`exclude`		Global exclude patterns (applied to all snapshots)
`chunk_size`	`10MB`	Average chunk size for content-defined chunking
`blob_size_limit`	`10GB`	Maximum blob size before splitting
`compression_level`	`3`	zstd compression level (1-19)
`hostname`	system hostname	Hostname used in snapshot IDs
`index_path`	platform data dir	Local SQLite index path

limitations

No extended attributes (xattrs). ACLs, macOS Finder metadata, quarantine flags, SELinux labels, and other extended attributes are not backed up or restored.
No hard link detection. Two hard links to the same inode are backed up as independent files. Content deduplication means the data is stored once, but the hard link relationship is lost on restore.
No sparse file support. Sparse files are fully materialized during backup. A 100 GB sparse VM disk that is mostly zeros will consume the full (compressed) size in storage.
No bandwidth limiting. Uploads and downloads use whatever bandwidth is available. There is no --bwlimit flag yet.
No parallel blob downloads during restore. Blobs are fetched sequentially. Restore speed is bound by single-stream throughput.
Device nodes, named pipes, and sockets are silently skipped. Only regular files, directories, and symlinks are backed up.
No database migrations. If the local SQLite schema changes between versions, delete the local database (vaultik database purge) and run a full backup. Remote storage is unaffected.
Files that change during backup may be inconsistent. There is no filesystem snapshot or freeze. If a file is modified between the scan and chunk phases, the backed-up copy may reflect a partial write.
Ownership restoration requires root. File uid/gid are recorded and restored, but chown requires elevated privileges. Without root, files are restored with the current user's ownership.

roadmap

Items for future releases:

Error-condition tests (network failures, disk full, corrupted/missing blobs)
Parallel blob downloads during restore
Bandwidth limiting (--bwlimit)
Security audit of encryption implementation
Man pages and richer --help examples

requirements

Go 1.26 or later
S3-compatible object storage (or local filesystem, or rclone remote)

development workflow

All changes follow this workflow. No exceptions.

Create a feature branch off main.
Write tests.
Write the implementation.
Fix implementation errors until it compiles and tests pass.
Fix linting errors (make lint).
Update documentation and README as required by the change.
Format code (make fmt).
Run make check (lint + fmt-check + test). Fix any issues. Repeat until clean.
Commit on the branch.
Merge to main.
Push.

Do not commit directly to main. Do not skip steps.

Repository policies for AI agents are in AGENTS.md.

license

MIT

author

Made with love and lots of expensive SOTA AI by sneak in Berlin in the summer of 2025.

Released as a free software gift to the world, no strings attached.

Contact: sneak@sneak.berlin

https://keys.openpgp.org/vks/v1/by-fingerprint/5539AD00DE4C42F3AFE11575052443F4DF2A55C2

README.md Unescape Escape

vaultik (ваултик)

quickstart

why

daily use

cli

commands

global flags

environment variables

command details

storage backends

architecture

remote storage layout

data flow

chunking and deduplication

encryption

compression

configuration reference

limitations

roadmap

requirements

development workflow

license

author

README.md