sneak/pixa - pixa - git.eeqj.de

Author	SHA1	Message	Date
clawbot	55a609dd77	Bound imageprocessor.Process input read to prevent unbounded memory use (#37) closes #31 ## Problem `ImageProcessor.Process` used `io.ReadAll(input)` without any size limit, allowing arbitrarily large inputs to exhaust all available memory. This is a DoS vector: even though the upstream fetcher has a `MaxResponseSize` limit (50 MiB), the processor interface accepts any `io.Reader` and should defend itself independently. Additionally, the service layer's `processFromSourceOrFetch` read cached source content with `io.ReadAll` without a bound, so an unexpectedly large cached file could also cause unbounded memory consumption. ## Changes ### Processor (`processor.go`) - Added `maxInputBytes` field to `ImageProcessor` (configurable, defaults to 50 MiB via `DefaultMaxInputBytes`) - `NewImageProcessor` now accepts a `maxInputBytes` parameter (0 or negative uses the default) - `Process` now wraps the input reader with `io.LimitReader` and rejects inputs exceeding the limit with `ErrInputDataTooLarge` - Added `DefaultMaxInputBytes` and `ErrInputDataTooLarge` exported constants/errors ### Service (`service.go`) - `NewService` now wires the fetcher's `MaxResponseSize` through to the processor - Extracted `loadCachedSource` helper method to flatten nesting in `processFromSourceOrFetch` - Cached source reads are now bounded by `maxResponseSize`; oversized cached files are discarded and re-fetched ### Tests (`processor_test.go`) - `TestImageProcessor_RejectsOversizedInputData`: verifies that inputs exceeding `maxInputBytes` are rejected with `ErrInputDataTooLarge` - `TestImageProcessor_AcceptsInputWithinLimit`: verifies that inputs within the limit are processed normally - `TestImageProcessor_DefaultMaxInputBytes`: verifies that 0 and negative values use the default - All existing tests updated to use `NewImageProcessor(0)` (default limit) Co-authored-by: user <user@Mac.lan guest wan> Co-authored-by: clawbot <clawbot@eeqj.de> Reviewed-on: #37 Co-authored-by: clawbot <clawbot@noreply.example.org> Co-committed-by: clawbot <clawbot@noreply.example.org>	2026-03-20 07:01:15 +01:00
sneak	ce6db7627d	fix: resolve all golangci-lint errors - Add blank lines before return statements (nlreturn) - Remove unused metaCacheMu field and sync import (unused) - Rename unused groups parameter to _ (revive) - Use StorageFilePerm constant instead of magic 0600 (mnd, gosec) - Add nolint directive for vipsOnce global (gochecknoglobals)	2026-02-25 19:58:37 +07:00
clawbot	40c4b53b01	fix: propagate AllowHTTP to SourceURL() scheme selection SourceURL() previously hardcoded https:// regardless of the AllowHTTP config setting. This made testing with HTTP-only test servers impossible. Add AllowHTTP field to ImageRequest and use it to determine the URL scheme. The Service propagates the config setting to each request. Fixes #1	2026-02-08 16:34:42 -08:00
Jeffrey Paul	3d857da237	Merge branch 'main' into fix/issue-3	2026-02-09 01:32:17 +01:00
clawbot	e651e672aa	fix: check negative cache in Service.Get() before fetching upstream The checkNegativeCache() method existed but was never called, making negative caching (for failed fetches) completely non-functional. Failed URLs were being re-fetched on every request. Add negative cache check at the start of Service.Get() to short-circuit requests for recently-failed URLs. Fixes #3	2026-02-08 16:02:33 -08:00
clawbot	79ceed2ee4	fix: guard against division by zero when fetchBytes is 0 processAndStore() computed sizePercent as outputSize/fetchBytes*100 without checking for zero, producing Inf/NaN in logs and metrics. Also treat empty cached source data the same as missing (re-fetch from upstream) since zero-byte images can't be processed. Fixes #5	2026-02-08 15:59:51 -08:00
sneak	be293906bc	Add type-safe hash types for cache storage Define ContentHash, VariantKey, and PathHash types to replace raw strings, providing compile-time type safety for storage operations. Update storage layer to use typed parameters, refactor cache to use variant storage keyed by VariantKey, and implement source content reuse on cache misses.	2026-01-08 16:55:20 -08:00
sneak	3849128c45	Remove runtime nil checks for always-initialized components Since signing_key is now required at config load time, sessMgr, encGen, and signer are always initialized. Remove unnecessary nil checks that were runtime failure paths that can no longer be reached. - handlers.go: Remove conditional init, always create sessMgr/encGen - auth.go: Remove nil checks for sessMgr - imageenc.go: Remove nil check for encGen - service.go: Require signing_key in NewService, remove signer nil checks - Update tests to provide signing_key	2026-01-08 15:58:44 -08:00
sneak	77c6744383	Add upstream connection info and download metrics to logging - Capture TLS version, cipher suite, HTTP version, and remote addr - Add download bitrate using go-humanize SI formatting - Use consistent WxH format for dimensions (not struct notation) - Rename input/output to src/dst for consistency - Add separate "upstream fetched" log with connection details	2026-01-08 12:47:31 -08:00
sneak	15d9439e3d	Add fetch/conversion metrics and improve logging FetchResult now includes: - StatusCode: HTTP status from upstream - FetchDurationMs: time to fetch from upstream - RemoteAddr: upstream server address SourceMetadata now stores: - ContentLength: size from upstream - FetchDurationMs: fetch timing - RemoteAddr: for debugging Image conversion log now includes: - host: source hostname (was missing) - path: source path (renamed from file) - convert_ms: image processing time - quality: requested quality setting - fit: requested fit mode	2026-01-08 12:34:26 -08:00
sneak	b233871241	Add detailed logging for image conversions on cache miss Log includes: - file path - input/output format - input/output size in bytes - input/output dimensions - size ratio (percentage) Also adds InputWidth, InputHeight, InputFormat to ProcessResult	2026-01-08 10:44:34 -08:00
sneak	1f809a6fc9	Implement ETag, HEAD requests, and conditional requests - Add ETag generation based on output content hash (first 16 chars) - Add ContentLength to ImageResponse from cache - Add LoadWithSize method to ContentStorage - Add GetOutputWithSize method to Cache - Handle HEAD requests returning headers only - Handle If-None-Match conditional requests returning 304 - Register HEAD route for image proxy endpoint	2026-01-08 10:08:38 -08:00
sneak	2cbafe374c	Add mock fetcher and service tests for imgcache Introduces Fetcher interface, mock implementation for testing, and ApplyMigrations helper for test database setup.	2026-01-08 07:39:18 -08:00
sneak	6304556837	Refactor to serve all responses from cached files on disk - StoreOutput now returns output hash for immediate retrieval - Cache misses now serve from disk file after storing (same as hits) - Log served_bytes from actual io.Copy result (avoids stat calls) - Remove ContentLength field usage for cache hits (stream from file) - Fix tests to properly check all return values	2026-01-08 05:11:55 -08:00
sneak	1a97f42cd8	Add detailed logging for image requests with cache status and timing	2026-01-08 05:04:08 -08:00
sneak	fd2d108f9c	Wire up image handler endpoint with service orchestration - Add image proxy config options (signing_key, whitelist_hosts, allow_http) - Create Service to orchestrate cache, fetcher, and processor - Initialize image service in handlers OnStart hook - Implement HandleImage with URL parsing, signature validation, cache - Implement HandleRobotsTxt for search engine prevention - Parse query params for signature, quality, and fit mode	2026-01-08 04:01:53 -08:00

16 Commits