rbv/migrations at 6dd99e3b0ade98307883f7263fb4a1fed1752e6c - rbv

Add image captioning service with auto-download from HuggingFace

New standalone rbv-caption binary that generates image captions using
ONNX models. Fetches images via CDN, writes captions to a new captions
table, and integrates with search (both quick and advanced modes).

Supported models:
- vit-gpt2: ViT encoder + GPT-2 decoder (auto-download from Xenova)
- florence-2-base: Florence-2 4-stage pipeline using fine-tuned variant
  from onnx-community (auto-download)
- blip-base, git-base: manual ONNX export required

Key implementation details:
- Florence-2 task tokens are natural language prompts, not special tokens
- Uses non-merged decoder ONNX models (no KV cache) for simplicity
- Systemd template unit for deploying multiple models concurrently
- Deploy script targets quadbrat for GPU inference
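
To illustrate the point about Florence-2 task tokens, here is a minimal Python sketch of expanding a task token into the natural-language prompt that is then tokenized. The prompt strings are an assumption recalled from the Hugging Face Florence-2 processor, and `FLORENCE2_TASK_PROMPTS` and `expand_task_token` are hypothetical names; this is not the code in rbv-caption.

```python
# Assumed mapping, recalled from the Hugging Face Florence-2 processor.
# Verify the exact strings against the processor source before relying on them.
FLORENCE2_TASK_PROMPTS = {
    "<CAPTION>": "What does the image describe?",
    "<DETAILED_CAPTION>": "Describe in detail what is shown in the image.",
    "<MORE_DETAILED_CAPTION>": "Describe with a paragraph what is shown in the image.",
}


def expand_task_token(task: str) -> str:
    """Return the natural-language prompt to tokenize for a task token.

    Hypothetical helper: the task token itself is never fed to the
    tokenizer; only the expanded sentence is.
    """
    try:
        return FLORENCE2_TASK_PROMPTS[task]
    except KeyError:
        raise ValueError(f"unsupported Florence-2 task: {task!r}") from None
```

Passing the literal string `<CAPTION>` to the tokenizer would most likely split it into ordinary sub-word pieces, which is why the expansion step matters.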

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
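
The "no KV cache" choice mentioned above can be sketched as a greedy decoding loop that re-runs the decoder on the whole prefix at every step. This is a Python illustration under stated assumptions: `DecoderStep`, `greedy_decode`, and `toy_step` are hypothetical names, and the decoder is a stand-in function rather than a real ONNX session.

```python
from typing import Callable, List, Sequence

# Hypothetical interface: given every token id generated so far, return the
# logits for the next token. A real pipeline would wrap an ONNX decoder
# session (plus the encoder hidden states) behind a function like this.
DecoderStep = Callable[[Sequence[int]], Sequence[float]]


def greedy_decode(
    step: DecoderStep, bos_id: int, eos_id: int, max_new_tokens: int = 32
) -> List[int]:
    """Greedy decoding without a KV cache.

    Each iteration feeds the full prefix back into the decoder, so no
    past key/value tensors have to be threaded between calls.
    """
    tokens = [bos_id]
    for _ in range(max_new_tokens):
        logits = step(tokens)  # full prefix re-run, nothing cached
        next_id = max(range(len(logits)), key=logits.__getitem__)
        if next_id == eos_id:
            break
        tokens.append(next_id)
    return tokens[1:]  # drop the BOS token


def toy_step(tokens: Sequence[int]) -> List[float]:
    """Stand-in decoder over a 5-token vocabulary: predicts last token + 1."""
    vocab_size = 5
    target = min(tokens[-1] + 1, vocab_size - 1)
    return [1.0 if i == target else 0.0 for i in range(vocab_size)]


if __name__ == "__main__":
    print(greedy_decode(toy_step, bos_id=0, eos_id=4))  # prints [1, 2, 3]
```

The trade-off is that each step re-processes the entire prefix, so the work grows roughly quadratically with caption length. For short captions that is usually tolerable, and it avoids managing separate cached and uncached decoder graphs.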

2026-04-07 08:45:31 +03:00

0001_extensions.sql

Initial commit: rbv workspace with ingest, API, UI, and ML client

2026-03-22 16:51:50 +02:00

0002_galleries.sql

Initial commit: rbv workspace with ingest, API, UI, and ML client

2026-03-22 16:51:50 +02:00

0003_images.sql

Initial commit: rbv workspace with ingest, API, UI, and ML client

2026-03-22 16:51:50 +02:00

0004_clip.sql

Initial commit: rbv workspace with ingest, API, UI, and ML client