
Generate Perceptual Hashes for Artwork Variants
Source:R/create-artwork-hash.R
createArtworkHash.RdCreates perceptual image hashes (pHash) for all artwork variants to enable duplicate detection and similarity search across the platform. Hashes are stored in artwork_hash table.
Value
data.table with columns: art_hash, art_binary, artist_uuid, art_uuid, variant_id, added_utc. Returns NULL if no variants found in CDN.
data.table with columns: art_hash, art_binary, artist_uuid, art_uuid, variant_id, added_utc. Returns NULL if no variants found.
Details
Perceptual hashing generates fingerprints that remain similar for visually similar images, unlike cryptographic hashes. This enables finding: - Exact duplicates (same artwork uploaded twice) - Near-duplicates (cropped, resized, or recompressed versions) - Potentially plagiarized works
See also
[artpixeltrace::image_phash()] for the hashing algorithm
Other pipeline functions:
build-record,
commit_artwork(),
create_gallery_images(),
frame-analytics,
frame-metrics,
render_certificate(),
validate_table_list()